1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
|
= link_to "Back to model creation", '/create'
%p
Input files have two columns. Enter in the first column the chemical structure in
%a{:href => "http://en.wikipedia.org/wiki/Simplified_molecular_input_line_entry_specification"} SMILES
format, in the second column the toxic activity. The first line contains a description of the columns and is ignored.
%dl
%dt Classification datasets
%dd Please use 1/0, active/inactive or true/false to indicate active/inactive compounds.
%dt Regression datasets
%dd
Enter a quantitative value. For optimal performance you should
%ul
%li
use
%a{:href => "http://en.wikipedia.org/wiki/Molar_(concentration)"} molar
units
%li enter non-logarithmic values (logarithms are taken internally)
%li avoid 0 activities (will be ignored)
%p
Input files are accepted in
%a{:href => "http://en.wikipedia.org/wiki/Microsoft_Excel"} Excel
and
%a{:href => "http://en.wikipedia.org/wiki/Comma-separated_values"} CSV
formats.
%h3 Excel example
- n = 0
.code
%table
%tr
%td
%th A
%th B
%tr
- n += 1
%th= n
%td SMILES
%td Hamster Carcinogenicity
%tr
- n += 1
%th= n
%td CC(=O)Nc1ccc(O)cc1
%td 1
%tr
- n += 1
%th= n
%td O=c1[nH]cnc2[nH]ncc12
%td 1
%tr
- n += 1
%th= n
%td CCCCNc1cc(cc(c1Oc2ccccc2)S(=O)(=O)N)C(=O)O
%td 1
%tr
- n += 1
%th= n
%td CC(C)(C)NCC(O)COc1cccc2NC(=O)CCc12
%td 1
%tr
- n += 1
%th= n
%td CN(C)CCCC1(OCc2cc(C#N)ccc21)c3ccc(F)cc3
%td 1
%tr
- n += 1
%th= n
%td CCC(CC)CCN1C(=O)CN=C(C2CCCCC2F)c3cc(Cl)ccc13
%td 0
%tr
- n += 1
%th= n
%td CCN(CC)CC(=O)Nc1c(C)cccc1C
%td 0
%tr
- n += 1
%th= n
%td CC(C)(C)NCC(O)COc1cccc2CC(O)C(O)Cc12
%td 0
%tr
- n += 1
%th= n
%td CN1CCCC1c2cccnc2
%td 0
%p
Excel example file for download:
= link_to "hamster_carcinogenicity.xls", "/hamster_carcinogenicity.xls"
%h3 CSV example
.code
%code
%br SMILES, Hamster Carcinogenicity
%br CC(=O)Nc1ccc(O)cc1, 1
%br O=c1[nH]cnc2[nH]ncc12, 1
%br CCCCNc1cc(cc(c1Oc2ccccc2)S(=O)(=O)N)C(=O)O, 1
%br CC(C)(C)NCC(O)COc1cccc2NC(=O)CCc12, 1
%br CN(C)CCCC1(OCc2cc(C#N)ccc21)c3ccc(F)cc3, 1
%br CCC(CC)CCN1C(=O)CN=C(C2CCCCC2F)c3cc(Cl)ccc13, 0
%br CCN(CC)CC(=O)Nc1c(C)cccc1C, 0
%br CC(C)(C)NCC(O)COc1cccc2CC(O)C(O)Cc12, 0
%br CN1CCCC1c2cccnc2, 0
%p
CSV example for download:
= link_to "hamster_carcinogenicity.csv", "/hamster_carcinogenicity.csv"
%p You can create CSV files in Excel: Create a sheet with two columns and export them as CSV file with the "Save As" option from the menu, selecting the CSV (comma delimited) format.
|