Dear Kevin,
I have had a play with the tool and I am very impressed. It is a very powerful, free/open spreadsheet tool that can do a similar job to expensive cheminformatics software. While I have not tried large datasets, I am sure the program will be particularly useful for academics and students, and I will certainly try to incorporate it into my lectures/labs. Some thoughts I have are:
(1) Would it be possible to output the results from the descriptor calculation as numerical data? I notice that currently the results are written as both text and numerical values in the same cell (e.g. ALOGP gives "ALogP: 0.0; ALogp2: 0.0; AMR: 0.0"). Could this be split over 3 columns so that no additional manipulation is needed before the values can be plotted, for example?
(2) Would it be possible to add an option to calculate all descriptors or a basic set of descriptors? At present I think one needs to manually select them all.
(3) Looking to the future, would it be very difficult to create the same functionality as an Excel add-in, with a standalone menu option on the menu bar (like the ChemAxon software)? Having to alter the VBA security setting to get the software to work initially is no more difficult than adding a custom add-in to Excel.
PS: in Excel 2007 one appears to need to save the workbook as a .xlsm file to allow the macros to be saved in the spreadsheet. Otherwise the functionality is not present when one reopens the spreadsheet.
| ID | Smiles | ALOGP:ALogP | ALOGP:ALogp2 | ALOGP:AMR | ALOGP | activity |
| 1 | c1ccccc1 | 0 | 0 | 0 | ALogP: 0.0; ALogp2: 0.0; AMR: 0.0 | 5 |
| 2 | c1cccnc1 | 0 | 0 | 0 | ALogP: 0.0; ALogp2: 0.0; AMR: 0.0 | 6 |
| 3 | CCCCCN | -1.551 | 2.405601 | 23.5678 | ALogP: -1.5510000000000002; ALogp2: 2.4056010000000003; AMR: 23.567800000000002 | 4 |
| 4 | CO(=O)CCCN | -0.9357 | 0.875534 | 23.6977 | ALogP: -0.9356999999999999; ALogp2: 0.8755344899999997; AMR: 23.697700000000005 | 2 |
| 5 | c1ccccc1CCC(=O)O | -0.1426 | 0.020335 | 16.6339 | ALogP: -0.14260000000000025; ALogp2: 0.020334760000000073; AMR: 16.6339 | 7 |
| 6 | c1ccccn1CC | 0.3659 | 0.133883 | 12.8885 | ALogP: 0.3659000000000002; ALogp2: 0.13388281000000016; AMR: 12.8885 | 8 |
| 7 | NCC | -0.3972 | 0.157768 | 14.2126 | ALogP: -0.3971999999999998; ALogp2: 0.15776783999999983; AMR: 14.2126 | 9 |
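For anyone who wants to do the split outside the spreadsheet in the meantime, here is a minimal sketch in Python of how a combined cell like the ALOGP column above could be turned into separate numeric columns. It assumes the cell always has the form "Name: value; Name: value; ..."; the function name `split_descriptor_cell` and the `prefix` argument are only illustrative and are not part of the tool.

```python
def split_descriptor_cell(cell, prefix=""):
    """Split a 'Name: value; Name: value' string into a dict of floats.

    Assumes the layout seen in the ALOGP column above; other descriptors
    may format their output differently.
    """
    values = {}
    for part in cell.split(";"):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing semicolon
        name, _, raw = part.partition(":")
        values[prefix + name.strip()] = float(raw)
    return values


# Cell content taken from row 3 (CCCCCN) of the table above.
cell = "ALogP: -1.5510000000000002; ALogp2: 2.4056010000000003; AMR: 23.567800000000002"
columns = split_descriptor_cell(cell, prefix="ALOGP:")

for name, value in columns.items():
    print(name, round(value, 6))
```

Each key then corresponds to one column header (ALOGP:ALogP, ALOGP:ALogp2, ALOGP:AMR), and the values are plain numbers that can be plotted directly.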
> Kevin,
>
> Better not use ALogP at all. Back at the list you could find reports
> it is not performing well. I think it was decided to be removed /
> deprecated.
>
Here is the comparison I did some time ago. LogKow is an experimental value from an ECOSAR training set.
XLogP is much better http://tinyurl.com/d6belhs
That particular report was on the OpenTox dev list, not on the CDK list, sorry.
Regards,
Nina