fbpx
Wikipedia

Quantitative structure–activity relationship

Quantitative structure–activity relationship models (QSAR models) are regression or classification models used in the chemical and biological sciences and engineering. Like other regression models, QSAR regression models relate a set of "predictor" variables (X) to the potency of the response variable (Y), while classification QSAR models relate the predictor variables to a categorical value of the response variable.

In QSAR modeling, the predictors consist of physico-chemical properties or theoretical molecular descriptors[1][2] of chemicals; the QSAR response-variable could be a biological activity of the chemicals. QSAR models first summarize a supposed relationship between chemical structures and biological activity in a data-set of chemicals. Second, QSAR models predict the activities of new chemicals.[3][4]

Related terms include quantitative structure–property relationships (QSPR) when a chemical property is modeled as the response variable.[5][6] "Different properties or behaviors of chemical molecules have been investigated in the field of QSPR. Some examples are quantitative structure–reactivity relationships (QSRRs), quantitative structure–chromatography relationships (QSCRs) and, quantitative structure–toxicity relationships (QSTRs), quantitative structure–electrochemistry relationships (QSERs), and quantitative structure–biodegradability relationships (QSBRs)."[7]

As an example, biological activity can be expressed quantitatively as the concentration of a substance required to give a certain biological response. Additionally, when physicochemical properties or structures are expressed by numbers, one can find a mathematical relationship, or quantitative structure-activity relationship, between the two. The mathematical expression, if carefully validated[8][9][10][11] can then be used to predict the modeled response of other chemical structures.[12]

A QSAR has the form of a mathematical model:

  • Activity = f(physiochemical properties and/or structural properties) + error

The error includes model error (bias) and observational variability, that is, the variability in observations even on a correct model.

Essential steps in QSAR studies

Principal steps of QSAR/QSPR including (i) Selection of Data set and extraction of structural/empirical descriptors (ii) variable selection, (iii) model construction and (iv) validation evaluation."[7]

SAR and the SAR paradox

The basic assumption for all molecule-based hypotheses is that similar molecules have similar activities. This principle is also called Structure–Activity Relationship (SAR). The underlying problem is therefore how to define a small difference on a molecular level, since each kind of activity, e.g. reaction ability, biotransformation ability, solubility, target activity, and so on, might depend on another difference. Examples were given in the bioisosterism reviews by Patanie/LaVoie[13] and Brown.[14]

In general, one is more interested in finding strong trends. Created hypotheses usually rely on a finite number of chemicals, so care must be taken to avoid overfitting: the generation of hypotheses that fit training data very closely but perform poorly when applied to new data.

The SAR paradox refers to the fact that it is not the case that all similar molecules have similar activities.

Types

Fragment based (group contribution)

Analogously, the "partition coefficient"—a measurement of differential solubility and itself a component of QSAR predictions—can be predicted either by atomic methods (known as "XLogP" or "ALogP") or by chemical fragment methods (known as "CLogP" and other variations). It has been shown that the logP of compound can be determined by the sum of its fragments; fragment-based methods are generally accepted as better predictors than atomic-based methods.[15] Fragmentary values have been determined statistically, based on empirical data for known logP values. This method gives mixed results and is generally not trusted to have accuracy of more than ±0.1 units.[16]

Group or Fragment based QSAR is also known as GQSAR.[17] GQSAR allows flexibility to study various molecular fragments of interest in relation to the variation in biological response. The molecular fragments could be substituents at various substitution sites in congeneric set of molecules or could be on the basis of pre-defined chemical rules in case of non-congeneric sets. GQSAR also considers cross-terms fragment descriptors, which could be helpful in identification of key fragment interactions in determining variation of activity.[17] Lead discovery using Fragnomics is an emerging paradigm. In this context FB-QSAR proves to be a promising strategy for fragment library design and in fragment-to-lead identification endeavours.[18]

An advanced approach on fragment or group-based QSAR based on the concept of pharmacophore-similarity is developed.[19] This method, pharmacophore-similarity-based QSAR (PS-QSAR) uses topological pharmacophoric descriptors to develop QSAR models. This activity prediction may assist the contribution of certain pharmacophore features encoded by respective fragments toward activity improvement and/or detrimental effects.[19]

3D-QSAR

The acronym 3D-QSAR or 3-D QSAR refers to the application of force field calculations requiring three-dimensional structures of a given set of small molecules with known activities (training set). The training set needs to be superimposed (aligned) by either experimental data (e.g. based on ligand-protein crystallography) or molecule superimposition software. It uses computed potentials, e.g. the Lennard-Jones potential, rather than experimental constants and is concerned with the overall molecule rather than a single substituent. The first 3-D QSAR was named Comparative Molecular Field Analysis (CoMFA) by Cramer et al. It examined the steric fields (shape of the molecule) and the electrostatic fields[20] which were correlated by means of partial least squares regression (PLS).

The created data space is then usually reduced by a following feature extraction (see also dimensionality reduction). The following learning method can be any of the already mentioned machine learning methods, e.g. support vector machines.[21] An alternative approach uses multiple-instance learning by encoding molecules as sets of data instances, each of which represents a possible molecular conformation. A label or response is assigned to each set corresponding to the activity of the molecule, which is assumed to be determined by at least one instance in the set (i.e. some conformation of the molecule).[22]

On June 18, 2011 the Comparative Molecular Field Analysis (CoMFA) patent has dropped any restriction on the use of GRID and partial least-squares (PLS) technologies.[citation needed]

Chemical descriptor based

In this approach, descriptors quantifying various electronic, geometric, or steric properties of a molecule are computed and used to develop a QSAR.[23] This approach is different from the fragment (or group contribution) approach in that the descriptors are computed for the system as whole rather than from the properties of individual fragments. This approach is different from the 3D-QSAR approach in that the descriptors are computed from scalar quantities (e.g., energies, geometric parameters) rather than from 3D fields.

An example of this approach is the QSARs developed for olefin polymerization by half sandwich compounds.[24][25]

Modeling

In the literature it can be often found that chemists have a preference for partial least squares (PLS) methods,[citation needed] since it applies the feature extraction and induction in one step.

Data mining approach

Computer SAR models typically calculate a relatively large number of features. Because those lack structural interpretation ability, the preprocessing steps face a feature selection problem (i.e., which structural features should be interpreted to determine the structure-activity relationship). Feature selection can be accomplished by visual inspection (qualitative selection by a human); by data mining; or by molecule mining.

A typical data mining based prediction uses e.g. support vector machines, decision trees, artificial neural networks for inducing a predictive learning model.

Molecule mining approaches, a special case of structured data mining approaches, apply a similarity matrix based prediction or an automatic fragmentation scheme into molecular substructures. Furthermore, there exist also approaches using maximum common subgraph searches or graph kernels.[26][27]

 
QSAR protocol

Matched molecular pair analysis

Typically QSAR models derived from non linear machine learning is seen as a "black box", which fails to guide medicinal chemists. Recently there is a relatively new concept of matched molecular pair analysis[28] or prediction driven MMPA which is coupled with QSAR model in order to identify activity cliffs.[29]

Evaluation of the quality of QSAR models

QSAR modeling produces predictive models derived from application of statistical tools correlating biological activity (including desirable therapeutic effect and undesirable side effects) or physico-chemical properties in QSPR models of chemicals (drugs/toxicants/environmental pollutants) with descriptors representative of molecular structure or properties. QSARs are being applied in many disciplines, for example: risk assessment, toxicity prediction, and regulatory decisions[30] in addition to drug discovery and lead optimization.[31] Obtaining a good quality QSAR model depends on many factors, such as the quality of input data, the choice of descriptors and statistical methods for modeling and for validation. Any QSAR modeling should ultimately lead to statistically robust and predictive models capable of making accurate and reliable predictions of the modeled response of new compounds.

For validation of QSAR models, usually various strategies are adopted:[32]

  1. internal validation or cross-validation (actually, while extracting data, cross validation is a measure of model robustness, the more a model is robust (higher q2) the less data extraction perturb the original model);
  2. external validation by splitting the available data set into training set for model development and prediction set for model predictivity check;
  3. blind external validation by application of model on new external data and
  4. data randomization or Y-scrambling for verifying the absence of chance correlation between the response and the modeling descriptors.

The success of any QSAR model depends on accuracy of the input data, selection of appropriate descriptors and statistical tools, and most importantly validation of the developed model. Validation is the process by which the reliability and relevance of a procedure are established for a specific purpose; for QSAR models validation must be mainly for robustness, prediction performances and applicability domain (AD) of the models.[8][9][11][33][34]

Some validation methodologies can be problematic. For example, leave one-out cross-validation generally leads to an overestimation of predictive capacity. Even with external validation, it is difficult to determine whether the selection of training and test sets was manipulated to maximize the predictive capacity of the model being published.

Different aspects of validation of QSAR models that need attention include methods of selection of training set compounds,[35] setting training set size[36] and impact of variable selection[37] for training set models for determining the quality of prediction. Development of novel validation parameters for judging quality of QSAR models is also important.[11][38][39]

Application

Chemical

One of the first historical QSAR applications was to predict boiling points.[40]

It is well known for instance that within a particular family of chemical compounds, especially of organic chemistry, that there are strong correlations between structure and observed properties. A simple example is the relationship between the number of carbons in alkanes and their boiling points. There is a clear trend in the increase of boiling point with an increase in the number carbons, and this serves as a means for predicting the boiling points of higher alkanes.

A still very interesting application is the Hammett equation, Taft equation and pKa prediction methods.[41]

Biological

The biological activity of molecules is usually measured in assays to establish the level of inhibition of particular signal transduction or metabolic pathways. Drug discovery often involves the use of QSAR to identify chemical structures that could have good inhibitory effects on specific targets and have low toxicity (non-specific activity). Of special interest is the prediction of partition coefficient log P, which is an important measure used in identifying "druglikeness" according to Lipinski's Rule of Five.

While many quantitative structure activity relationship analyses involve the interactions of a family of molecules with an enzyme or receptor binding site, QSAR can also be used to study the interactions between the structural domains of proteins. Protein-protein interactions can be quantitatively analyzed for structural variations resulted from site-directed mutagenesis.[42]

It is part of the machine learning method to reduce the risk for a SAR paradox, especially taking into account that only a finite amount of data is available (see also MVUE). In general, all QSAR problems can be divided into coding[43] and learning.[44]

Applications

(Q)SAR models have been used for risk management. QSARS are suggested by regulatory authorities; in the European Union, QSARs are suggested by the REACH regulation, where "REACH" abbreviates "Registration, Evaluation, Authorisation and Restriction of Chemicals". Regulatory application of QSAR methods includes in silico toxicological assessment of genotoxic impurities.[45] Commonly used QSAR assessment software such as DEREK or CASE Ultra (MultiCASE) is used to genotoxicity of impurity according to ICH M7.[46]

The chemical descriptor space whose convex hull is generated by a particular training set of chemicals is called the training set's applicability domain. Prediction of properties of novel chemicals that are located outside the applicability domain uses extrapolation, and so is less reliable (on average) than prediction within the applicability domain. The assessment of the reliability of QSAR predictions remains a research topic.

The QSAR equations can be used to predict biological activities of newer molecules before their synthesis.

Examples of machine learning tools for QSAR modeling include:[47]

S.No. Name Algorithms External link
1. R RF,SVM, Naïve Bayesian, and ANN "R: The R Project for Statistical Computing".
2. libSVM SVM "LIBSVM -- A Library for Support Vector Machines".
3. Orange RF, SVM, and Naïve Bayesian "Orange Data Mining".
4. RapidMiner SVM, RF, Naïve Bayes, DT, ANN, and k-NN "RapidMiner | #1 Open Source Predictive Analytics Platform".
5. Weka RF, SVM, and Naïve Bayes "Weka 3 - Data Mining with Open Source Machine Learning Software in Java".
6. Knime DT, Naïve Bayes, and SVM "KNIME | Open for Innovation".
7. AZOrange[48] RT, SVM, ANN, and RF "AZCompTox/AZOrange: AstraZeneca add-ons to Orange". GitHub. 2018-09-19.
8. Tanagra SVM, RF, Naïve Bayes, and DT "TANAGRA - A free DATA MINING software for teaching and research".
9. Elki k-NN . Archived from the original on 2016-11-19.
10. MALLET "MALLET homepage".
11. MOA . Archived from the original on 2017-06-19.
12. Deep Chem Logistic Regression, Naive Bayes, RF, ANN, and others "DeepChem". deepchem.io. Retrieved 20 October 2017.
13. alvaModel[49] Regression (OLS, PLS, k-NN, SVM and Consensus) and Classification (LDA/QDA, PLS-DA, k-NN, SVM and Consensus) "alvaModel: a software tool to create QSAR/QSPR models". alvascience.com.

See also

References

  1. ^ Todeschini, Roberto; Consonni, Viviana (2009). Molecular Descriptors for Chemoinformatics. Methods and Principles in Medicinal Chemistry. Vol. 41. Wiley. doi:10.1002/9783527628766. ISBN 978-3-527-31852-0.
  2. ^ Mauri, Andrea; Consonni, Viviana; Todeschini, Roberto (2017). "Molecular Descriptors". Handbook of Computational Chemistry. Springer International Publishing. pp. 2065–2093. doi:10.1007/978-3-319-27282-5_51. ISBN 978-3-319-27282-5.
  3. ^ Roy K, Kar S, Das RN (2015). "Chapter 1.2: What is QSAR? Definitions and Formulism". A primer on QSAR/QSPR modeling: Fundamental Concepts. New York: Springer-Verlag Inc. pp. 2–6. ISBN 978-3-319-17281-1.
  4. ^ Ghasemi, Pérez-Sánchez; Mehri, Pérez-Garrido (2018). "Neural network and deep-learning algorithms used in QSAR studies: merits and drawbacks". Drug Discovery Today. 23 (10): 1784–1790. doi:10.1016/j.drudis.2018.06.016. PMID 29936244. S2CID 49418479.
  5. ^ Nantasenamat C, Isarankura-Na-Ayudhya C, Naenna T, Prachayasittikul V (2009). "A practical overview of quantitative structure-activity relationship". Excli Journal. 8: 74–88. doi:10.17877/DE290R-690.
  6. ^ Nantasenamat C, Isarankura-Na-Ayudhya C, Prachayasittikul V (Jul 2010). "Advances in computational methods to predict the biological activity of compounds". Expert Opinion on Drug Discovery. 5 (7): 633–54. doi:10.1517/17460441.2010.492827. PMID 22823204. S2CID 17622541.
  7. ^ a b Yousefinejad S, Hemmateenejad B (2015). "Chemometrics tools in QSAR/QSPR studies: A historical perspective". Chemometrics and Intelligent Laboratory Systems. 149, Part B: 177–204. doi:10.1016/j.chemolab.2015.06.016.
  8. ^ a b Tropsha A, Gramatica P, Gombar VJ (2003). "The Importance of Being Earnest: Validation is the Absolute Essential for Successful Application and Interpretation of QSPR Models". QSAR Comb. Sci. 22: 69–77. doi:10.1002/qsar.200390007.
  9. ^ a b Gramatica P (2007). "Principles of QSAR models validation: internal and external". QSAR Comb. Sci. 26 (5): 694–701. doi:10.1002/qsar.200610151. hdl:11383/1668881.
  10. ^ Ruusmann, V.; Sild, S.; Maran, U. (2015). "QSAR DataBank repository: open and linked qualitative and quantitative structure–activity relationship models". Journal of Cheminformatics. 7: 32. doi:10.1186/s13321-015-0082-6. PMC 4479250. PMID 26110025.
  11. ^ a b c Chirico N, Gramatica P (Aug 2012). "Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection". Journal of Chemical Information and Modeling. 52 (8): 2044–58. doi:10.1021/ci300084j. PMID 22721530.
  12. ^ Tropsha, Alexander (2010). "Best Practices for QSAR Model Development, Validation, and Exploitation". Molecular Informatics. 29 (6–7): 476–488. doi:10.1002/minf.201000061. ISSN 1868-1743. PMID 27463326. S2CID 23564249.
  13. ^ Patani GA, LaVoie EJ (Dec 1996). "Bioisosterism: A Rational Approach in Drug Design". Chemical Reviews. 96 (8): 3147–3176. doi:10.1021/cr950066q. PMID 11848856.
  14. ^ Brown N (2012). Bioisosteres in Medicinal Chemistry. Weinheim: Wiley-VCH. ISBN 978-3-527-33015-7.
  15. ^ Thompson SJ, Hattotuwagama CK, Holliday JD, Flower DR (2006). "On the hydrophobicity of peptides: Comparing empirical predictions of peptide log P values". Bioinformation. 1 (7): 237–41. doi:10.6026/97320630001237. PMC 1891704. PMID 17597897.
  16. ^ Wildman SA, Crippen GM (1999). "Prediction of physicochemical parameters by atomic contributions". J. Chem. Inf. Comput. Sci. 39 (5): 868–873. doi:10.1021/ci990307l.
  17. ^ a b Ajmani S, Jadhav K, Kulkarni SA. "Group-Based QSAR (G-QSAR)". {{cite journal}}: Cite journal requires |journal= (help)
  18. ^ Manoharan P, Vijayan RS, Ghoshal N (Oct 2010). "Rationalizing fragment based drug discovery for BACE1: insights from FB-QSAR, FB-QSSR, multi objective (MO-QSPR) and MIF studies". Journal of Computer-Aided Molecular Design. 24 (10): 843–64. Bibcode:2010JCAMD..24..843M. doi:10.1007/s10822-010-9378-9. PMID 20740315. S2CID 1171860.
  19. ^ a b Prasanth Kumar S, Jasrai YT, Pandya HA, Rawal RM (November 2013). "Pharmacophore-similarity-based QSAR (PS-QSAR) for group-specific biological activity predictions". Journal of Biomolecular Structure & Dynamics. 33 (1): 56–69. doi:10.1080/07391102.2013.849618. PMID 24266725. S2CID 45364247.
  20. ^ Leach AR (2001). Molecular modelling: principles and applications. Englewood Cliffs, N.J: Prentice Hall. ISBN 978-0-582-38210-7.
  21. ^ Vert JP, Schölkopf B, Tsuda K (2004). Kernel methods in computational biology. Cambridge, Mass: MIT Press. ISBN 978-0-262-19509-6.
  22. ^ Dietterich TG, Lathrop RH, Lozano-Pérez T (1997). "Solving the multiple instance problem with axis-parallel rectangles". Artificial Intelligence. 89 (1–2): 31–71. doi:10.1016/S0004-3702(96)00034-3.
  23. ^ Caruthers JM, Lauterbach JA, Thomson KT, Venkatasubramanian V, Snively CM, Bhan A, Katare S, Oskarsdottir G (2003). "Catalyst design: knowledge extraction from high-throughput experimentation". J. Catal. 216 (1–2): 3776–3777. doi:10.1016/S0021-9517(02)00036-2.
  24. ^ Manz TA, Phomphrai K, Medvedev G, Krishnamurthy BB, Sharma S, Haq J, Novstrup KA, Thomson KT, Delgass WN, Caruthers JM, Abu-Omar MM (Apr 2007). "Structure-activity correlation in titanium single-site olefin polymerization catalysts containing mixed cyclopentadienyl/aryloxide ligation". Journal of the American Chemical Society. 129 (13): 3776–7. doi:10.1021/ja0640849. PMID 17348648.
  25. ^ Manz TA, Caruthers JM, Sharma S, Phomphrai K, Thomson KT, Delgass WN, Abu-Omar MM (2012). "Structure–Activity Correlation for Relative Chain Initiation to Propagation Rates in Single-Site Olefin Polymerization Catalysis". Organometallics. 31 (2): 602–618. doi:10.1021/om200884x.
  26. ^ Gusfield D (1997). Algorithms on strings, trees, and sequences: computer science and computational biology. Cambridge, UK: Cambridge University Press. ISBN 978-0-521-58519-4.
  27. ^ Helma C (2005). Predictive toxicology. Washington, DC: Taylor & Francis. ISBN 978-0-8247-2397-2.
  28. ^ Dossetter AG, Griffen EJ, Leach AG (2013). "Matched molecular pair analysis in drug discovery". Drug Discovery Today. 18 (15–16): 724–31. doi:10.1016/j.drudis.2013.03.003. PMID 23557664.
  29. ^ Sushko Y, Novotarskyi S, Körner R, Vogt J, Abdelaziz A, Tetko IV (2014). "Prediction-driven matched molecular pairs to interpret QSARs and aid the molecular optimization process". Journal of Cheminformatics. 6 (1): 48. doi:10.1186/s13321-014-0048-0. PMC 4272757. PMID 25544551.
  30. ^ Tong W, Hong H, Xie Q, Shi L, Fang H, Perkins R (April 2005). "Assessing QSAR Limitations – A Regulatory Perspective". Current Computer-Aided Drug Design. 1 (2): 195–205. doi:10.2174/1573409053585663.
  31. ^ Dearden JC (2003). "In silico prediction of drug toxicity". Journal of Computer-Aided Molecular Design. 17 (2–4): 119–27. Bibcode:2003JCAMD..17..119D. doi:10.1023/A:1025361621494. PMID 13677480. S2CID 21518449.
  32. ^ Wold S, Eriksson L (1995). "Statistical validation of QSAR results". In Waterbeemd, Han van de (ed.). Chemometric methods in molecular design. Weinheim: VCH. pp. 309–318. ISBN 978-3-527-30044-0.
  33. ^ Roy K (Dec 2007). "On some aspects of validation of predictive quantitative structure-activity relationship models". Expert Opinion on Drug Discovery. 2 (12): 1567–77. doi:10.1517/17460441.2.12.1567. PMID 23488901. S2CID 21305783.
  34. ^ Sahigara, Faizan; Mansouri, Kamel; Ballabio, Davide; Mauri, Andrea; Consonni, Viviana; Todeschini, Roberto (2012). "Comparison of Different Approaches to Define the Applicability Domain of QSAR Models". Molecules. 17 (5): 4791–4810. doi:10.3390/molecules17054791. PMC 6268288. PMID 22534664.
  35. ^ Leonard JT, Roy K (2006). "On selection of training and test sets for the development of predictive QSAR models". QSAR & Combinatorial Science. 25 (3): 235–251. doi:10.1002/qsar.200510161.
  36. ^ Roy PP, Leonard JT, Roy K (2008). "Exploring the impact of size of training sets for the development of predictive QSAR models". Chemometrics and Intelligent Laboratory Systems. 90 (1): 31–42. doi:10.1016/j.chemolab.2007.07.004.
  37. ^ Put R, Vander Heyden Y (Oct 2007). "Review on modelling aspects in reversed-phase liquid chromatographic quantitative structure-retention relationships". Analytica Chimica Acta. 602 (2): 164–72. doi:10.1016/j.aca.2007.09.014. PMID 17933600.
  38. ^ Pratim Roy P, Paul S, Mitra I, Roy K (2009). "On two novel parameters for validation of predictive QSAR models". Molecules. 14 (5): 1660–701. doi:10.3390/molecules14051660. PMC 6254296. PMID 19471190.
  39. ^ Chirico N, Gramatica P (Sep 2011). "Real external predictivity of QSAR models: how to evaluate it? Comparison of different validation criteria and proposal of using the concordance correlation coefficient". Journal of Chemical Information and Modeling. 51 (9): 2320–35. doi:10.1021/ci200211n. PMID 21800825.
  40. ^ Rouvray DH, Bonchev D (1991). Chemical graph theory: introduction and fundamentals. Tunbridge Wells, Kent, England: Abacus Press. ISBN 978-0-85626-454-2.
  41. ^ Fraczkiewicz, R (2013). "In Silico Prediction of Ionization". In Reedijk, J (ed.). Reference Module in Chemistry, Molecular Sciences and Chemical Engineering. Reference Module in Chemistry, Molecular Sciences and Chemical Engineering [Online]. Vol. 5. Amsterdam, The Netherlands: Elsevier. doi:10.1016/B978-0-12-409547-2.02610-X. ISBN 9780124095472.
  42. ^ Freyhult EK, Andersson K, Gustafsson MG (Apr 2003). "Structural modeling extends QSAR analysis of antibody-lysozyme interactions to 3D-QSAR". Biophysical Journal. 84 (4): 2264–72. Bibcode:2003BpJ....84.2264F. doi:10.1016/S0006-3495(03)75032-2. PMC 1302793. PMID 12668435.
  43. ^ Timmerman H, Todeschini R, Consonni V, Mannhold R, Kubinyi H (2002). Handbook of Molecular Descriptors. Weinheim: Wiley-VCH. ISBN 978-3-527-29913-3.
  44. ^ Duda RO, Hart PW, Stork DG (2001). Pattern classification. Chichester: John Wiley & Sons. ISBN 978-0-471-05669-0.
  45. ^ Fioravanzo, E.; Bassan, A.; Pavan, M.; Mostrag-Szlichtyng, A.; Worth, A. P. (2012-04-01). "Role of in silico genotoxicity tools in the regulatory assessment of pharmaceutical impurities". SAR and QSAR in Environmental Research. 23 (3–4): 257–277. doi:10.1080/1062936X.2012.657236. ISSN 1062-936X. PMID 22369620. S2CID 2714861.
  46. ^ ICH M7 Assessment and control of DNA reactive (mutagenic) impurities in pharmaceuticals to limit potential carcinogenic risk - Scientific guideline [1]
  47. ^ Lavecchia A (Mar 2015). "Machine-learning approaches in drug discovery: methods and applications". Drug Discovery Today. 20 (3): 318–31. doi:10.1016/j.drudis.2014.10.012. PMID 25448759.
  48. ^ Stålring JC, Carlsson LA, Almeida P, Boyer S (2011). "AZOrange - High performance open source machine learning for QSAR modeling in a graphical programming environment". Journal of Cheminformatics. 3: 28. doi:10.1186/1758-2946-3-28. PMC 3158423. PMID 21798025.
  49. ^ Mauri, Andrea; Bertola, Matteo (2022). "Alvascience: A New Software Suite for the QSAR Workflow Applied to the Blood–Brain Barrier Permeability". International Journal of Molecular Sciences. 23 (12882): 12882. doi:10.3390/ijms232112882. PMC 9655980. PMID 36361669.

Further reading

  • Selassie CD (2003). "History of Quantitative Structure-Activity Relationships" (PDF). In Abraham DJ (ed.). Burger's medicinal Chemistry and Drug Discovery. Vol. 1 (6th ed.). New York: Wiley. pp. 1–48. ISBN 978-0-471-27401-8.
  • Shityakov S, Puskás I, Roewer N, Förster C, Broscheit J (2014). "Three-dimensional quantitative structure-activity relationship and docking studies in a series of anthocyanin derivatives as cytochrome P450 3A4 inhibitors". Advances and Applications in Bioinformatics and Chemistry. 7: 11–21. doi:10.2147/AABC.S56478. PMC 3970920. PMID 24741320.

External links

  • "The Cheminformatics and QSAR Society". Retrieved 2009-05-11.
  • "The 3D QSAR Server". Retrieved 2011-06-18.
  • Verma, Rajeshwar P.; Hansch, Corwin (2007). . Protocol Exchange. doi:10.1038/nprot.2007.125. Archived from the original on 2007-05-01. Retrieved 2009-05-11. A regression program that has dual databases of over 21,000 QSAR models
  • . Archived from the original on 2009-04-25. Retrieved 2009-05-11. A comprehensive web resource for QSAR modelers
  • Chemoinformatics Tools, Drug Theoretics and Cheminformatics Laboratory
  • Multiscale Conceptual Model Figures for QSARs in Biological and Environmental Science

quantitative, structure, activity, relationship, models, qsar, models, regression, classification, models, used, chemical, biological, sciences, engineering, like, other, regression, models, qsar, regression, models, relate, predictor, variables, potency, resp. Quantitative structure activity relationship models QSAR models are regression or classification models used in the chemical and biological sciences and engineering Like other regression models QSAR regression models relate a set of predictor variables X to the potency of the response variable Y while classification QSAR models relate the predictor variables to a categorical value of the response variable In QSAR modeling the predictors consist of physico chemical properties or theoretical molecular descriptors 1 2 of chemicals the QSAR response variable could be a biological activity of the chemicals QSAR models first summarize a supposed relationship between chemical structures and biological activity in a data set of chemicals Second QSAR models predict the activities of new chemicals 3 4 Related terms include quantitative structure property relationships QSPR when a chemical property is modeled as the response variable 5 6 Different properties or behaviors of chemical molecules have been investigated in the field of QSPR Some examples are quantitative structure reactivity relationships QSRRs quantitative structure chromatography relationships QSCRs and quantitative structure toxicity relationships QSTRs quantitative structure electrochemistry relationships QSERs and quantitative structure biodegradability relationships QSBRs 7 As an example biological activity can be expressed quantitatively as the concentration of a substance required to give a certain biological response Additionally when physicochemical properties or structures are expressed by numbers one can find a mathematical relationship or quantitative structure activity relationship between the two The mathematical expression if carefully validated 8 9 10 11 can then be used to predict the modeled response of other chemical structures 12 A QSAR has the form of a mathematical model Activity f physiochemical properties and or structural properties errorThe error includes model error bias and observational variability that is the variability in observations even on a correct model Contents 1 Essential steps in QSAR studies 2 SAR and the SAR paradox 3 Types 3 1 Fragment based group contribution 3 2 3D QSAR 3 3 Chemical descriptor based 4 Modeling 4 1 Data mining approach 4 2 Matched molecular pair analysis 5 Evaluation of the quality of QSAR models 6 Application 6 1 Chemical 6 2 Biological 6 3 Applications 7 See also 8 References 9 Further reading 10 External linksEssential steps in QSAR studies EditPrincipal steps of QSAR QSPR including i Selection of Data set and extraction of structural empirical descriptors ii variable selection iii model construction and iv validation evaluation 7 SAR and the SAR paradox EditThe basic assumption for all molecule based hypotheses is that similar molecules have similar activities This principle is also called Structure Activity Relationship SAR The underlying problem is therefore how to define a small difference on a molecular level since each kind of activity e g reaction ability biotransformation ability solubility target activity and so on might depend on another difference Examples were given in the bioisosterism reviews by Patanie LaVoie 13 and Brown 14 In general one is more interested in finding strong trends Created hypotheses usually rely on a finite number of chemicals so care must be taken to avoid overfitting the generation of hypotheses that fit training data very closely but perform poorly when applied to new data The SAR paradox refers to the fact that it is not the case that all similar molecules have similar activities Types EditFragment based group contribution Edit Analogously the partition coefficient a measurement of differential solubility and itself a component of QSAR predictions can be predicted either by atomic methods known as XLogP or ALogP or by chemical fragment methods known as CLogP and other variations It has been shown that the logP of compound can be determined by the sum of its fragments fragment based methods are generally accepted as better predictors than atomic based methods 15 Fragmentary values have been determined statistically based on empirical data for known logP values This method gives mixed results and is generally not trusted to have accuracy of more than 0 1 units 16 Group or Fragment based QSAR is also known as GQSAR 17 GQSAR allows flexibility to study various molecular fragments of interest in relation to the variation in biological response The molecular fragments could be substituents at various substitution sites in congeneric set of molecules or could be on the basis of pre defined chemical rules in case of non congeneric sets GQSAR also considers cross terms fragment descriptors which could be helpful in identification of key fragment interactions in determining variation of activity 17 Lead discovery using Fragnomics is an emerging paradigm In this context FB QSAR proves to be a promising strategy for fragment library design and in fragment to lead identification endeavours 18 An advanced approach on fragment or group based QSAR based on the concept of pharmacophore similarity is developed 19 This method pharmacophore similarity based QSAR PS QSAR uses topological pharmacophoric descriptors to develop QSAR models This activity prediction may assist the contribution of certain pharmacophore features encoded by respective fragments toward activity improvement and or detrimental effects 19 3D QSAR Edit The acronym 3D QSAR or 3 D QSAR refers to the application of force field calculations requiring three dimensional structures of a given set of small molecules with known activities training set The training set needs to be superimposed aligned by either experimental data e g based on ligand protein crystallography or molecule superimposition software It uses computed potentials e g the Lennard Jones potential rather than experimental constants and is concerned with the overall molecule rather than a single substituent The first 3 D QSAR was named Comparative Molecular Field Analysis CoMFA by Cramer et al It examined the steric fields shape of the molecule and the electrostatic fields 20 which were correlated by means of partial least squares regression PLS The created data space is then usually reduced by a following feature extraction see also dimensionality reduction The following learning method can be any of the already mentioned machine learning methods e g support vector machines 21 An alternative approach uses multiple instance learning by encoding molecules as sets of data instances each of which represents a possible molecular conformation A label or response is assigned to each set corresponding to the activity of the molecule which is assumed to be determined by at least one instance in the set i e some conformation of the molecule 22 On June 18 2011 the Comparative Molecular Field Analysis CoMFA patent has dropped any restriction on the use of GRID and partial least squares PLS technologies citation needed Chemical descriptor based Edit In this approach descriptors quantifying various electronic geometric or steric properties of a molecule are computed and used to develop a QSAR 23 This approach is different from the fragment or group contribution approach in that the descriptors are computed for the system as whole rather than from the properties of individual fragments This approach is different from the 3D QSAR approach in that the descriptors are computed from scalar quantities e g energies geometric parameters rather than from 3D fields An example of this approach is the QSARs developed for olefin polymerization by half sandwich compounds 24 25 Modeling EditIn the literature it can be often found that chemists have a preference for partial least squares PLS methods citation needed since it applies the feature extraction and induction in one step Data mining approach Edit Computer SAR models typically calculate a relatively large number of features Because those lack structural interpretation ability the preprocessing steps face a feature selection problem i e which structural features should be interpreted to determine the structure activity relationship Feature selection can be accomplished by visual inspection qualitative selection by a human by data mining or by molecule mining A typical data mining based prediction uses e g support vector machines decision trees artificial neural networks for inducing a predictive learning model Molecule mining approaches a special case of structured data mining approaches apply a similarity matrix based prediction or an automatic fragmentation scheme into molecular substructures Furthermore there exist also approaches using maximum common subgraph searches or graph kernels 26 27 QSAR protocol Matched molecular pair analysis Edit Main article Matched molecular pair analysis Typically QSAR models derived from non linear machine learning is seen as a black box which fails to guide medicinal chemists Recently there is a relatively new concept of matched molecular pair analysis 28 or prediction driven MMPA which is coupled with QSAR model in order to identify activity cliffs 29 Evaluation of the quality of QSAR models EditQSAR modeling produces predictive models derived from application of statistical tools correlating biological activity including desirable therapeutic effect and undesirable side effects or physico chemical properties in QSPR models of chemicals drugs toxicants environmental pollutants with descriptors representative of molecular structure or properties QSARs are being applied in many disciplines for example risk assessment toxicity prediction and regulatory decisions 30 in addition to drug discovery and lead optimization 31 Obtaining a good quality QSAR model depends on many factors such as the quality of input data the choice of descriptors and statistical methods for modeling and for validation Any QSAR modeling should ultimately lead to statistically robust and predictive models capable of making accurate and reliable predictions of the modeled response of new compounds For validation of QSAR models usually various strategies are adopted 32 internal validation or cross validation actually while extracting data cross validation is a measure of model robustness the more a model is robust higher q2 the less data extraction perturb the original model external validation by splitting the available data set into training set for model development and prediction set for model predictivity check blind external validation by application of model on new external data and data randomization or Y scrambling for verifying the absence of chance correlation between the response and the modeling descriptors The success of any QSAR model depends on accuracy of the input data selection of appropriate descriptors and statistical tools and most importantly validation of the developed model Validation is the process by which the reliability and relevance of a procedure are established for a specific purpose for QSAR models validation must be mainly for robustness prediction performances and applicability domain AD of the models 8 9 11 33 34 Some validation methodologies can be problematic For example leave one out cross validation generally leads to an overestimation of predictive capacity Even with external validation it is difficult to determine whether the selection of training and test sets was manipulated to maximize the predictive capacity of the model being published Different aspects of validation of QSAR models that need attention include methods of selection of training set compounds 35 setting training set size 36 and impact of variable selection 37 for training set models for determining the quality of prediction Development of novel validation parameters for judging quality of QSAR models is also important 11 38 39 Application EditChemical Edit One of the first historical QSAR applications was to predict boiling points 40 It is well known for instance that within a particular family of chemical compounds especially of organic chemistry that there are strong correlations between structure and observed properties A simple example is the relationship between the number of carbons in alkanes and their boiling points There is a clear trend in the increase of boiling point with an increase in the number carbons and this serves as a means for predicting the boiling points of higher alkanes A still very interesting application is the Hammett equation Taft equation and pKa prediction methods 41 Biological Edit The biological activity of molecules is usually measured in assays to establish the level of inhibition of particular signal transduction or metabolic pathways Drug discovery often involves the use of QSAR to identify chemical structures that could have good inhibitory effects on specific targets and have low toxicity non specific activity Of special interest is the prediction of partition coefficient log P which is an important measure used in identifying druglikeness according to Lipinski s Rule of Five While many quantitative structure activity relationship analyses involve the interactions of a family of molecules with an enzyme or receptor binding site QSAR can also be used to study the interactions between the structural domains of proteins Protein protein interactions can be quantitatively analyzed for structural variations resulted from site directed mutagenesis 42 It is part of the machine learning method to reduce the risk for a SAR paradox especially taking into account that only a finite amount of data is available see also MVUE In general all QSAR problems can be divided into coding 43 and learning 44 Applications Edit Q SAR models have been used for risk management QSARS are suggested by regulatory authorities in the European Union QSARs are suggested by the REACH regulation where REACH abbreviates Registration Evaluation Authorisation and Restriction of Chemicals Regulatory application of QSAR methods includes in silico toxicological assessment of genotoxic impurities 45 Commonly used QSAR assessment software such as DEREK or CASE Ultra MultiCASE is used to genotoxicity of impurity according to ICH M7 46 The chemical descriptor space whose convex hull is generated by a particular training set of chemicals is called the training set s applicability domain Prediction of properties of novel chemicals that are located outside the applicability domain uses extrapolation and so is less reliable on average than prediction within the applicability domain The assessment of the reliability of QSAR predictions remains a research topic The QSAR equations can be used to predict biological activities of newer molecules before their synthesis Examples of machine learning tools for QSAR modeling include 47 S No Name Algorithms External link1 R RF SVM Naive Bayesian and ANN R The R Project for Statistical Computing 2 libSVM SVM LIBSVM A Library for Support Vector Machines 3 Orange RF SVM and Naive Bayesian Orange Data Mining 4 RapidMiner SVM RF Naive Bayes DT ANN and k NN RapidMiner 1 Open Source Predictive Analytics Platform 5 Weka RF SVM and Naive Bayes Weka 3 Data Mining with Open Source Machine Learning Software in Java 6 Knime DT Naive Bayes and SVM KNIME Open for Innovation 7 AZOrange 48 RT SVM ANN and RF AZCompTox AZOrange AstraZeneca add ons to Orange GitHub 2018 09 19 8 Tanagra SVM RF Naive Bayes and DT TANAGRA A free DATA MINING software for teaching and research 9 Elki k NN ELKI Data Mining Framework Archived from the original on 2016 11 19 10 MALLET MALLET homepage 11 MOA MOA Massive Online Analysis Real Time Analytics for Data Streams Archived from the original on 2017 06 19 12 Deep Chem Logistic Regression Naive Bayes RF ANN and others DeepChem deepchem io Retrieved 20 October 2017 13 alvaModel 49 Regression OLS PLS k NN SVM and Consensus and Classification LDA QDA PLS DA k NN SVM and Consensus alvaModel a software tool to create QSAR QSPR models alvascience com See also EditADME Cheminformatics Computer assisted drug design CADD Conformation activity relationship Differential solubility Matched molecular pair analysis Molecular descriptor Molecular design software Partition coefficient Pharmacokinetics Pharmacophore QSAR amp Combinatorial Science Scientific journal Software for molecular mechanics modeling Chemicalize org List of predicted structure based propertiesReferences Edit Todeschini Roberto Consonni Viviana 2009 Molecular Descriptors for Chemoinformatics Methods and Principles in Medicinal Chemistry Vol 41 Wiley doi 10 1002 9783527628766 ISBN 978 3 527 31852 0 Mauri Andrea Consonni Viviana Todeschini Roberto 2017 Molecular Descriptors Handbook of Computational Chemistry Springer International Publishing pp 2065 2093 doi 10 1007 978 3 319 27282 5 51 ISBN 978 3 319 27282 5 Roy K Kar S Das RN 2015 Chapter 1 2 What is QSAR Definitions and Formulism A primer on QSAR QSPR modeling Fundamental Concepts New York Springer Verlag Inc pp 2 6 ISBN 978 3 319 17281 1 Ghasemi Perez Sanchez Mehri Perez Garrido 2018 Neural network and deep learning algorithms used in QSAR studies merits and drawbacks Drug Discovery Today 23 10 1784 1790 doi 10 1016 j drudis 2018 06 016 PMID 29936244 S2CID 49418479 Nantasenamat C Isarankura Na Ayudhya C Naenna T Prachayasittikul V 2009 A practical overview of quantitative structure activity relationship Excli Journal 8 74 88 doi 10 17877 DE290R 690 Nantasenamat C Isarankura Na Ayudhya C Prachayasittikul V Jul 2010 Advances in computational methods to predict the biological activity of compounds Expert Opinion on Drug Discovery 5 7 633 54 doi 10 1517 17460441 2010 492827 PMID 22823204 S2CID 17622541 a b Yousefinejad S Hemmateenejad B 2015 Chemometrics tools in QSAR QSPR studies A historical perspective Chemometrics and Intelligent Laboratory Systems 149 Part B 177 204 doi 10 1016 j chemolab 2015 06 016 a b Tropsha A Gramatica P Gombar VJ 2003 The Importance of Being Earnest Validation is the Absolute Essential for Successful Application and Interpretation of QSPR Models QSAR Comb Sci 22 69 77 doi 10 1002 qsar 200390007 a b Gramatica P 2007 Principles of QSAR models validation internal and external QSAR Comb Sci 26 5 694 701 doi 10 1002 qsar 200610151 hdl 11383 1668881 Ruusmann V Sild S Maran U 2015 QSAR DataBank repository open and linked qualitative and quantitative structure activity relationship models Journal of Cheminformatics 7 32 doi 10 1186 s13321 015 0082 6 PMC 4479250 PMID 26110025 a b c Chirico N Gramatica P Aug 2012 Real external predictivity of QSAR models Part 2 New intercomparable thresholds for different validation criteria and the need for scatter plot inspection Journal of Chemical Information and Modeling 52 8 2044 58 doi 10 1021 ci300084j PMID 22721530 Tropsha Alexander 2010 Best Practices for QSAR Model Development Validation and Exploitation Molecular Informatics 29 6 7 476 488 doi 10 1002 minf 201000061 ISSN 1868 1743 PMID 27463326 S2CID 23564249 Patani GA LaVoie EJ Dec 1996 Bioisosterism A Rational Approach in Drug Design Chemical Reviews 96 8 3147 3176 doi 10 1021 cr950066q PMID 11848856 Brown N 2012 Bioisosteres in Medicinal Chemistry Weinheim Wiley VCH ISBN 978 3 527 33015 7 Thompson SJ Hattotuwagama CK Holliday JD Flower DR 2006 On the hydrophobicity of peptides Comparing empirical predictions of peptide log P values Bioinformation 1 7 237 41 doi 10 6026 97320630001237 PMC 1891704 PMID 17597897 Wildman SA Crippen GM 1999 Prediction of physicochemical parameters by atomic contributions J Chem Inf Comput Sci 39 5 868 873 doi 10 1021 ci990307l a b Ajmani S Jadhav K Kulkarni SA Group Based QSAR G QSAR a href Template Cite journal html title Template Cite journal cite journal a Cite journal requires journal help Manoharan P Vijayan RS Ghoshal N Oct 2010 Rationalizing fragment based drug discovery for BACE1 insights from FB QSAR FB QSSR multi objective MO QSPR and MIF studies Journal of Computer Aided Molecular Design 24 10 843 64 Bibcode 2010JCAMD 24 843M doi 10 1007 s10822 010 9378 9 PMID 20740315 S2CID 1171860 a b Prasanth Kumar S Jasrai YT Pandya HA Rawal RM November 2013 Pharmacophore similarity based QSAR PS QSAR for group specific biological activity predictions Journal of Biomolecular Structure amp Dynamics 33 1 56 69 doi 10 1080 07391102 2013 849618 PMID 24266725 S2CID 45364247 Leach AR 2001 Molecular modelling principles and applications Englewood Cliffs N J Prentice Hall ISBN 978 0 582 38210 7 Vert JP Scholkopf B Tsuda K 2004 Kernel methods in computational biology Cambridge Mass MIT Press ISBN 978 0 262 19509 6 Dietterich TG Lathrop RH Lozano Perez T 1997 Solving the multiple instance problem with axis parallel rectangles Artificial Intelligence 89 1 2 31 71 doi 10 1016 S0004 3702 96 00034 3 Caruthers JM Lauterbach JA Thomson KT Venkatasubramanian V Snively CM Bhan A Katare S Oskarsdottir G 2003 Catalyst design knowledge extraction from high throughput experimentation J Catal 216 1 2 3776 3777 doi 10 1016 S0021 9517 02 00036 2 Manz TA Phomphrai K Medvedev G Krishnamurthy BB Sharma S Haq J Novstrup KA Thomson KT Delgass WN Caruthers JM Abu Omar MM Apr 2007 Structure activity correlation in titanium single site olefin polymerization catalysts containing mixed cyclopentadienyl aryloxide ligation Journal of the American Chemical Society 129 13 3776 7 doi 10 1021 ja0640849 PMID 17348648 Manz TA Caruthers JM Sharma S Phomphrai K Thomson KT Delgass WN Abu Omar MM 2012 Structure Activity Correlation for Relative Chain Initiation to Propagation Rates in Single Site Olefin Polymerization Catalysis Organometallics 31 2 602 618 doi 10 1021 om200884x Gusfield D 1997 Algorithms on strings trees and sequences computer science and computational biology Cambridge UK Cambridge University Press ISBN 978 0 521 58519 4 Helma C 2005 Predictive toxicology Washington DC Taylor amp Francis ISBN 978 0 8247 2397 2 Dossetter AG Griffen EJ Leach AG 2013 Matched molecular pair analysis in drug discovery Drug Discovery Today 18 15 16 724 31 doi 10 1016 j drudis 2013 03 003 PMID 23557664 Sushko Y Novotarskyi S Korner R Vogt J Abdelaziz A Tetko IV 2014 Prediction driven matched molecular pairs to interpret QSARs and aid the molecular optimization process Journal of Cheminformatics 6 1 48 doi 10 1186 s13321 014 0048 0 PMC 4272757 PMID 25544551 Tong W Hong H Xie Q Shi L Fang H Perkins R April 2005 Assessing QSAR Limitations A Regulatory Perspective Current Computer Aided Drug Design 1 2 195 205 doi 10 2174 1573409053585663 Dearden JC 2003 In silico prediction of drug toxicity Journal of Computer Aided Molecular Design 17 2 4 119 27 Bibcode 2003JCAMD 17 119D doi 10 1023 A 1025361621494 PMID 13677480 S2CID 21518449 Wold S Eriksson L 1995 Statistical validation of QSAR results In Waterbeemd Han van de ed Chemometric methods in molecular design Weinheim VCH pp 309 318 ISBN 978 3 527 30044 0 Roy K Dec 2007 On some aspects of validation of predictive quantitative structure activity relationship models Expert Opinion on Drug Discovery 2 12 1567 77 doi 10 1517 17460441 2 12 1567 PMID 23488901 S2CID 21305783 Sahigara Faizan Mansouri Kamel Ballabio Davide Mauri Andrea Consonni Viviana Todeschini Roberto 2012 Comparison of Different Approaches to Define the Applicability Domain of QSAR Models Molecules 17 5 4791 4810 doi 10 3390 molecules17054791 PMC 6268288 PMID 22534664 Leonard JT Roy K 2006 On selection of training and test sets for the development of predictive QSAR models QSAR amp Combinatorial Science 25 3 235 251 doi 10 1002 qsar 200510161 Roy PP Leonard JT Roy K 2008 Exploring the impact of size of training sets for the development of predictive QSAR models Chemometrics and Intelligent Laboratory Systems 90 1 31 42 doi 10 1016 j chemolab 2007 07 004 Put R Vander Heyden Y Oct 2007 Review on modelling aspects in reversed phase liquid chromatographic quantitative structure retention relationships Analytica Chimica Acta 602 2 164 72 doi 10 1016 j aca 2007 09 014 PMID 17933600 Pratim Roy P Paul S Mitra I Roy K 2009 On two novel parameters for validation of predictive QSAR models Molecules 14 5 1660 701 doi 10 3390 molecules14051660 PMC 6254296 PMID 19471190 Chirico N Gramatica P Sep 2011 Real external predictivity of QSAR models how to evaluate it Comparison of different validation criteria and proposal of using the concordance correlation coefficient Journal of Chemical Information and Modeling 51 9 2320 35 doi 10 1021 ci200211n PMID 21800825 Rouvray DH Bonchev D 1991 Chemical graph theory introduction and fundamentals Tunbridge Wells Kent England Abacus Press ISBN 978 0 85626 454 2 Fraczkiewicz R 2013 In Silico Prediction of Ionization In Reedijk J ed Reference Module in Chemistry Molecular Sciences and Chemical Engineering Reference Module in Chemistry Molecular Sciences and Chemical Engineering Online Vol 5 Amsterdam The Netherlands Elsevier doi 10 1016 B978 0 12 409547 2 02610 X ISBN 9780124095472 Freyhult EK Andersson K Gustafsson MG Apr 2003 Structural modeling extends QSAR analysis of antibody lysozyme interactions to 3D QSAR Biophysical Journal 84 4 2264 72 Bibcode 2003BpJ 84 2264F doi 10 1016 S0006 3495 03 75032 2 PMC 1302793 PMID 12668435 Timmerman H Todeschini R Consonni V Mannhold R Kubinyi H 2002 Handbook of Molecular Descriptors Weinheim Wiley VCH ISBN 978 3 527 29913 3 Duda RO Hart PW Stork DG 2001 Pattern classification Chichester John Wiley amp Sons ISBN 978 0 471 05669 0 Fioravanzo E Bassan A Pavan M Mostrag Szlichtyng A Worth A P 2012 04 01 Role of in silico genotoxicity tools in the regulatory assessment of pharmaceutical impurities SAR and QSAR in Environmental Research 23 3 4 257 277 doi 10 1080 1062936X 2012 657236 ISSN 1062 936X PMID 22369620 S2CID 2714861 ICH M7 Assessment and control of DNA reactive mutagenic impurities in pharmaceuticals to limit potential carcinogenic risk Scientific guideline 1 Lavecchia A Mar 2015 Machine learning approaches in drug discovery methods and applications Drug Discovery Today 20 3 318 31 doi 10 1016 j drudis 2014 10 012 PMID 25448759 Stalring JC Carlsson LA Almeida P Boyer S 2011 AZOrange High performance open source machine learning for QSAR modeling in a graphical programming environment Journal of Cheminformatics 3 28 doi 10 1186 1758 2946 3 28 PMC 3158423 PMID 21798025 Mauri Andrea Bertola Matteo 2022 Alvascience A New Software Suite for the QSAR Workflow Applied to the Blood Brain Barrier Permeability International Journal of Molecular Sciences 23 12882 12882 doi 10 3390 ijms232112882 PMC 9655980 PMID 36361669 Further reading EditSelassie CD 2003 History of Quantitative Structure Activity Relationships PDF In Abraham DJ ed Burger s medicinal Chemistry and Drug Discovery Vol 1 6th ed New York Wiley pp 1 48 ISBN 978 0 471 27401 8 Shityakov S Puskas I Roewer N Forster C Broscheit J 2014 Three dimensional quantitative structure activity relationship and docking studies in a series of anthocyanin derivatives as cytochrome P450 3A4 inhibitors Advances and Applications in Bioinformatics and Chemistry 7 11 21 doi 10 2147 AABC S56478 PMC 3970920 PMID 24741320 External links Edit The Cheminformatics and QSAR Society Retrieved 2009 05 11 The 3D QSAR Server Retrieved 2011 06 18 Verma Rajeshwar P Hansch Corwin 2007 Nature Protocols Development of QSAR models using C QSAR program Protocol Exchange doi 10 1038 nprot 2007 125 Archived from the original on 2007 05 01 Retrieved 2009 05 11 A regression program that has dual databases of over 21 000 QSAR models QSAR World Archived from the original on 2009 04 25 Retrieved 2009 05 11 A comprehensive web resource for QSAR modelers Chemoinformatics Tools Drug Theoretics and Cheminformatics Laboratory Multiscale Conceptual Model Figures for QSARs in Biological and Environmental Science Retrieved from https en wikipedia org w index php title Quantitative structure activity relationship amp oldid 1132758916, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.