Show simple item record

dc.contributor.authorSimón Vidal, Lorena
dc.contributor.authorGarcía Calvo, Oihane
dc.contributor.authorOteo, Uxue
dc.contributor.authorArrasate Gil, Sonia
dc.contributor.authorLete Expósito, María Esther
dc.contributor.authorSotomayor Anduiza, María Nuria
dc.contributor.authorGonzález Díaz, Humberto
dc.date.accessioned2023-11-23T11:18:31Z
dc.date.available2023-11-23T11:18:31Z
dc.date.issued2018-06-13
dc.identifier.citationJournal of Chemical Information and Modeling 58(7) : 1384-1396(2018)es_ES
dc.identifier.issn1549-9596 (print)
dc.identifier.issn1549-960X (electronic)
dc.identifier.urihttp://hdl.handle.net/10810/63124
dc.description.abstractMachine Learning (ML) algorithms are gaining importance in the processing of chemical information and modelling of chemical reactivity problems. In this work, we have developed a PTML model combining Perturbation-Theory (PT) and ML algorithms for predicting the yield of a given reaction. For this purpose, we have selected Parham cyclization, which is a general and powerful tool for the synthesis of heterocyclic and carbocyclic compounds. This reaction has both structural (substitution pattern on the substrate, internal electrophile, ring size, etc.) and operational variables (organolithium reagent, solvent, temperature, time, etc.), so predicting the effect of changes on substrate design (internal elelctrophile, halide, etc.) or reaction conditions on the yield is an important task that could help to optimize the reaction design. The PTML model developed uses PT operators to account for perturbations in experimental conditions and/or structural variables of all the molecules involved in a query reaction compared to a reaction of reference. Thus, a dataset of >100 reactions has been collected for different substrates and internal electrophiles, under different reaction conditions, with a wide range of yields (0 – 98%). The best PTML model found using General Linear Regression (GLR) has R = 0.88 in training and R = 0.83 in external validation series for 10000 pairs of query and reference reactions. The PTML model has a final R = 0.95 for all reactions using multiple reactions of reference. We also report a comparative study of linear vs. non-linear PTML models based on Artificial Neural Networks (ANN) algorithms. PTML-ANN models (LNN, MLP, RBF) with R ≈ 0.1 - 0.8 do not outperform the first PMTL model. This result confirms the validity of the linearity of the model. Next, we carried out an experimental and theoretical study of non-reported Parham reactions to illustrate the practical use of the PTML model. A 500000-point simulation and a Hammett analysis of the reactivity space of Parham reactions are also reportedes_ES
dc.description.sponsorshipMinisterio de Economía y Competitividad (CTQ2016-74881-P) / Ministerio de Economía y Competitividad (CTQ2013-41229-P) / Gobierno Vasco (IT1045-16)es_ES
dc.language.isoenges_ES
dc.publisherAmerican Chemical Societyes_ES
dc.relationinfo:eu-repo/grantAgreement/MINECO/CTQ2016-74881-Pes_ES
dc.relationinfo:eu-repo/grantAgreement/MINECO/CTQ2013-41229-Pes_ES
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.subjectcheminformaticses_ES
dc.subjectmachine learninges_ES
dc.subjectLithiumes_ES
dc.subjectmetalationses_ES
dc.subjectcyclizationes_ES
dc.subjectorganolithium compoundes_ES
dc.titlePerturbation-Theory and Machine Learning (PTML). Model for High-Throughput Screening of Parham Reactions: Experimental and Theoretical Studieses_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.holder© 2018, American Chemical Societyes_ES
dc.relation.publisherversionhttps://pubs.acs.org/doi/10.1021/acs.jcim.8b00286es_ES
dc.relation.publisherversionhttps://doi.org/10.1021/acs.jcim.8b00286es_ES
dc.identifier.doi10.1021/acs.jcim.8b00286
dc.departamentoesQuímica orgánica IIes_ES
dc.departamentoeuKimika organikoa IIes_ES


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record