Show simple item record

dc.contributor.authorGilman, J
dc.contributor.authorSingleton, C
dc.contributor.authorTennant, RK
dc.contributor.authorJames, P
dc.contributor.authorHoward, TP
dc.contributor.authorLux, T
dc.contributor.authorParker, DA
dc.contributor.authorLove, J
dc.date.accessioned2019-05-21T09:56:26Z
dc.date.issued2019-04-17
dc.description.abstractWell-characterized promoter collections for synthetic biology applications are not always available in industrially relevant hosts. We developed a broadly applicable method for promoter identification in atypical microbial hosts that requires no a priori understanding of cis-regulatory element structure. This novel approach combines bioinformatic filtering with rapid empirical characterization to expand the promoter toolkit and uses machine learning to improve the understanding of the relationship between DNA sequence and function. Here, we apply the method in Geobacillus thermoglucosidasius, a thermophilic organism with high potential as a synthetic biology chassis for industrial applications. Bioinformatic screening of G. kaustophilus, G. stearothermophilus, G. thermodenitrificans, and G. thermoglucosidasius resulted in the identification of 636 100 bp putative promoters, encompassing the genome-wide design space and lacking known transcription factor binding sites. Eighty of these sequences were characterized in vivo, and activities covered a 2-log range of predictable expression levels. Seven sequences were shown to function consistently regardless of the downstream coding sequence. Partition modeling identified sequence positions upstream of the canonical -35 and -10 consensus motifs that were predicted to strongly influence regulatory activity in Geobacillus, and artificial neural network and partial least squares regression models were derived to assess if there were a simple, forward, quantitative method for in silico prediction of promoter function. However, the models were insufficiently general to predict pre hoc promoter activity in vivo, most probably as a result of the relatively small size of the training data set compared to the size of the modeled design space.en_GB
dc.identifier.citationVol. 8 (5), pp. 1175 - 1186en_GB
dc.identifier.doi10.1021/acssynbio.9b00061
dc.identifier.urihttp://hdl.handle.net/10871/37165
dc.language.isoenen_GB
dc.publisherAmerican Chemical Societyen_GB
dc.relation.urlhttps://www.ncbi.nlm.nih.gov/pubmed/30995831en_GB
dc.rights.embargoreasonUnder embargo until 17 April 2020 in compliance withy publisher policyen_GB
dc.rights© 2019 American Chemical Societyen_GB
dc.subjectGeobacillusen_GB
dc.subjectindustrial chassisen_GB
dc.subjectmodelingen_GB
dc.subjectpromoter designen_GB
dc.titleRapid, Heuristic Discovery and Design of Promoter Collections in Non-Model Microbes for Industrial Applicationsen_GB
dc.typeArticleen_GB
dc.date.available2019-05-21T09:56:26Z
exeter.place-of-publicationUnited Statesen_GB
dc.descriptionThis is the author accepted manusript. The final version is available from American Chemical Society via the DOI in this recorden_GB
dc.descriptionAccession Codes: The sequence data for the four Geobacillus spp. used in this study have been submitted to the NCBI Sequence Read Archive and are available under the accession number PRJNA521450.en_GB
dc.identifier.eissn2161-5063
dc.identifier.journalACS Synthetic Biologyen_GB
dc.rights.urihttp://www.rioxx.net/licenses/all-rights-reserveden_GB
dcterms.dateAccepted2019-04-17
rioxxterms.versionAMen_GB
rioxxterms.licenseref.startdate2019-04-17
rioxxterms.typeJournal Article/Reviewen_GB
refterms.dateFCD2019-05-21T09:54:17Z
refterms.versionFCDAM
refterms.panelAen_GB


Files in this item

This item appears in the following Collection(s)

Show simple item record