Publications
Identification of Diverse Cellulose Binding Domains using in silico Prioritisation and High-Throughput Screening
Abstract
In silico pre-screening methods are becoming an increasingly important step in making large scale enzyme screening experiments manageable, particularly in the sampling large protein sequence datasets for maximum diversity. Here, we develop a bioinformatics workflow that utilises automated gene sequence annotation, substrate prediction and structure prediction tools to effectively isolate representative protein sequences, thereby maximising the information gathered on the wider dataset from relatively few in vitro experiments. Included in this workflow is a new tool, ColabAlign, which performs pairwise structural alignments to construct a structure-informed dendrogram, from which representatives are selected based on clustering. The workflow was applied to the identification of cellulose-binding carbohydrate-binding modules (CBMs) suitable for enzyme immobilisation tag development. 47 sequentially and structurally diverse mCherry-CBM fusions were tested for binding against cellulose, chitin and spent coffee grounds (SCG) using a pulldown assay from cell lysate. We successfully identified 5 CBMs with significant binding towards commercial and waste cellulose support materials, suitable for further tag design work. This work also provides the first experimental evidence of chitin binding by one or more members of CBM6 and CBM46.
Product Used
Genes
Related Publications