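WEN Dongzhuo (温冬灼), ZHANG Zhi (张智). Genomic analysis of high temperature resistant Bacillus amyloliticus BA-DES4 [J]. Journal of Food Safety & Quality (食品安全质量检测学报), 2025, 16(6): 10-16 |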
耐高温解淀粉芽孢杆菌BA-DES4基因组分析 |
Genomic analysis of high temperature resistant Bacillus amyloliticus BA-DES4 |
Received: 2024-11-10 Revised: 2025-02-27 |
DOI: |
Chinese keywords (translated): Bacillus amyloliticus; protoplast; mutagenesis; cellulase; genome |
English keywords: Bacillus amyloliticus; cellulase; genome |
Funding: |
|
|
Abstract views: 48 |
Full-text downloads: 28 |
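Chinese abstract (translated): |
Objective To process, annotate and analyze the genomic data of the high temperature resistant Bacillus amyloliticus BA-DES4 using bioinformatics software and databases, and to functionally annotate the strain's characteristics at the gene level. Methods The predicted gene sequences were compared against the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) resources for metabolic and signaling pathways, and against the Clusters of Orthologous Groups of proteins (COG) and non-redundant protein (NR) databases, to obtain a gene function annotation table. The protein sequences of the predicted genes were aligned against the COG, KEGG and GO databases with BLASTP (basic local alignment search tool: protein, which searches a protein query against a protein sequence database) to predict gene annotation information and gene function. Results The database analysis showed a genome of 4188731 bases with a GC content of 46.18%, encoding 4445 protein genes with a total gene length of 3696380 bp. Functional annotation of the protein-coding genes showed that the most frequent COG category was G (carbohydrate transport and metabolism), followed by K (transcription), indicating that the COG-annotated protein-coding genes are mainly involved in basic cellular functions. In the GO annotation, biological process contained the greatest variety and number of genes; in the carbohydrate-active enzymes (CAZy) annotation, hydrolases accounted for the highest proportion; in the KEGG annotation, carbohydrate metabolism had the largest number of genes; and within environmental information, information transduction accounted for the largest share. The high temperature resistant Bacillus amyloliticus BA-DES4 carries 11 cellulase-encoding genes in total, and the enzymes they encode are β-glucosidase and endoglucanase. Conclusion This study processed, annotated and analyzed the genomic data of the high temperature resistant Bacillus amyloliticus BA-DES4 to further explore the strain's research potential, to better investigate the regulatory mechanism of cellulose production by the strain, and to provide a theoretical basis for subsequent experiments. |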
English abstract: |
Objective To process, annotate and analyze the genomic data of the high temperature resistant Bacillus amyloliticus BA-DES4 using bioinformatics software and databases, and to functionally annotate the strain characteristics at the gene level. Methods The predicted gene sequences were compared against the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) resources for metabolic and signaling pathways, and against the Clusters of Orthologous Groups of proteins (COG) and non-redundant protein (NR) databases, to obtain the gene function annotation table. The protein sequences of the predicted genes were aligned against the COG, KEGG and GO databases by BLASTP to predict gene annotation information and gene function. Results The database analysis showed that the genome comprised 4188731 bases with a GC content of 46.18%. There were 4445 protein-coding genes, with a total length of 3696380 bp. Functional annotation of the protein-coding genes showed that the most frequent COG category was G (carbohydrate transport and metabolism), followed by K (transcription), indicating that the COG-annotated protein-coding genes were mainly involved in basic cellular functions. In the GO annotation, the biological process category contained the greatest variety and number of genes. In the carbohydrate-active enzymes (CAZy) annotation, hydrolases were the most prevalent category. In the KEGG annotation, carbohydrate metabolism accounted for the highest number of genes. Within the environmental information category, signal transduction accounted for the largest share. In the high temperature resistant Bacillus amyloliticus BA-DES4, 11 genes were found to encode cellulases, and the enzymes they encode were β-glucosidase and endoglucanase. 
Conclusion In this study, the genomic data of Bacillus amyloliticus BA-DES4 were processed, annotated and analyzed in order to further explore the research potential of the strain, to better investigate the regulatory mechanism of cellulose production in Bacillus amyloliticus BA-DES4 and to provide a theoretical basis for subsequent experiments. |
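The genome totals reported in the Results can be combined into two derived figures that the abstract itself does not state: the fraction of the genome covered by protein-coding genes and the mean gene length. The sketch below is only illustrative arithmetic on the reported numbers. The function names are hypothetical, the coding fraction assumes the genes do not overlap, and `gc_content` shows the usual definition of GC content on a toy sequence rather than on the BA-DES4 genome.

```python
# Totals as reported in the abstract for strain BA-DES4.
GENOME_BASES = 4_188_731
PROTEIN_CODING_GENES = 4_445
TOTAL_GENE_LENGTH_BP = 3_696_380


def coding_fraction(total_gene_length_bp: int, genome_bases: int) -> float:
    """Fraction of the genome covered by genes (assumes no overlapping genes)."""
    return total_gene_length_bp / genome_bases


def mean_gene_length(total_gene_length_bp: int, gene_count: int) -> float:
    """Average gene length in bp."""
    return total_gene_length_bp / gene_count


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (case-insensitive)."""
    seq = sequence.upper()
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


if __name__ == "__main__":
    fraction = coding_fraction(TOTAL_GENE_LENGTH_BP, GENOME_BASES)
    mean_len = mean_gene_length(TOTAL_GENE_LENGTH_BP, PROTEIN_CODING_GENES)
    print(f"Coding fraction: {fraction:.1%}")
    print(f"Mean gene length: {mean_len:.0f} bp")
    print(f"GC content of toy sequence 'ATGC': {gc_content('ATGC'):.0%}")
```

Under these assumptions, roughly 88% of the genome is covered by protein-coding genes and the mean gene length is roughly 830 bp. These are derived estimates, not results reported by the authors.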
|
|
|