合成生物学 ›› 2021, Vol. 2 ›› Issue (5): 815-825.DOI: 10.12211/2096-8280.2021-005

• 研究论文 • 上一篇    下一篇

虫生真菌非核糖体多肽活性产物生物合成潜力预测

张礼文1, MOLNÁR István2, 徐玉泉1   

  1. 1.中国农业科学院生物技术研究所,北京 100081
    2.美国亚利桑那大学西南天然产物研究中心,亚利桑那州 图森 85706,美国
  • 收稿日期:2021-01-12 修回日期:2021-04-16 出版日期:2021-11-19 发布日期:2021-11-19
  • 通讯作者: MOLNáR István,徐玉泉
  • 作者简介:张礼文(1985—),女,副研究员。研究方向为从真菌中挖掘具有农用生物活性的次生代谢产物及其合成基因元件,再用于组合生物合成新骨架、改进活性的化合物。E-mail:zhangliwen@caas.cn
    István Molnár(1963—),男,研究员,博士生导师。研究方向为真菌天然产物生物合成与组合合成生物学。E-mail:imolnar@arizona.edu
    徐玉泉(1971—),男,研究员,博士生导师。研究方向为发现真菌合成的新型生物活性天然产物,阐明其生物合成机制,利用合成生物学的方法对目标产物进行高效生物合成及结构修饰。E-mail:xuyuquan@caas.cn
  • 基金资助:
    国家自然科学基金(32070064);中国农业科学院基本科研业务费专项(Y2020XK20);中国农业科学院科技创新工程

Potential biosynthesis of nonribosomal peptides by hypocrealean entomopathogenic fungi

Liwen ZHANG1, István MOLNÀR2, Yuquan XU1   

  1. 1.Biotechnology Research Institute,Chinese Academy of Agricultural Sciences,Beijing 100081,China
    2.Southwest Center for Natural Products Research,University of Arizona,Tucson,AZ 85706,USA
  • Received:2021-01-12 Revised:2021-04-16 Online:2021-11-19 Published:2021-11-19
  • Contact: István MOLNàR, Yuquan XU

摘要:

肉座菌目虫生真菌合成的非核糖体多肽类天然产物具有抗菌、杀虫、抗癌、调节免疫等生物活性,在临床和农业等领域有重要应用价值。近年来,随着真菌基因组测序数量的迅速增加、注释和分析工具的不断进步,人们发现真菌基因组中存在大量产物未知的天然产物合成基因。准确有效地预测这些基因的功能,筛选最有潜力合成新颖天然产物的基因簇,可以提高天然产物挖掘的效率。本研究选取40株肉座菌目虫生真菌基因组,系统分析了编码非核糖体多肽合成酶的基因及基因簇。基于隐马尔可夫模型,预测了445个模块型非核糖体多肽合成酶和1243个类非核糖体多肽合成酶;通过提取腺苷酰化结构域,构建序列相似性网络,以已知功能的非核糖体多肽合成酶作为标签,利用马尔可夫聚类算法,分析了非核糖体多肽产物的主要类别,发现了可能合成线性多肽、环缩肽、脂多肽、生物碱等新型结构化合物的基因簇。研究结果不仅揭示了肉座菌目虫生真菌合成非核糖体多肽活性产物的巨大潜力,而且为通过激活沉默基因簇挖掘新产物、利用合成生物学手段改造合成途径提供参考。

关键词: 肉座菌目虫生真菌, 非核糖体多肽合成酶, 非核糖体多肽, 生物合成基因簇, 序列相似性网络分析

Abstract:

Nonribosomal peptide natural products synthesized by Hypocrealean entomopathogenic fungi have antibacterial, insecticidal, anticancer, immunomodulatory and other biological activities, with high potential for application in clinical or agricultural fields. These bioactive compounds are synthesized by nonribosomal peptide synthetases (NRPSs) and tailored by additional enzymes that are encoded by clustered genes. In addition to the 20 amino acids, non-canonical amino acids as well as α-hydroxy acids can also be incorporated into nonribosomal peptides as structural units, and these, together with further modifications, empower an almost unlimited structural diversity. The rapid increase in the number of sequenced fungal genomes shows that there exists a large number of nonribosomal peptide biosynthetic gene clusters with unknown functions. Accurate and effective prediction of the functions of these "orphan" biosynthetic gene clusters can help to select those clusters that have potential to synthesize novel natural products, and increase the efficiency of natural product genome mining. In this study, we systematically analyzed the genes encoding NRPSs and their gene clusters in the genomes of 40 strains from Hypocrealean entomopathogenic fungi. Genes encoding NRPSs were predicted and categorized based on the hidden Markov models for adenylation, condensation and thiolation domains to reveal 445 modular NRPSs and 1243 NRPS-like proteins. A sequence similarity network based on the amino acid sequences of adenylation domains of these synthetases was constructed. Using adenylation domains from known NRPSs as references, we analyzed the main categories of predicted nonribosomal peptide products using the Markov clustering algorithm. We identified several biosynthetic gene clusters that potentially yield known bioactive compounds or their congeners. In addition, biosynthetic gene clusters that may be able to produce new bioactive compounds belonging to the linear peptide, cyclic peptide, lipopeptide, and alkaloid structural classes were also discovered. The results reveal a great potential of Hypocrealean entomopathogenic fungi to synthesize nonribosomal peptides, and provide insight for genome mining to identify new products by activating silent gene clusters or modifying biosynthetic pathways by synthetic biology methods.

Key words: hypocrealean entomopathogenic fungi, nonribosomal peptide synthetase (NRPS), nonribosomal peptide (NRP), biosynthetic gene clusters (BGCs), sequence similarity network analysis (SSN)

中图分类号: