Mycobacterium sp. YC-RL4 is capable of utilizing a broad range of phthalic acid esters (PAEs) as sole source of carbon and energy for growth. The preliminary studies demonstrated its high degrading efficiency and good performance during the bioprocess with environmental samples. Here, we present the complete genome of Mycobacterium sp. YC-RL4, which consists of one circular chromosome (5,801,417bp) and one plasmid (252,568bp). The genomic analysis and gene annotation were performed and many potential genes responsible for the biodegradation of PAEs were identified from the genome. These results may advance the investigation of bioremediation of PAEs-contaminated environments by strain YC-RL4.
Plasticizers have been used as additives in plastic or polymeric materials to improve their flexibility, transparency, durability, and longevity. PAEs are among the most widely used plasticizers. These PAEs do not physically bind to the polymer matrix of products and can easily migrate to the environment. The detection of PAEs in soil, water, and air has been widely reported. The toxicity evaluation of PAEs indicated their developmental and reproductive toxicity to human and animals.1,2 PAEs are listed as top-priority environmental pollutants by the United States Environment Protection Agency (US EPA) and the European Union.3 Great efforts have been made to eliminate PAEs from the environment and biodegradation is among the most widely investigated methods.
Various PAEs-degrading bacteria have isolated and characterized, including genera Gordonia, Pseudomonas, Burkholderia, and Bacillus.4–8 Although PAEs-degrading isolates and degrading pathway were widely reported, the knowledge of degrading related molecular mechanism is limited. Previously, we isolated Mycobacterium sp. YC-RL4 from petroleum contaminated soil, which was capable of utilizing several kinds of PAEs as sole source of carbon and energy for growth.9 Here, we report the complete genome sequence of Mycobacterium sp. YC-RL4 and we hope the genomic information would advance our understanding of PAEs degrading mechanism, which may provide new gene resources for biotechnology and gene engineering.
The genomic DNA of strain YC-RL4 was extracted using the Bacterial Genomic DNA Extraction kit (Takara, Japan), and the quality and quantity were evaluated by Bioanalyzer 2100 (Agilent) before sequencing. The genome of strain YC-RL4 was sequenced using Single Molecular, Real-Time (SMRT) technology with the PacBio RS II platform. After quality control, a total length of 1,550,468,925bp data was obtained with 256-fold average coverage. All reads were de novo assembled using MHAP (v8.0).10,11 The assembled genome was annotated by NCBI Prokaryotic Genome Annotation Pipeline.12 Protein coding sequences, tRNA and rRNA were identified.
The generated genome sequences revealed strain YC-RL4 comprises 6,053,985bp, which were finally assembled into one circular chromosome (5,801,417bp) with an average G+C content of 67.48% and one plasmid (pMYC01, 252,568bp) with a average G+C content of 65.7%. In total, 5781 genes were predicted, including 5634 protein coding sequences, 47 tRNAs, and 6 rRNA genes. All the genomic information and annotated results were presented in Table 1.
Microorganisms can evolve different strategies to fit the environment. We analyzed the potential biodegradation related genes. Several genes and gene clusters located in the genome and plasmid that may contribute to the degradation of PAEs were identified. The biodegradation of PAEs was always initialed by hydrolyzing of two ester bonds (Ren et al., 2016). The generated phthalic acid (PA) was further utilized by ring cleavage. One esterase gene in the genome sequence responsible for the hydrolyzation of monoalkyl phthalates (MAPs) to PA (MAPs was the intermediate of PAEs catabolism) was identified.13,14 In addition, complete benzoate metabolism pathway was identified and located in genome sequence, which may be involved in the metabolism of PA. Some aromatic compounds catabolism related genes were also annotated in the genome. Meanwhile, annotation of pathway was performed by assigning predicted genes to Kyoto Encyclopedia of Genes and Genomes (KEGG) database15 and 2206 CDSs were involved in 117pathways. The genome information of strain YC-RL4 would be useful for elucidating the molecular mechanism of PAEs metabolism and provides new approach for the bioremediation of PAEs contaminated environments.
Strain and nucleotide sequence accession numbersThis strain has been deposited in China General Microbiological Culture Collection Center (CGMCC) with deposit number as CGMCC No. 10993. The chromosome and plasmid sequences of Mycobacterium sp. YC-RL4 were deposited in GenBank under the accession numbers CP015596 and CP015597, respectively.
Conflicts of interestThe authors declare no conflicts of interest.
This work was supported by the National Natural Science Foundation of China (31540067, 31170119), and Basic Research Fund of CAAS (0042014006, 0042012003 and 0042011006).