Other Sites:
Robert J. Robbins is a biologist, an educator, a science administrator, a publisher, an information technologist, and an IT leader and manager who specializes in advancing biomedical knowledge and supporting education through the application of information technology. More About: RJR | OUR TEAM | OUR SERVICES | THIS WEBSITE
RJR: Recommended Bibliography 18 Mar 2024 at 01:34 Created:
Pangenome
Although the enforced stability of genomic content is ubiquitous among MCEs, the opposite is proving to be the case among prokaryotes, which exhibit remarkable and adaptive plasticity of genomic content. Early bacterial whole-genome sequencing efforts discovered that whenever a particular "species" was re-sequenced, new genes were found that had not been detected earlier — entirely new genes, not merely new alleles. This led to the concepts of the bacterial core-genome, the set of genes found in all members of a particular "species", and the flex-genome, the set of genes found in some, but not all members of the "species". Together these make up the species' pan-genome.
Created with PubMed® Query: ( pangenome OR "pan-genome" OR "pan genome" ) NOT pmcbook NOT ispreviousversion
Citations The Papers (from PubMed®)
RevDate: 2024-03-16
Harnessing the predicted maize pan-interactome for putative gene function prediction and prioritization of candidate genes for important traits.
G3 (Bethesda, Md.) pii:7630293 [Epub ahead of print].
The recent assembly and annotation of the 26 maize nested association mapping (NAM) population founder inbreds have enabled large-scale pan-genomic comparative studies. These studies have expanded our understanding of agronomically important traits by integrating pan-transcriptomic data with trait-specific gene candidates from previous association mapping results. In contrast to the availability of pan-transcriptomic data, obtaining reliable protein-protein interaction (PPI) data has remained a challenge due to its high cost and complexity. We generated predicted PPI networks for each of the 26 genomes using the established STRING database. The individual genome-interactomes were then integrated to generate core- and pan-interactomes. We deployed the PPI clustering algorithm ClusterONE to identify numerous PPI clusters that were functionally annotated using gene ontology (GO) functional enrichment, demonstrating a diverse range of enriched GO terms across different clusters. Additional cluster annotations were generated by integrating gene co-expression data and gene description annotations, providing additional useful information. We show that the functionally annotated PPI clusters establish a useful framework for protein function prediction and prioritization of candidate genes of interest. Our study not only provides a comprehensive resource of predicted PPI networks for 26 maize genomes, but also offers annotated interactome clusters for predicting protein functions and prioritizing gene candidates. The source code for the Python implementation of the analysis workflow and a standalone web application for accessing the analysis results are available at https://github.com/eporetsky/PanPPI.
Additional Links: PMID-38492232
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38492232,
year = {2024},
author = {Poretsky, E and Cagirici, HB and Andorf, CM and Sen, TZ},
title = {Harnessing the predicted maize pan-interactome for putative gene function prediction and prioritization of candidate genes for important traits.},
journal = {G3 (Bethesda, Md.)},
volume = {},
number = {},
pages = {},
doi = {10.1093/g3journal/jkae059},
pmid = {38492232},
issn = {2160-1836},
abstract = {The recent assembly and annotation of the 26 maize nested association mapping (NAM) population founder inbreds have enabled large-scale pan-genomic comparative studies. These studies have expanded our understanding of agronomically important traits by integrating pan-transcriptomic data with trait-specific gene candidates from previous association mapping results. In contrast to the availability of pan-transcriptomic data, obtaining reliable protein-protein interaction (PPI) data has remained a challenge due to its high cost and complexity. We generated predicted PPI networks for each of the 26 genomes using the established STRING database. The individual genome-interactomes were then integrated to generate core- and pan-interactomes. We deployed the PPI clustering algorithm ClusterONE to identify numerous PPI clusters that were functionally annotated using gene ontology (GO) functional enrichment, demonstrating a diverse range of enriched GO terms across different clusters. Additional cluster annotations were generated by integrating gene co-expression data and gene description annotations, providing additional useful information. We show that the functionally annotated PPI clusters establish a useful framework for protein function prediction and prioritization of candidate genes of interest. Our study not only provides a comprehensive resource of predicted PPI networks for 26 maize genomes, but also offers annotated interactome clusters for predicting protein functions and prioritizing gene candidates. The source code for the Python implementation of the analysis workflow and a standalone web application for accessing the analysis results are available at https://github.com/eporetsky/PanPPI.},
}
RevDate: 2024-03-16
Detection of colinear blocks and synteny and evolutionary analyses based on utilization of MCScanX.
Nature protocols [Epub ahead of print].
As different taxa evolve, gene order often changes slowly enough that chromosomal 'blocks' with conserved gene orders (synteny) are discernible. The MCScanX toolkit (https://github.com/wyp1125/MCScanX) was published in 2012 as freely available software for the detection of such 'colinear blocks' and subsequent synteny and evolutionary analyses based on genome-wide gene location and protein sequence information. Owing to its simplicity and high efficiency for colinear block detection, MCScanX provides a powerful tool for conducting diverse synteny and evolutionary analyses. Moreover, the detection of colinear blocks has been embraced as an integral step for pangenome graph construction. Here, new application trends of MCScanX are explored, striving to better connect this increasingly used tool to other tools and accelerate insight generation from exponentially growing sequence data. We provide a detailed protocol that covers how to install MCScanX on diverse platforms, tune parameters, prepare input files from data from the National Center for Biotechnology Information, run MCScanX and its visualization and evolutionary analysis tools, and connect MCScanX with external tools, including MCScanX-transposed, Circos and SynVisio. This protocol is easily implemented by users with minimal computational background and is adaptable to new data of interest to them. The data and utility programs for this protocol can be obtained from http://bdx-consulting.com/mcscanx-protocol .
Additional Links: PMID-38491145
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38491145,
year = {2024},
author = {Wang, Y and Tang, H and Wang, X and Sun, Y and Joseph, PV and Paterson, AH},
title = {Detection of colinear blocks and synteny and evolutionary analyses based on utilization of MCScanX.},
journal = {Nature protocols},
volume = {},
number = {},
pages = {},
pmid = {38491145},
issn = {1750-2799},
abstract = {As different taxa evolve, gene order often changes slowly enough that chromosomal 'blocks' with conserved gene orders (synteny) are discernible. The MCScanX toolkit (https://github.com/wyp1125/MCScanX) was published in 2012 as freely available software for the detection of such 'colinear blocks' and subsequent synteny and evolutionary analyses based on genome-wide gene location and protein sequence information. Owing to its simplicity and high efficiency for colinear block detection, MCScanX provides a powerful tool for conducting diverse synteny and evolutionary analyses. Moreover, the detection of colinear blocks has been embraced as an integral step for pangenome graph construction. Here, new application trends of MCScanX are explored, striving to better connect this increasingly used tool to other tools and accelerate insight generation from exponentially growing sequence data. We provide a detailed protocol that covers how to install MCScanX on diverse platforms, tune parameters, prepare input files from data from the National Center for Biotechnology Information, run MCScanX and its visualization and evolutionary analysis tools, and connect MCScanX with external tools, including MCScanX-transposed, Circos and SynVisio. This protocol is easily implemented by users with minimal computational background and is adaptable to new data of interest to them. The data and utility programs for this protocol can be obtained from http://bdx-consulting.com/mcscanx-protocol .},
}
RevDate: 2024-03-15
Microbial biofilms on macroalgae harbour diverse integron gene cassettes.
Microbiology (Reading, England), 170(3):.
Integrons are genetic platforms that capture, rearrange and express mobile modules called gene cassettes. The best characterized gene cassettes encode antibiotic resistance, but the function of most integron gene cassettes remains unknown. Functional predictions suggest that many gene cassettes could encode proteins that facilitate interactions with other cells and with the extracellular environment. Because cell interactions are essential for biofilm stability, we sequenced gene cassettes from biofilms growing on the surface of the marine macroalgae Ulva australis and Sargassum linearifolium. Algal samples were obtained from coastal rock platforms around Sydney, Australia, using seawater as a control. We demonstrated that integrons in microbial biofilms did not sample genes randomly from the surrounding seawater, but harboured specific functions that potentially provided an adaptive advantage to both the bacterial cells in biofilm communities and their macroalgal host. Further, integron gene cassettes had a well-defined spatial distribution, suggesting that each bacterial biofilm acquired these genetic elements via sampling from a large but localized pool of gene cassettes. These findings suggest two forms of filtering: a selective acquisition of different integron-containing bacterial species into the distinct biofilms on Ulva and Sargassum surfaces, and a selective retention of unique populations of gene cassettes at each sampling location.
Additional Links: PMID-38488860
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38488860,
year = {2024},
author = {Freddi, S and Rajabal, V and Tetu, SG and Gillings, MR and Penesyan, A},
title = {Microbial biofilms on macroalgae harbour diverse integron gene cassettes.},
journal = {Microbiology (Reading, England)},
volume = {170},
number = {3},
pages = {},
doi = {10.1099/mic.0.001446},
pmid = {38488860},
issn = {1465-2080},
abstract = {Integrons are genetic platforms that capture, rearrange and express mobile modules called gene cassettes. The best characterized gene cassettes encode antibiotic resistance, but the function of most integron gene cassettes remains unknown. Functional predictions suggest that many gene cassettes could encode proteins that facilitate interactions with other cells and with the extracellular environment. Because cell interactions are essential for biofilm stability, we sequenced gene cassettes from biofilms growing on the surface of the marine macroalgae Ulva australis and Sargassum linearifolium. Algal samples were obtained from coastal rock platforms around Sydney, Australia, using seawater as a control. We demonstrated that integrons in microbial biofilms did not sample genes randomly from the surrounding seawater, but harboured specific functions that potentially provided an adaptive advantage to both the bacterial cells in biofilm communities and their macroalgal host. Further, integron gene cassettes had a well-defined spatial distribution, suggesting that each bacterial biofilm acquired these genetic elements via sampling from a large but localized pool of gene cassettes. These findings suggest two forms of filtering: a selective acquisition of different integron-containing bacterial species into the distinct biofilms on Ulva and Sargassum surfaces, and a selective retention of unique populations of gene cassettes at each sampling location.},
}
RevDate: 2024-03-15
Annotation of 2,507 Saccharomyces cerevisiae genomes.
Microbiology spectrum [Epub ahead of print].
Saccharomyces cerevisiae (baker's yeast, budding yeast) is one of the most important model organisms for biological research and is a crucial microorganism in industry. Currently, a huge number of Saccharomyces cerevisiae genome sequences are available at the public domain. However, these genomes are distributed at different websites and a large number of them are released without annotation information. To provide one complete annotated genome data resource, we collected 2,507 Saccharomyces cerevisiae genome assemblies and re-annotated 2,506 assemblies using a custom annotation pipeline, producing a total of 15,407,164 protein-coding gene models. With a custom pipeline, all these gene sequences were clustered into families. A total of 1,506 single-copy genes were selected as marker genes, which were then used to evaluate the genome completeness and base qualities of all assemblies. Pangenomic analyses were performed based on a selected subset of 847 medium-high-quality genomes. Statistical comparisons revealed a number of gene families showing copy number variations among different organism sources. To the authors' knowledge, this study represents the largest genome annotation project of S. cerevisiae so far, providing rich genomic resources for the future studies of the model organism S. cerevisiae and its relatives.IMPORTANCESaccharomyces cerevisiae (baker's yeast, budding yeast) is one of the most important model organisms for biological research and is a crucial microorganism in industry. Though a huge number of Saccharomyces cerevisiae genome sequences are available at the public domain, these genomes are distributed at different websites and most are released without annotation, hindering the efficient reuse of these genome resources. Here, we collected 2,507 genomes for Saccharomyces cerevisiae, performed genome annotation, and evaluated the genome qualities. All the obtained data have been deposited at public repositories and are freely accessible to the community. This study represents the largest genome annotation project of S. cerevisiae so far, providing one complete annotated genome data set for S. cerevisiae, an important workhorse for fundamental biology, biotechnology, and industry.
Additional Links: PMID-38488392
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38488392,
year = {2024},
author = {Wang, M and Li, X and Liu, X and Hou, X and He, Y and Yu, J-H and Hu, S and Yin, H and Xie, B-B},
title = {Annotation of 2,507 Saccharomyces cerevisiae genomes.},
journal = {Microbiology spectrum},
volume = {},
number = {},
pages = {e0358223},
doi = {10.1128/spectrum.03582-23},
pmid = {38488392},
issn = {2165-0497},
abstract = {Saccharomyces cerevisiae (baker's yeast, budding yeast) is one of the most important model organisms for biological research and is a crucial microorganism in industry. Currently, a huge number of Saccharomyces cerevisiae genome sequences are available at the public domain. However, these genomes are distributed at different websites and a large number of them are released without annotation information. To provide one complete annotated genome data resource, we collected 2,507 Saccharomyces cerevisiae genome assemblies and re-annotated 2,506 assemblies using a custom annotation pipeline, producing a total of 15,407,164 protein-coding gene models. With a custom pipeline, all these gene sequences were clustered into families. A total of 1,506 single-copy genes were selected as marker genes, which were then used to evaluate the genome completeness and base qualities of all assemblies. Pangenomic analyses were performed based on a selected subset of 847 medium-high-quality genomes. Statistical comparisons revealed a number of gene families showing copy number variations among different organism sources. To the authors' knowledge, this study represents the largest genome annotation project of S. cerevisiae so far, providing rich genomic resources for the future studies of the model organism S. cerevisiae and its relatives.IMPORTANCESaccharomyces cerevisiae (baker's yeast, budding yeast) is one of the most important model organisms for biological research and is a crucial microorganism in industry. Though a huge number of Saccharomyces cerevisiae genome sequences are available at the public domain, these genomes are distributed at different websites and most are released without annotation, hindering the efficient reuse of these genome resources. Here, we collected 2,507 genomes for Saccharomyces cerevisiae, performed genome annotation, and evaluated the genome qualities. All the obtained data have been deposited at public repositories and are freely accessible to the community. This study represents the largest genome annotation project of S. cerevisiae so far, providing one complete annotated genome data set for S. cerevisiae, an important workhorse for fundamental biology, biotechnology, and industry.},
}
RevDate: 2024-03-15
Spatial ecology of Haemophilus and Aggregatibacter in the human oral cavity.
Microbiology spectrum [Epub ahead of print].
UNLABELLED: Haemophilus and Aggregatibacter are two of the most common bacterial genera in the human oral cavity, encompassing both commensals and pathogens of substantial ecological and medical significance. In this study, we conducted a metapangenomic analysis of oral Haemophilus and Aggregatibacter species to uncover genomic diversity, phylogenetic relationships, and habitat specialization within the human oral cavity. Using three metrics-pangenomic gene content, phylogenomics, and average nucleotide identity (ANI)-we first identified distinct species and sub-species groups among these genera. Mapping of metagenomic reads then revealed clear patterns of habitat specialization, such as Aggregatibacter species predominantly in dental plaque, a distinctive Haemophilus parainfluenzae sub-species group on the tongue dorsum, and H. sp. HMT-036 predominantly in keratinized gingiva and buccal mucosa. In addition, we found that supragingival plaque samples contained predominantly only one out of the three taxa, H. parainfluenzae, Aggregatibacter aphrophilus, and A. sp. HMT-458, suggesting independent niches or a competitive relationship. Functional analyses revealed the presence of key metabolic genes, such as oxaloacetate decarboxylase, correlated with habitat specialization, suggesting metabolic versatility as a driving force. Additionally, heme synthesis distinguishes H. sp. HMT-036 from closely related Haemophilus haemolyticus, suggesting that the availability of micronutrients, particularly iron, was important in the evolutionary ecology of these species. Overall, our study exemplifies the power of metapangenomics to identify factors that may affect ecological interactions within microbial communities, including genomic diversity, habitat specialization, and metabolic versatility.
IMPORTANCE: Understanding the microbial ecology of the mouth is essential for comprehending human physiology. This study employs metapangenomics to reveal that various Haemophilus and Aggregatibacter species exhibit distinct ecological preferences within the oral cavity of healthy individuals, thereby supporting the site-specialist hypothesis. Additionally, it was observed that the gene pool of different Haemophilus species correlates with their ecological niches. These findings shed light on the significance of key metabolic functions in shaping microbial distribution patterns and interspecies interactions in the oral ecosystem.
Additional Links: PMID-38488280
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38488280,
year = {2024},
author = {Giacomini, JJ and Torres-Morales, J and Tang, J and Dewhirst, FE and Borisy, GG and Mark Welch, JL},
title = {Spatial ecology of Haemophilus and Aggregatibacter in the human oral cavity.},
journal = {Microbiology spectrum},
volume = {},
number = {},
pages = {e0401723},
doi = {10.1128/spectrum.04017-23},
pmid = {38488280},
issn = {2165-0497},
abstract = {UNLABELLED: Haemophilus and Aggregatibacter are two of the most common bacterial genera in the human oral cavity, encompassing both commensals and pathogens of substantial ecological and medical significance. In this study, we conducted a metapangenomic analysis of oral Haemophilus and Aggregatibacter species to uncover genomic diversity, phylogenetic relationships, and habitat specialization within the human oral cavity. Using three metrics-pangenomic gene content, phylogenomics, and average nucleotide identity (ANI)-we first identified distinct species and sub-species groups among these genera. Mapping of metagenomic reads then revealed clear patterns of habitat specialization, such as Aggregatibacter species predominantly in dental plaque, a distinctive Haemophilus parainfluenzae sub-species group on the tongue dorsum, and H. sp. HMT-036 predominantly in keratinized gingiva and buccal mucosa. In addition, we found that supragingival plaque samples contained predominantly only one out of the three taxa, H. parainfluenzae, Aggregatibacter aphrophilus, and A. sp. HMT-458, suggesting independent niches or a competitive relationship. Functional analyses revealed the presence of key metabolic genes, such as oxaloacetate decarboxylase, correlated with habitat specialization, suggesting metabolic versatility as a driving force. Additionally, heme synthesis distinguishes H. sp. HMT-036 from closely related Haemophilus haemolyticus, suggesting that the availability of micronutrients, particularly iron, was important in the evolutionary ecology of these species. Overall, our study exemplifies the power of metapangenomics to identify factors that may affect ecological interactions within microbial communities, including genomic diversity, habitat specialization, and metabolic versatility.
IMPORTANCE: Understanding the microbial ecology of the mouth is essential for comprehending human physiology. This study employs metapangenomics to reveal that various Haemophilus and Aggregatibacter species exhibit distinct ecological preferences within the oral cavity of healthy individuals, thereby supporting the site-specialist hypothesis. Additionally, it was observed that the gene pool of different Haemophilus species correlates with their ecological niches. These findings shed light on the significance of key metabolic functions in shaping microbial distribution patterns and interspecies interactions in the oral ecosystem.},
}
RevDate: 2024-03-15
Genetic and technological diversity of Streptococcus thermophilus isolated from the Saint-Nectaire PDO cheese-producing area.
Frontiers in microbiology, 14:1245510.
Streptococcus thermophilus is of major importance for cheese manufacturing to ensure rapid acidification; however, studies indicate that intensive use of commercial strains leads to the loss of typical characteristics of the products. To strengthen the link between the product and its geographical area and improve the sensory qualities of cheeses, cheese-producing protected designations of origin (PDO) are increasingly interested in the development of specific autochthonous starter cultures. The present study is therefore investigating the genetic and functional diversity of S. thermophilus strains isolated from a local cheese-producing PDO area. Putative S. thermophilus isolates were isolated and identified from milk collected in the Saint-Nectaire cheese-producing PDO area and from commercial starters. Whole genomes of isolates were sequenced, and a comparative analysis based on their pan-genome was carried out. Important functional properties were studied, including acidifying and proteolytic activities. Twenty-two isolates representative of the diversity of the geographical area and four commercial strains were selected for comparison. The resulting phylogenetic trees do not correspond to the geographical distribution of isolates. The clustering based on the pan-genome analysis indicates that isolates are divided into five distinct groups. A Kyoto Encyclopedia of Genes and Genomes (KEGG) functional annotation of the accessory genes indicates that the accessory gene contents of isolates are involved in different functional categories. High variability in acidifying activities and less diversity in proteolytic activities were also observed. These results indicate that high genetic and functional variabilities of the species S. thermophilus may arise from a small (1,800 km[2]) geographical area and may be exploited to meet demand for use as autochthonous starters.
Additional Links: PMID-38487210
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38487210,
year = {2023},
author = {Grizon, A and Theil, S and Callon, C and Gerber, P and Helinck, S and Dugat-Bony, E and Bonnarme, P and Chassard, C},
title = {Genetic and technological diversity of Streptococcus thermophilus isolated from the Saint-Nectaire PDO cheese-producing area.},
journal = {Frontiers in microbiology},
volume = {14},
number = {},
pages = {1245510},
doi = {10.3389/fmicb.2023.1245510},
pmid = {38487210},
issn = {1664-302X},
abstract = {Streptococcus thermophilus is of major importance for cheese manufacturing to ensure rapid acidification; however, studies indicate that intensive use of commercial strains leads to the loss of typical characteristics of the products. To strengthen the link between the product and its geographical area and improve the sensory qualities of cheeses, cheese-producing protected designations of origin (PDO) are increasingly interested in the development of specific autochthonous starter cultures. The present study is therefore investigating the genetic and functional diversity of S. thermophilus strains isolated from a local cheese-producing PDO area. Putative S. thermophilus isolates were isolated and identified from milk collected in the Saint-Nectaire cheese-producing PDO area and from commercial starters. Whole genomes of isolates were sequenced, and a comparative analysis based on their pan-genome was carried out. Important functional properties were studied, including acidifying and proteolytic activities. Twenty-two isolates representative of the diversity of the geographical area and four commercial strains were selected for comparison. The resulting phylogenetic trees do not correspond to the geographical distribution of isolates. The clustering based on the pan-genome analysis indicates that isolates are divided into five distinct groups. A Kyoto Encyclopedia of Genes and Genomes (KEGG) functional annotation of the accessory genes indicates that the accessory gene contents of isolates are involved in different functional categories. High variability in acidifying activities and less diversity in proteolytic activities were also observed. These results indicate that high genetic and functional variabilities of the species S. thermophilus may arise from a small (1,800 km[2]) geographical area and may be exploited to meet demand for use as autochthonous starters.},
}
RevDate: 2024-03-15
The super-pangenome of Populus unveil genomic facets for its adaptation and diversification in widespread forest trees.
Molecular plant pii:S1674-2052(24)00082-0 [Epub ahead of print].
Understanding the underlying mechanisms and links between genome evolution and adaptive innovations stands as a key goal in evolutionary studies. Poplars, among the world's most widely distributed and cultivated trees, exhibit extensive phenotypic diversity and environmental adaptability. In this study, we present a genus-level super-pangenome comprising 19 Populus genomes, revealing the likely pivotal role of private genes in facilitating local environmental and climate adaptation. Through the integration of pan-genomes with transcriptomes, methylomes and chromatin accessibility mapping, we unveil that the evolutionary trajectories of pan-genes and duplicated genes are closely linked to local genomic landscapes of regulatory and epigenetic architectures, notably CG methylation in gene-body regions. Further comparative genomic analyses have enabled the identification of 142,202 structural variants (SVs) across species, which intersect with a significant number of genes and contribute substantially to both phenotypic and adaptive divergence. We have experimentally validated a ∼180 bp presence/absence variant impacting the expression of the CUC2 gene, crucial for leaf serration formation. Finally, we developed a user-friendly web-based tool encompassing the multi-omics resources associated with the Populus super-pangenome (http://www.populus-superpangenome.com/). Together, the present pioneering super-pangenome resource in forest trees not only aid in the advancement of breeding efforts of this globally important tree genus but also offer valuable insights into potential avenues for comprehending tree biology.
Additional Links: PMID-38486452
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38486452,
year = {2024},
author = {Shi, T and Zhang, X and Hou, Y and Jia, C and Dan, X and Zhang, Y and Jiang, Y and Lai, Q and Feng, J and Feng, J and Ma, T and Wu, J and Liu, S and Zhang, L and Long, Z and Chen, L and Street, NR and Ingvarsson, PK and Liu, J and Yin, T and Wang, J},
title = {The super-pangenome of Populus unveil genomic facets for its adaptation and diversification in widespread forest trees.},
journal = {Molecular plant},
volume = {},
number = {},
pages = {},
doi = {10.1016/j.molp.2024.03.009},
pmid = {38486452},
issn = {1752-9867},
abstract = {Understanding the underlying mechanisms and links between genome evolution and adaptive innovations stands as a key goal in evolutionary studies. Poplars, among the world's most widely distributed and cultivated trees, exhibit extensive phenotypic diversity and environmental adaptability. In this study, we present a genus-level super-pangenome comprising 19 Populus genomes, revealing the likely pivotal role of private genes in facilitating local environmental and climate adaptation. Through the integration of pan-genomes with transcriptomes, methylomes and chromatin accessibility mapping, we unveil that the evolutionary trajectories of pan-genes and duplicated genes are closely linked to local genomic landscapes of regulatory and epigenetic architectures, notably CG methylation in gene-body regions. Further comparative genomic analyses have enabled the identification of 142,202 structural variants (SVs) across species, which intersect with a significant number of genes and contribute substantially to both phenotypic and adaptive divergence. We have experimentally validated a ∼180 bp presence/absence variant impacting the expression of the CUC2 gene, crucial for leaf serration formation. Finally, we developed a user-friendly web-based tool encompassing the multi-omics resources associated with the Populus super-pangenome (http://www.populus-superpangenome.com/). Together, the present pioneering super-pangenome resource in forest trees not only aid in the advancement of breeding efforts of this globally important tree genus but also offer valuable insights into potential avenues for comprehending tree biology.},
}
RevDate: 2024-03-13
Bridging drug discovery through hierarchical subtractive genomics against asd, trpG, and secY of pneumonia causing MDR Staphylococcus aureus.
Molecular genetics and genomics : MGG, 299(1):34.
Staphylococcus aureus (S. aureus) is an opportunistic gram-positive, non-motile, and non-sporulating bacteria that induces pneumonia, a provocative lung infection affecting mainly the terminal bronchioles and the small air sacs known as alveoli. Recently, it has developed antibiotic resistance to the available consortium as per the WHO reports; thereby, novel remedial targets and resilient medications to forestall and cure this illness are desperately needed. Here, using pan-genomics, a total of 1,387 core proteins were identified. Subtractive proteome analyses further identified 12 proteins that are vital for bacteria. One membrane protein (secY) and two cytoplasmic proteins (asd and trpG) were chosen as possible therapeutic targets concerning minimum % host identity, essentiality, and other cutoff values, such as high resistance in the MDR S. aureus. The UniProt AA sequences of the selected targets were modelled and docked against 3 drug-like chemical libraries. The top-ranked compounds i.e., ZINC82049692, ZINC85492658 and 3a of Isosteviol derivative for Aspartate-semialdehyde dehydrogenase (asd); ZINC38222743, ZINC70455378, and 5 m Isosteviol derivative for Anthranilate synthase component II (trpG); and finally, ZINC72292296, ZINC85632684, and 7 m Isosteviol derivative for Protein translocase subunit secY (secY), were further subjected to molecular dynamics studies for thermodynamic stability and energy calculation. Our study proposes new therapeutic targets in S. aureus, some of which have previously been reported in other pathogenic microorganisms. Owing to further experimental validation, we anticipate that the adapted methodology and the predicted results in this work could make major contributions towards novel drug discovery and their targets in S. aureus caused pneumonia.
Additional Links: PMID-38478130
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38478130,
year = {2024},
author = {Wisal, A and Saeed, N and Aurongzeb, M and Shafique, M and Sohail, S and Anwar, W and Basharat, Z and Irfan, M and Ullah, A and Hassan, SS},
title = {Bridging drug discovery through hierarchical subtractive genomics against asd, trpG, and secY of pneumonia causing MDR Staphylococcus aureus.},
journal = {Molecular genetics and genomics : MGG},
volume = {299},
number = {1},
pages = {34},
pmid = {38478130},
issn = {1617-4623},
abstract = {Staphylococcus aureus (S. aureus) is an opportunistic gram-positive, non-motile, and non-sporulating bacteria that induces pneumonia, a provocative lung infection affecting mainly the terminal bronchioles and the small air sacs known as alveoli. Recently, it has developed antibiotic resistance to the available consortium as per the WHO reports; thereby, novel remedial targets and resilient medications to forestall and cure this illness are desperately needed. Here, using pan-genomics, a total of 1,387 core proteins were identified. Subtractive proteome analyses further identified 12 proteins that are vital for bacteria. One membrane protein (secY) and two cytoplasmic proteins (asd and trpG) were chosen as possible therapeutic targets concerning minimum % host identity, essentiality, and other cutoff values, such as high resistance in the MDR S. aureus. The UniProt AA sequences of the selected targets were modelled and docked against 3 drug-like chemical libraries. The top-ranked compounds i.e., ZINC82049692, ZINC85492658 and 3a of Isosteviol derivative for Aspartate-semialdehyde dehydrogenase (asd); ZINC38222743, ZINC70455378, and 5 m Isosteviol derivative for Anthranilate synthase component II (trpG); and finally, ZINC72292296, ZINC85632684, and 7 m Isosteviol derivative for Protein translocase subunit secY (secY), were further subjected to molecular dynamics studies for thermodynamic stability and energy calculation. Our study proposes new therapeutic targets in S. aureus, some of which have previously been reported in other pathogenic microorganisms. Owing to further experimental validation, we anticipate that the adapted methodology and the predicted results in this work could make major contributions towards novel drug discovery and their targets in S. aureus caused pneumonia.},
}
RevDate: 2024-03-13
Comparative genomic analysis of Pseudomonas aeruginosa strains susceptible and resistant to carbapenems and aztreonam isolated from patients with healthcare-associated infections in a Mexican hospital.
Molecular genetics and genomics : MGG, 299(1):29.
Pseudomonas aeruginosa (PA) is an important opportunistic pathogen that causes different infections on immunocompromised patients. Within PA accessory genome, differences in virulence, antibiotic resistance and biofilm formation have been described between strains, leading to the emergence of multidrug-resistant strains. The genome sequences of 17 strains isolated from patients with healthcare-associated infections in a Mexican hospital were genomically and phylogenetically analyzed and antibiotic resistance genes, virulence genes, and biofilm formation genes were detected. Fifteen of the 17 strains were resistant to at least two of the carbapenems meropenem, imipenem, and the monobactam aztreonam. The antibiotic resistance (mexA, mexB, and oprM) and the biofilm formation (pslA and pslD) genes were detected in all strains. Differences were found between strains in accessory genome size. The strains had different sequence types, and seven strains had sequence types associated with global high risk epidemic PA clones. All strains were represented in two groups among PA global strains. In the 17 strains, horizontally acquired resistance genes to aminoglycosides and beta-lactams were found, mainly, and between 230 and 240 genes that encode virulence factors. The strains under study were variable in terms of their accessory genome, antibiotic resistance, and virulence genes. With these characteristics, we provide information about the genomic diversity of clinically relevant PA strains.
Additional Links: PMID-38472486
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38472486,
year = {2024},
author = {Martínez-Gallardo, MJ and Villicaña, C and Yocupicio-Monroy, M and Alcaraz-Estrada, SL and Salazar-Salinas, J and Mendoza-Vázquez, OF and Damazo-Hernández, G and León-Félix, J},
title = {Comparative genomic analysis of Pseudomonas aeruginosa strains susceptible and resistant to carbapenems and aztreonam isolated from patients with healthcare-associated infections in a Mexican hospital.},
journal = {Molecular genetics and genomics : MGG},
volume = {299},
number = {1},
pages = {29},
pmid = {38472486},
issn = {1617-4623},
support = {E05//Ciencia y Tecnología ISSSTE/ ; },
abstract = {Pseudomonas aeruginosa (PA) is an important opportunistic pathogen that causes different infections on immunocompromised patients. Within PA accessory genome, differences in virulence, antibiotic resistance and biofilm formation have been described between strains, leading to the emergence of multidrug-resistant strains. The genome sequences of 17 strains isolated from patients with healthcare-associated infections in a Mexican hospital were genomically and phylogenetically analyzed and antibiotic resistance genes, virulence genes, and biofilm formation genes were detected. Fifteen of the 17 strains were resistant to at least two of the carbapenems meropenem, imipenem, and the monobactam aztreonam. The antibiotic resistance (mexA, mexB, and oprM) and the biofilm formation (pslA and pslD) genes were detected in all strains. Differences were found between strains in accessory genome size. The strains had different sequence types, and seven strains had sequence types associated with global high risk epidemic PA clones. All strains were represented in two groups among PA global strains. In the 17 strains, horizontally acquired resistance genes to aminoglycosides and beta-lactams were found, mainly, and between 230 and 240 genes that encode virulence factors. The strains under study were variable in terms of their accessory genome, antibiotic resistance, and virulence genes. With these characteristics, we provide information about the genomic diversity of clinically relevant PA strains.},
}
RevDate: 2024-03-12
Anaerostipes hadrus, a butyrate-producing bacterium capable of metabolizing 5-fluorouracil.
mSphere [Epub ahead of print].
UNLABELLED: Anaerostipes hadrus (A. hadrus) is a dominant species in the human gut microbiota and considered a beneficial bacterium for producing probiotic butyrate. However, recent studies have suggested that A. hadrus may negatively affect the host through synthesizing fatty acid and metabolizing the anticancer drug 5-fluorouracil, indicating that the impact of A. hadrus is complex and unclear. Therefore, comprehensive genomic studies on A. hadrus need to be performed. We integrated 527 high-quality public A. hadrus genomes and five distinct metagenomic cohorts. We analyzed these data using the approaches of comparative genomics, metagenomics, and protein structure prediction. We also performed validations with culture-based in vitro assays. We constructed the first large-scale pan-genome of A. hadrus (n = 527) and identified 5-fluorouracil metabolism genes as ubiquitous in A. hadrus genomes as butyrate-producing genes. Metagenomic analysis revealed the wide and stable distribution of A. hadrus in healthy individuals, patients with inflammatory bowel disease, and patients with colorectal cancer, with healthy individuals carrying more A. hadrus. The predicted high-quality protein structure indicated that A. hadrus might metabolize 5-fluorouracil by producing bacterial dihydropyrimidine dehydrogenase (encoded by the preTA operon). Through in vitro assays, we validated the short-chain fatty acid production and 5-fluorouracil metabolism abilities of A. hadrus. We observed for the first time that A. hadrus can convert 5-fluorouracil to α-fluoro-β-ureidopropionic acid, which may result from the combined action of the preTA operon and adjacent hydA (encoding bacterial dihydropyrimidinase). Our results offer novel understandings of A. hadrus, exceptionally functional features, and potential applications.
IMPORTANCE: This work provides new insights into the evolutionary relationships, functional characteristics, prevalence, and potential applications of Anaerostipes hadrus.
Additional Links: PMID-38470044
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38470044,
year = {2024},
author = {Liu, D and Xie, L-S and Lian, S and Li, K and Yang, Y and Wang, W-Z and Hu, S and Liu, S-J and Liu, C and He, Z},
title = {Anaerostipes hadrus, a butyrate-producing bacterium capable of metabolizing 5-fluorouracil.},
journal = {mSphere},
volume = {},
number = {},
pages = {e0081623},
doi = {10.1128/msphere.00816-23},
pmid = {38470044},
issn = {2379-5042},
abstract = {UNLABELLED: Anaerostipes hadrus (A. hadrus) is a dominant species in the human gut microbiota and considered a beneficial bacterium for producing probiotic butyrate. However, recent studies have suggested that A. hadrus may negatively affect the host through synthesizing fatty acid and metabolizing the anticancer drug 5-fluorouracil, indicating that the impact of A. hadrus is complex and unclear. Therefore, comprehensive genomic studies on A. hadrus need to be performed. We integrated 527 high-quality public A. hadrus genomes and five distinct metagenomic cohorts. We analyzed these data using the approaches of comparative genomics, metagenomics, and protein structure prediction. We also performed validations with culture-based in vitro assays. We constructed the first large-scale pan-genome of A. hadrus (n = 527) and identified 5-fluorouracil metabolism genes as ubiquitous in A. hadrus genomes as butyrate-producing genes. Metagenomic analysis revealed the wide and stable distribution of A. hadrus in healthy individuals, patients with inflammatory bowel disease, and patients with colorectal cancer, with healthy individuals carrying more A. hadrus. The predicted high-quality protein structure indicated that A. hadrus might metabolize 5-fluorouracil by producing bacterial dihydropyrimidine dehydrogenase (encoded by the preTA operon). Through in vitro assays, we validated the short-chain fatty acid production and 5-fluorouracil metabolism abilities of A. hadrus. We observed for the first time that A. hadrus can convert 5-fluorouracil to α-fluoro-β-ureidopropionic acid, which may result from the combined action of the preTA operon and adjacent hydA (encoding bacterial dihydropyrimidinase). Our results offer novel understandings of A. hadrus, exceptionally functional features, and potential applications.
IMPORTANCE: This work provides new insights into the evolutionary relationships, functional characteristics, prevalence, and potential applications of Anaerostipes hadrus.},
}
RevDate: 2024-03-12
Pangenome Analysis of Helicobacter pylori Isolates from Selected Areas of Africa Indicated Diverse Antibiotic Resistance and Virulence Genes.
International journal of genomics, 2024:5536117.
The challenge facing Helicobacter pylori (H. pylori) infection management in some parts of Africa is the evolution of drug-resistant species, the lack of gold standard in diagnostic methods, and the ineffectiveness of current vaccines against the bacteria. It is being established that even though clinical consequences linked to the bacteria vary geographically, there is rather a generic approach to treatment. This situation has remained problematic in the successful fight against the bacteria in parts of Africa. As a result, this study compared the genomes of selected H. pylori isolates from selected areas of Africa and evaluated their virulence and antibiotic drug resistance, those that are highly pathogenic and are associated with specific clinical outcomes and those that are less virulent and rarely associated with clinical outcomes. 146 genomes of H. pylori isolated from selected locations of Africa were sampled, and bioinformatic tools such as Abricate, CARD RGI, MLST, Prokka, Roary, Phandango, Google Sheets, and iTOLS were used to compare the isolates and their antibiotic resistance or susceptibility. Over 20 k virulence and AMR genes were observed. About 95% of the isolates were genetically diverse, 90% of the isolates harbored shell genes, and 50% harbored cloud and core genes. Some isolates did not retain the cagA and vacA genes. Clarithromycin, metronidazole, amoxicillin, and tinidazole were resistant to most AMR genes (vacA, cagA, oip, and bab). Conclusion. This study found both virulence and AMR genes in all H. pylori strains in all the selected geographies around Africa with differing quantities. MLST, Pangenome, and ORF analyses showed disparities among the isolates. This in general could imply diversities in terms of genetics, evolution, and protein production. Therefore, generic administration of antibiotics such as clarithromycin, amoxicillin, and erythromycin as treatment methods in the African subregion could be contributing to the spread of the bacterium's antibiotic resistance.
Additional Links: PMID-38469580
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38469580,
year = {2024},
author = {Yakubu, B and Appiah, EM and Adu, AF},
title = {Pangenome Analysis of Helicobacter pylori Isolates from Selected Areas of Africa Indicated Diverse Antibiotic Resistance and Virulence Genes.},
journal = {International journal of genomics},
volume = {2024},
number = {},
pages = {5536117},
pmid = {38469580},
issn = {2314-4378},
abstract = {The challenge facing Helicobacter pylori (H. pylori) infection management in some parts of Africa is the evolution of drug-resistant species, the lack of gold standard in diagnostic methods, and the ineffectiveness of current vaccines against the bacteria. It is being established that even though clinical consequences linked to the bacteria vary geographically, there is rather a generic approach to treatment. This situation has remained problematic in the successful fight against the bacteria in parts of Africa. As a result, this study compared the genomes of selected H. pylori isolates from selected areas of Africa and evaluated their virulence and antibiotic drug resistance, those that are highly pathogenic and are associated with specific clinical outcomes and those that are less virulent and rarely associated with clinical outcomes. 146 genomes of H. pylori isolated from selected locations of Africa were sampled, and bioinformatic tools such as Abricate, CARD RGI, MLST, Prokka, Roary, Phandango, Google Sheets, and iTOLS were used to compare the isolates and their antibiotic resistance or susceptibility. Over 20 k virulence and AMR genes were observed. About 95% of the isolates were genetically diverse, 90% of the isolates harbored shell genes, and 50% harbored cloud and core genes. Some isolates did not retain the cagA and vacA genes. Clarithromycin, metronidazole, amoxicillin, and tinidazole were resistant to most AMR genes (vacA, cagA, oip, and bab). Conclusion. This study found both virulence and AMR genes in all H. pylori strains in all the selected geographies around Africa with differing quantities. MLST, Pangenome, and ORF analyses showed disparities among the isolates. This in general could imply diversities in terms of genetics, evolution, and protein production. Therefore, generic administration of antibiotics such as clarithromycin, amoxicillin, and erythromycin as treatment methods in the African subregion could be contributing to the spread of the bacterium's antibiotic resistance.},
}
RevDate: 2024-03-11
Distinct Escherichia coli transcriptional profiles in the guts of recurrent UTI sufferers revealed by pan-genome hybrid selection.
bioRxiv : the preprint server for biology pii:2024.02.29.582780.
Low-abundance members of microbial communities are difficult to study in their native habitat. This includes Escherichia coli , a minor, but common inhabitant of the gastrointestinal tract and opportunistic pathogen, including of the urinary tract, where it causes most infections. While our understanding of the interactions between uropathogenic Escherichia coli (UPEC) and the bladder is increasing, comparatively little is known about UPEC in its pre-infection reservoir, partly due to its low abundance there (<1% relative abundance). In order to specifically and sensitively explore the genomes and transcriptomes of diverse E. coli from gastrointestinal communities, we developed E. coli PanSelect, a set of probes designed to enrich E. coli 's broad pangenome. First we demonstrated the ability of PanSelect to enrich diverse strains in an unbiased way using a mock community of known composition. Then we enriched E. coli DNA and RNA from human stool microbiomes by 158 and 30-fold, respectively. We also used E. coli PanSelect to explore the gene content and transcriptome of E. coli within the gut microbiomes of women with history of recurrent urinary tract infection (rUTI), finding differential regulation of pathways that suggests that the rUTI gut environment promotes respiratory vs fermentative metabolism. E. coli PanSelect technology holds promise for investigations of native in vivo biology of diverse E. coli in the gut and other environments, where it is a minor component of the microbial community, using unbiased, culture-free shotgun sequencing. This method could also be generally applied to other highly diverse, low abundance bacteria.
Additional Links: PMID-38463963
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38463963,
year = {2024},
author = {Young, MG and Straub, TJ and Worby, CJ and Metsky, HC and Gnirke, A and Bronson, RA and van Dijk, LR and Desjardins, CA and Matranga, C and Qu, J and Dodson, K and Schreiber, HL and Manson, AL and Hultgren, SJ and Earl, AM},
title = {Distinct Escherichia coli transcriptional profiles in the guts of recurrent UTI sufferers revealed by pan-genome hybrid selection.},
journal = {bioRxiv : the preprint server for biology},
volume = {},
number = {},
pages = {},
doi = {10.1101/2024.02.29.582780},
pmid = {38463963},
abstract = {Low-abundance members of microbial communities are difficult to study in their native habitat. This includes Escherichia coli , a minor, but common inhabitant of the gastrointestinal tract and opportunistic pathogen, including of the urinary tract, where it causes most infections. While our understanding of the interactions between uropathogenic Escherichia coli (UPEC) and the bladder is increasing, comparatively little is known about UPEC in its pre-infection reservoir, partly due to its low abundance there (<1% relative abundance). In order to specifically and sensitively explore the genomes and transcriptomes of diverse E. coli from gastrointestinal communities, we developed E. coli PanSelect, a set of probes designed to enrich E. coli 's broad pangenome. First we demonstrated the ability of PanSelect to enrich diverse strains in an unbiased way using a mock community of known composition. Then we enriched E. coli DNA and RNA from human stool microbiomes by 158 and 30-fold, respectively. We also used E. coli PanSelect to explore the gene content and transcriptome of E. coli within the gut microbiomes of women with history of recurrent urinary tract infection (rUTI), finding differential regulation of pathways that suggests that the rUTI gut environment promotes respiratory vs fermentative metabolism. E. coli PanSelect technology holds promise for investigations of native in vivo biology of diverse E. coli in the gut and other environments, where it is a minor component of the microbial community, using unbiased, culture-free shotgun sequencing. This method could also be generally applied to other highly diverse, low abundance bacteria.},
}
RevDate: 2024-03-11
Exploring gene content with pangenome gene graphs.
ArXiv pii:2402.16185.
MOTIVATION: The gene content regulates the biology of an organism. It varies between species and between individuals of the same species. Although tools have been developed to identify gene content changes in bacterial genomes, none is applicable to collections of large eukaryotic genomes such as the human pangenome.
RESULTS: We developed pangene, a computational tool to identify gene orientation, gene order and gene copy-number changes in a collection of genomes. Pangene aligns a set of input protein sequences to the genomes, resolves redundancies between protein sequences and constructs a gene graph with each genome represented as a walk in the graph. It additionally finds subgraphs that encodes gene content changes. Applied to the human pangenome, pangene identifies known gene-level variations and reveals complex haplotypes that are not well studied before. Pangene also works with high-quality bacterial pangenome and reports similar numbers of core and accessory genes in comparison to existing tools.
Source code at https://github.com/lh3/pangene; pre-built pangene graphs can be downloaded from https://zenodo.org/records/8118576 and visualized at https://pangene.bioinweb.org.
Additional Links: PMID-38463499
Full Text:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38463499,
year = {2024},
author = {Li, H and Marin, M and Farhat, MR},
title = {Exploring gene content with pangenome gene graphs.},
journal = {ArXiv},
volume = {},
number = {},
pages = {},
pmid = {38463499},
issn = {2331-8422},
abstract = {MOTIVATION: The gene content regulates the biology of an organism. It varies between species and between individuals of the same species. Although tools have been developed to identify gene content changes in bacterial genomes, none is applicable to collections of large eukaryotic genomes such as the human pangenome.
RESULTS: We developed pangene, a computational tool to identify gene orientation, gene order and gene copy-number changes in a collection of genomes. Pangene aligns a set of input protein sequences to the genomes, resolves redundancies between protein sequences and constructs a gene graph with each genome represented as a walk in the graph. It additionally finds subgraphs that encodes gene content changes. Applied to the human pangenome, pangene identifies known gene-level variations and reveals complex haplotypes that are not well studied before. Pangene also works with high-quality bacterial pangenome and reports similar numbers of core and accessory genes in comparison to existing tools.
Source code at https://github.com/lh3/pangene; pre-built pangene graphs can be downloaded from https://zenodo.org/records/8118576 and visualized at https://pangene.bioinweb.org.},
}
RevDate: 2024-03-10
Biodegradation of phthalate acid esters and whole-genome analysis of a novel Streptomyces sp. FZ201 isolated from natural habitats.
Journal of hazardous materials, 469:133972 pii:S0304-3894(24)00551-X [Epub ahead of print].
Di-n-butyl phthalate (DBP) is one of the most extensively used phthalic acid esters (PAEs) and is considered to be an emerging, globally concerning pollutant. The genus Streptomyces holds promise as a degrader of various organic pollutants, but PAE biodegradation mechanisms by Streptomyces species remain unsolved. In this study, a novel PAE-degrading Streptomyces sp. FZ201 isolated from natural habitats efficiently degraded various PAEs. FZ201 had strong resilience against DBP and exhibited immediate degradation, with kinetics adhering to a first-order model. The comprehensive biodegradation of DBP involves de-esterification, β-oxidation, trans-esterification, and aromatic ring cleavage. FZ201 contains numerous catabolic genes that potentially facilitate PAE biodegradation. The DBP metabolic pathway was reconstructed by genome annotation and intermediate identification. Streptomyces species have an open pangenome with substantial genome expansion events during the evolutionary process, enabling extensive genetic diversity and highly plastic genomes within the Streptomyces genus. FZ201 had a diverse array of highly expressed genes associated with the degradation of PAEs, potentially contributing significantly to its adaptive advantage and efficiency of PAE degradation. Thus, FZ201 is a promising candidate for remediating highly PAE-contaminated environments. These findings enhance our preliminary understanding of the molecular mechanisms employed by Streptomyces for the removal of PAEs.
Additional Links: PMID-38461665
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38461665,
year = {2024},
author = {Feng, NX and Li, DW and Zhang, F and Bin, H and Huang, YT and Xiang, L and Liu, BL and Cai, QY and Li, YW and Xu, DL and Xie, Y and Mo, CH},
title = {Biodegradation of phthalate acid esters and whole-genome analysis of a novel Streptomyces sp. FZ201 isolated from natural habitats.},
journal = {Journal of hazardous materials},
volume = {469},
number = {},
pages = {133972},
doi = {10.1016/j.jhazmat.2024.133972},
pmid = {38461665},
issn = {1873-3336},
abstract = {Di-n-butyl phthalate (DBP) is one of the most extensively used phthalic acid esters (PAEs) and is considered to be an emerging, globally concerning pollutant. The genus Streptomyces holds promise as a degrader of various organic pollutants, but PAE biodegradation mechanisms by Streptomyces species remain unsolved. In this study, a novel PAE-degrading Streptomyces sp. FZ201 isolated from natural habitats efficiently degraded various PAEs. FZ201 had strong resilience against DBP and exhibited immediate degradation, with kinetics adhering to a first-order model. The comprehensive biodegradation of DBP involves de-esterification, β-oxidation, trans-esterification, and aromatic ring cleavage. FZ201 contains numerous catabolic genes that potentially facilitate PAE biodegradation. The DBP metabolic pathway was reconstructed by genome annotation and intermediate identification. Streptomyces species have an open pangenome with substantial genome expansion events during the evolutionary process, enabling extensive genetic diversity and highly plastic genomes within the Streptomyces genus. FZ201 had a diverse array of highly expressed genes associated with the degradation of PAEs, potentially contributing significantly to its adaptive advantage and efficiency of PAE degradation. Thus, FZ201 is a promising candidate for remediating highly PAE-contaminated environments. These findings enhance our preliminary understanding of the molecular mechanisms employed by Streptomyces for the removal of PAEs.},
}
RevDate: 2024-03-08
Whole-genome sequencing and analysis of Chryseobacterium arthrosphaerae from Rana nigromaculata.
BMC microbiology, 24(1):80.
Chryseobacterium arthrosphaerae strain FS91703 was isolated from Rana nigromaculata in our previous study. To investigate the genomic characteristics, pathogenicity-related genes, antimicrobial resistance, and phylogenetic relationship of this strain, PacBio RS II and Illumina HiSeq 2000 platforms were used for the whole genome sequencing. The genome size of strain FS91703 was 5,435,691 bp and GC content was 37.78%. A total of 4,951 coding genes were predicted; 99 potential virulence factors homologs were identified. Analysis of antibiotic resistance genes revealed that strain FS91703 harbored 10 antibiotic resistance genes in 6 categories and 2 multidrug-resistant efflux pump genes, including adeG and farA. Strain FS91703 was sensitive to β-lactam combination drugs, cephem, monobactam and carbapenems, intermediately resistant to phenicol, and resistant to penicillin, aminoglycosides, tetracycline, fluoroquinolones, and folate pathway inhibitors. Phylogenetic analysis revealed that strain FS91703 and C. arthrosphaerae CC-VM-7[T] were on the same branch of the phylogenetic tree based on 16 S rRNA; the ANI value between them was 96.99%; and the DDH values were 80.2, 72.2 and 81.6% by three default calculation formulae. These results suggested that strain FS91703 was a species of C. arthrosphaerae. Pan-genome analysis showed FS91703 had 566 unique genes compared with 13 other C. arthrosphaerae strains, and had a distant phylogenetic relationship with the other C. arthrosphaerae strains of the same branch in phylogenetic tree based on orthologous genes. The results of this study suggest that strain FS91703 is a multidrug-resistant and highly virulent bacterium, that differs from other C. arthrosphaerae strains at the genomic level. The knowledge about the genomic characteristics and antimicrobial resistance of strain FS91703 provides valuable insights into this rare species, as well as guidance for the treatment of the disease caused by FS91703 in Rana nigromaculata.
Additional Links: PMID-38459435
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38459435,
year = {2024},
author = {Zhu, L and Liu, H and Li, X and Shi, Y and Yin, X and Pi, X},
title = {Whole-genome sequencing and analysis of Chryseobacterium arthrosphaerae from Rana nigromaculata.},
journal = {BMC microbiology},
volume = {24},
number = {1},
pages = {80},
pmid = {38459435},
issn = {1471-2180},
support = {2022SNJF072//Zhejiang Provincial Science and Technology Cooperation Plan of "Three Rural Areas and Nine Rural Areas"/ ; 2022SNJF072//Zhejiang Provincial Science and Technology Cooperation Plan of "Three Rural Areas and Nine Rural Areas"/ ; 2022SNJF072//Zhejiang Provincial Science and Technology Cooperation Plan of "Three Rural Areas and Nine Rural Areas"/ ; 2022SNJF072//Zhejiang Provincial Science and Technology Cooperation Plan of "Three Rural Areas and Nine Rural Areas"/ ; 2022SNJF072//Zhejiang Provincial Science and Technology Cooperation Plan of "Three Rural Areas and Nine Rural Areas"/ ; 2022SNJF072//Zhejiang Provincial Science and Technology Cooperation Plan of "Three Rural Areas and Nine Rural Areas"/ ; },
abstract = {Chryseobacterium arthrosphaerae strain FS91703 was isolated from Rana nigromaculata in our previous study. To investigate the genomic characteristics, pathogenicity-related genes, antimicrobial resistance, and phylogenetic relationship of this strain, PacBio RS II and Illumina HiSeq 2000 platforms were used for the whole genome sequencing. The genome size of strain FS91703 was 5,435,691 bp and GC content was 37.78%. A total of 4,951 coding genes were predicted; 99 potential virulence factors homologs were identified. Analysis of antibiotic resistance genes revealed that strain FS91703 harbored 10 antibiotic resistance genes in 6 categories and 2 multidrug-resistant efflux pump genes, including adeG and farA. Strain FS91703 was sensitive to β-lactam combination drugs, cephem, monobactam and carbapenems, intermediately resistant to phenicol, and resistant to penicillin, aminoglycosides, tetracycline, fluoroquinolones, and folate pathway inhibitors. Phylogenetic analysis revealed that strain FS91703 and C. arthrosphaerae CC-VM-7[T] were on the same branch of the phylogenetic tree based on 16 S rRNA; the ANI value between them was 96.99%; and the DDH values were 80.2, 72.2 and 81.6% by three default calculation formulae. These results suggested that strain FS91703 was a species of C. arthrosphaerae. Pan-genome analysis showed FS91703 had 566 unique genes compared with 13 other C. arthrosphaerae strains, and had a distant phylogenetic relationship with the other C. arthrosphaerae strains of the same branch in phylogenetic tree based on orthologous genes. The results of this study suggest that strain FS91703 is a multidrug-resistant and highly virulent bacterium, that differs from other C. arthrosphaerae strains at the genomic level. The knowledge about the genomic characteristics and antimicrobial resistance of strain FS91703 provides valuable insights into this rare species, as well as guidance for the treatment of the disease caused by FS91703 in Rana nigromaculata.},
}
RevDate: 2024-03-07
Pan-genome analysis of Streptococcus suis serotype 2 highlights genes associated with virulence and antibiotic resistance.
Frontiers in microbiology, 15:1362316.
Streptococcus suis serotype 2 (SS2) is a Gram-positive bacterium. It is a common and significant pathogen in pigs and a common cause of zoonotic meningitis in humans. It can lead to sepsis, endocarditis, arthritis, and pneumonia. If not diagnosed and treated promptly, it has a high mortality rate. The pan-genome of SS2 is open, and with an increasing number of genes, the core genome and accessory genome may exhibit more pronounced differences. Due to the diversity of SS2, the genes related to its virulence and resistance are still unclear. In this study, a strain of SS2 was isolated from a pig farm in Sichuan Province, China, and subjected to whole-genome sequencing and characterization. Subsequently, we conducted a Pan-Genome-Wide Association Study (Pan-GWAS) on 230 strains of SS2. Our analysis indicates that the core genome is composed of 1,458 genes related to the basic life processes of the bacterium. The accessory genome, consisting of 4,337 genes, is highly variable and a major contributor to the genetic diversity of SS2. Furthermore, we identified important virulence and resistance genes in SS2 through pan-GWAS. The virulence genes of SS2 are mainly associated with bacterial adhesion. In addition, resistance genes in the core genome may confer natural resistance of SS2 to fluoroquinolone and glycopeptide antibiotics. This study lays the foundation for further research on the virulence and resistance of SS2, providing potential new drug and vaccine targets against SS2.
Additional Links: PMID-38450165
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38450165,
year = {2024},
author = {Zhou, Y and Tu, T and Yao, X and Luo, Y and Yang, Z and Ren, M and Zhang, G and Yu, Y and Lu, A and Wang, Y},
title = {Pan-genome analysis of Streptococcus suis serotype 2 highlights genes associated with virulence and antibiotic resistance.},
journal = {Frontiers in microbiology},
volume = {15},
number = {},
pages = {1362316},
pmid = {38450165},
issn = {1664-302X},
abstract = {Streptococcus suis serotype 2 (SS2) is a Gram-positive bacterium. It is a common and significant pathogen in pigs and a common cause of zoonotic meningitis in humans. It can lead to sepsis, endocarditis, arthritis, and pneumonia. If not diagnosed and treated promptly, it has a high mortality rate. The pan-genome of SS2 is open, and with an increasing number of genes, the core genome and accessory genome may exhibit more pronounced differences. Due to the diversity of SS2, the genes related to its virulence and resistance are still unclear. In this study, a strain of SS2 was isolated from a pig farm in Sichuan Province, China, and subjected to whole-genome sequencing and characterization. Subsequently, we conducted a Pan-Genome-Wide Association Study (Pan-GWAS) on 230 strains of SS2. Our analysis indicates that the core genome is composed of 1,458 genes related to the basic life processes of the bacterium. The accessory genome, consisting of 4,337 genes, is highly variable and a major contributor to the genetic diversity of SS2. Furthermore, we identified important virulence and resistance genes in SS2 through pan-GWAS. The virulence genes of SS2 are mainly associated with bacterial adhesion. In addition, resistance genes in the core genome may confer natural resistance of SS2 to fluoroquinolone and glycopeptide antibiotics. This study lays the foundation for further research on the virulence and resistance of SS2, providing potential new drug and vaccine targets against SS2.},
}
RevDate: 2024-03-06
Recent advances in plant translational genomics for crop improvement.
Advances in protein chemistry and structural biology, 139:335-382.
The growing population, climate change, and limited agricultural resources put enormous pressure on agricultural systems. A plateau in crop yields is occurring and extreme weather events and urbanization threaten the livelihood of farmers. It is imperative that immediate attention is paid to addressing the increasing food demand, ensuring resilience against emerging threats, and meeting the demand for more nutritious, safer food. Under uncertain conditions, it is essential to expand genetic diversity and discover novel crop varieties or variations to develop higher and more stable yields. Genomics plays a significant role in developing abundant and nutrient-dense food crops. An alternative to traditional breeding approach, translational genomics is able to improve breeding programs in a more efficient and precise manner by translating genomic concepts into practical tools. Crop breeding based on genomics offers potential solutions to overcome the limitations of conventional breeding methods, including improved crop varieties that provide more nutritional value and are protected from biotic and abiotic stresses. Genetic markers, such as SNPs and ESTs, contribute to the discovery of QTLs controlling agronomic traits and stress tolerance. In order to meet the growing demand for food, there is a need to incorporate QTLs into breeding programs using marker-assisted selection/breeding and transgenic technologies. This chapter primarily focuses on the recent advances that are made in translational genomics for crop improvement and various omics techniques including transcriptomics, metagenomics, pangenomics, single cell omics etc. Numerous genome editing techniques including CRISPR Cas technology and their applications in crop improvement had been discussed.
Additional Links: PMID-38448140
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38448140,
year = {2024},
author = {Mathur, S and Singh, D and Ranjan, R},
title = {Recent advances in plant translational genomics for crop improvement.},
journal = {Advances in protein chemistry and structural biology},
volume = {139},
number = {},
pages = {335-382},
doi = {10.1016/bs.apcsb.2023.11.009},
pmid = {38448140},
issn = {1876-1631},
abstract = {The growing population, climate change, and limited agricultural resources put enormous pressure on agricultural systems. A plateau in crop yields is occurring and extreme weather events and urbanization threaten the livelihood of farmers. It is imperative that immediate attention is paid to addressing the increasing food demand, ensuring resilience against emerging threats, and meeting the demand for more nutritious, safer food. Under uncertain conditions, it is essential to expand genetic diversity and discover novel crop varieties or variations to develop higher and more stable yields. Genomics plays a significant role in developing abundant and nutrient-dense food crops. An alternative to traditional breeding approach, translational genomics is able to improve breeding programs in a more efficient and precise manner by translating genomic concepts into practical tools. Crop breeding based on genomics offers potential solutions to overcome the limitations of conventional breeding methods, including improved crop varieties that provide more nutritional value and are protected from biotic and abiotic stresses. Genetic markers, such as SNPs and ESTs, contribute to the discovery of QTLs controlling agronomic traits and stress tolerance. In order to meet the growing demand for food, there is a need to incorporate QTLs into breeding programs using marker-assisted selection/breeding and transgenic technologies. This chapter primarily focuses on the recent advances that are made in translational genomics for crop improvement and various omics techniques including transcriptomics, metagenomics, pangenomics, single cell omics etc. Numerous genome editing techniques including CRISPR Cas technology and their applications in crop improvement had been discussed.},
}
RevDate: 2024-03-04
Three near-complete genome assemblies reveal substantial centromere dynamics from diploid to tetraploid in Brachypodium genus.
Genome biology, 25(1):63.
BACKGROUND: Centromeres are critical for maintaining genomic stability in eukaryotes, and their turnover shapes genome architectures and drives karyotype evolution. However, the co-evolution of centromeres from different species in allopolyploids over millions of years remains largely unknown.
RESULTS: Here, we generate three near-complete genome assemblies, a tetraploid Brachypodium hybridum and its two diploid ancestors, Brachypodium distachyon and Brachypodium stacei. We detect high degrees of sequence, structural, and epigenetic variations of centromeres at base-pair resolution between closely related Brachypodium genomes, indicating the appearance and accumulation of species-specific centromere repeats from a common origin during evolution. We also find that centromere homogenization is accompanied by local satellite repeats bursting and retrotransposon purging, and the frequency of retrotransposon invasions drives the degree of interspecies centromere diversification. We further investigate the dynamics of centromeres during alloploidization process, and find that dramatic genetics and epigenetics architecture variations are associated with the turnover of centromeres between homologous chromosomal pairs from diploid to tetraploid. Additionally, our pangenomes analysis reveals the ongoing variations of satellite repeats and stable evolutionary homeostasis within centromeres among individuals of each Brachypodium genome with different polyploidy levels.
CONCLUSIONS: Our results provide unprecedented information on the genomic, epigenomic, and functional diversity of highly repetitive DNA between closely related species and their allopolyploid genomes at both coarse and fine scale.
Additional Links: PMID-38439049
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38439049,
year = {2024},
author = {Chen, C and Wu, S and Sun, Y and Zhou, J and Chen, Y and Zhang, J and Birchler, JA and Han, F and Yang, N and Su, H},
title = {Three near-complete genome assemblies reveal substantial centromere dynamics from diploid to tetraploid in Brachypodium genus.},
journal = {Genome biology},
volume = {25},
number = {1},
pages = {63},
pmid = {38439049},
issn = {1474-760X},
support = {2021YFF1000800//National Key Research and Development Program of China/ ; 32170571//National Natural Science Foundation of China/ ; 2021ZKPY008//Fundamental Research Funds for the Central Universities/ ; No. B21HJ0504//Hainan Yazhou Bay Seed Laboratory/ ; },
abstract = {BACKGROUND: Centromeres are critical for maintaining genomic stability in eukaryotes, and their turnover shapes genome architectures and drives karyotype evolution. However, the co-evolution of centromeres from different species in allopolyploids over millions of years remains largely unknown.
RESULTS: Here, we generate three near-complete genome assemblies, a tetraploid Brachypodium hybridum and its two diploid ancestors, Brachypodium distachyon and Brachypodium stacei. We detect high degrees of sequence, structural, and epigenetic variations of centromeres at base-pair resolution between closely related Brachypodium genomes, indicating the appearance and accumulation of species-specific centromere repeats from a common origin during evolution. We also find that centromere homogenization is accompanied by local satellite repeats bursting and retrotransposon purging, and the frequency of retrotransposon invasions drives the degree of interspecies centromere diversification. We further investigate the dynamics of centromeres during alloploidization process, and find that dramatic genetics and epigenetics architecture variations are associated with the turnover of centromeres between homologous chromosomal pairs from diploid to tetraploid. Additionally, our pangenomes analysis reveals the ongoing variations of satellite repeats and stable evolutionary homeostasis within centromeres among individuals of each Brachypodium genome with different polyploidy levels.
CONCLUSIONS: Our results provide unprecedented information on the genomic, epigenomic, and functional diversity of highly repetitive DNA between closely related species and their allopolyploid genomes at both coarse and fine scale.},
}
RevDate: 2024-03-04
Genomics-based analysis of four porcine-derived lactic acid bacteria strains and their evaluation as potential probiotics.
Molecular genetics and genomics : MGG, 299(1):24.
The search for probiotics and exploration of their functions are crucial for livestock farming. Recently, porcine-derived lactic acid bacteria (LAB) have shown great potential as probiotics. However, research on the evaluation of porcine-derived LAB as potential probiotics through genomics-based analysis is relatively limited. The present study analyzed four porcine-derived LAB strains (Lactobacillus johnsonii L16, Latilactobacillus curvatus ZHA1, Ligilactobacillus salivarius ZSA5 and Ligilactobacillus animalis ZSB1) using genomic techniques and combined with in vitro tests to evaluate their potential as probiotics. The genome sizes of the four strains ranged from 1,897,301 bp to 2,318,470 bp with the GC contents from 33.03 to 41.97%. Pan-genomic analysis and collinearity analysis indicated differences among the genomes of four strains. Carbohydrate active enzymes analysis revealed that L. johnsonii L16 encoded more carbohydrate active enzymes than other strains. KEGG pathway analysis and in vitro tests confirmed that L. johnsonii L16 could utilize a wide range of carbohydrates and had good utilization capacity for each carbohydrate. The four strains had genes related to acid tolerance and were tolerant to low pH, with L. johnsonii L16 showing the greatest tolerance. The four strains contained genes related to bile salt tolerance and were able to tolerate 0.1% bile salt. Four strains had antioxidant related genes and exhibited antioxidant activity in in vitro tests. They contained the genes linked with organic acid biosynthesis and exhibited antibacterial activity against enterotoxigenic Escherichia coli K88 (ETEC K88) and Salmonella 6,7:c:1,5, wherein, L. johnsonii L16 and L. salivarius ZSA5 had gene clusters encoding bacteriocin. Results suggest that genome analysis combined with in vitro tests is an effective approach for evaluating different strains as probiotics. The findings of this study indicate that L. johnsonii L16 has the potential as a probiotic strain among the four strains and provide theoretical basis for the development of probiotics in swine production.
Additional Links: PMID-38438804
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38438804,
year = {2024},
author = {Niu, D and Feng, N and Xi, S and Xu, J and Su, Y},
title = {Genomics-based analysis of four porcine-derived lactic acid bacteria strains and their evaluation as potential probiotics.},
journal = {Molecular genetics and genomics : MGG},
volume = {299},
number = {1},
pages = {24},
pmid = {38438804},
issn = {1617-4623},
support = {2022YFD1300402//Key Technologies Research and Development Program/ ; 31872362//National Natural Science Foundation of China/ ; 32072688//National Natural Science Foundation of China/ ; },
abstract = {The search for probiotics and exploration of their functions are crucial for livestock farming. Recently, porcine-derived lactic acid bacteria (LAB) have shown great potential as probiotics. However, research on the evaluation of porcine-derived LAB as potential probiotics through genomics-based analysis is relatively limited. The present study analyzed four porcine-derived LAB strains (Lactobacillus johnsonii L16, Latilactobacillus curvatus ZHA1, Ligilactobacillus salivarius ZSA5 and Ligilactobacillus animalis ZSB1) using genomic techniques and combined with in vitro tests to evaluate their potential as probiotics. The genome sizes of the four strains ranged from 1,897,301 bp to 2,318,470 bp with the GC contents from 33.03 to 41.97%. Pan-genomic analysis and collinearity analysis indicated differences among the genomes of four strains. Carbohydrate active enzymes analysis revealed that L. johnsonii L16 encoded more carbohydrate active enzymes than other strains. KEGG pathway analysis and in vitro tests confirmed that L. johnsonii L16 could utilize a wide range of carbohydrates and had good utilization capacity for each carbohydrate. The four strains had genes related to acid tolerance and were tolerant to low pH, with L. johnsonii L16 showing the greatest tolerance. The four strains contained genes related to bile salt tolerance and were able to tolerate 0.1% bile salt. Four strains had antioxidant related genes and exhibited antioxidant activity in in vitro tests. They contained the genes linked with organic acid biosynthesis and exhibited antibacterial activity against enterotoxigenic Escherichia coli K88 (ETEC K88) and Salmonella 6,7:c:1,5, wherein, L. johnsonii L16 and L. salivarius ZSA5 had gene clusters encoding bacteriocin. Results suggest that genome analysis combined with in vitro tests is an effective approach for evaluating different strains as probiotics. The findings of this study indicate that L. johnsonii L16 has the potential as a probiotic strain among the four strains and provide theoretical basis for the development of probiotics in swine production.},
}
RevDate: 2024-03-01
CmpDate: 2024-03-01
Comparative genomics reveals distinct diversification patterns among LysR-type transcriptional regulators in the ESKAPE pathogen Pseudomonas aeruginosa.
Microbial genomics, 10(2):.
Pseudomonas aeruginosa, a harmful nosocomial pathogen associated with cystic fibrosis and burn wounds, encodes for a large number of LysR-type transcriptional regulator proteins. To understand how and why LTTR proteins evolved with such frequency and to establish whether any relationships exist within the distribution we set out to identify the patterns underpinning LTTR distribution in P. aeruginosa and to uncover cluster-based relationships within the pangenome. Comparative genomic studies revealed that in the JGI IMG database alone ~86 000 LTTRs are present across the sequenced genomes (n=699). They are widely distributed across the species, with core LTTRs present in >93 % of the genomes and accessory LTTRs present in <7 %. Analysis showed that subsets of core LTTRs can be classified as either variable (typically specific to P. aeruginosa) or conserved (and found to be distributed in other Pseudomonas species). Extending the analysis to the more extensive Pseudomonas database, PA14 rooted analysis confirmed the diversification patterns and revealed PqsR, the receptor for the Pseudomonas quinolone signal (PQS) and 2-heptyl-4-quinolone (HHQ) quorum-sensing signals, to be amongst the most variable in the dataset. Successful complementation of the PAO1 pqsR [-] mutant using representative variant pqsR sequences suggests a degree of structural promiscuity within the most variable of LTTRs, several of which play a prominent role in signalling and communication. These findings provide a new insight into the diversification of LTTR proteins within the P. aeruginosa species and suggests a functional significance to the cluster, conservation and distribution patterns identified.
Additional Links: PMID-38421269
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38421269,
year = {2024},
author = {Deery, J and Carmody, M and Flavin, R and Tomanek, M and O'Keeffe, M and McGlacken, GP and Reen, FJ},
title = {Comparative genomics reveals distinct diversification patterns among LysR-type transcriptional regulators in the ESKAPE pathogen Pseudomonas aeruginosa.},
journal = {Microbial genomics},
volume = {10},
number = {2},
pages = {},
doi = {10.1099/mgen.0.001205},
pmid = {38421269},
issn = {2057-5858},
mesh = {Humans ; Pseudomonas aeruginosa/genetics ; Genomics ; *Pseudomonas Infections ; Pseudomonas ; *Cystic Fibrosis/genetics ; },
abstract = {Pseudomonas aeruginosa, a harmful nosocomial pathogen associated with cystic fibrosis and burn wounds, encodes for a large number of LysR-type transcriptional regulator proteins. To understand how and why LTTR proteins evolved with such frequency and to establish whether any relationships exist within the distribution we set out to identify the patterns underpinning LTTR distribution in P. aeruginosa and to uncover cluster-based relationships within the pangenome. Comparative genomic studies revealed that in the JGI IMG database alone ~86 000 LTTRs are present across the sequenced genomes (n=699). They are widely distributed across the species, with core LTTRs present in >93 % of the genomes and accessory LTTRs present in <7 %. Analysis showed that subsets of core LTTRs can be classified as either variable (typically specific to P. aeruginosa) or conserved (and found to be distributed in other Pseudomonas species). Extending the analysis to the more extensive Pseudomonas database, PA14 rooted analysis confirmed the diversification patterns and revealed PqsR, the receptor for the Pseudomonas quinolone signal (PQS) and 2-heptyl-4-quinolone (HHQ) quorum-sensing signals, to be amongst the most variable in the dataset. Successful complementation of the PAO1 pqsR [-] mutant using representative variant pqsR sequences suggests a degree of structural promiscuity within the most variable of LTTRs, several of which play a prominent role in signalling and communication. These findings provide a new insight into the diversification of LTTR proteins within the P. aeruginosa species and suggests a functional significance to the cluster, conservation and distribution patterns identified.},
}
MeSH Terms:
show MeSH Terms
hide MeSH Terms
Humans
Pseudomonas aeruginosa/genetics
Genomics
*Pseudomonas Infections
Pseudomonas
*Cystic Fibrosis/genetics
RevDate: 2024-02-29
The chromosome-scale genome of wild Brassica oleracea provides insights into the domestication of Brassica plants.
Journal of experimental botany pii:7616234 [Epub ahead of print].
The cultivated diploid Brassica oleracea is an important vegetable crop, but the genetic basis of domestication remains largely unclear without high-quality reference genomes of wild B. oleracea. Here, we report the first chromosome-level assembly of the wild Brassica oleracea L. W03 genome, (total genome size, 630.7 Mb; scaffold N50, 64.6 Mb). Using newly assembled W03 genome, we constructed a gene-based B. oleracea pangenome and identified 29,744 core genes, 23,306 dispensable genes, and 1,896 private genes. We resequenced 53 accessions, which represent six potential wild B. oleracea progenitor species. The results of the population genomic analysis showed that wild B. oleracea population had the highest level of diversity and represented the more closely related population of horticultural B. oleracea. Additionally, the WUSCHEL gene was found to play a decisive role in domestication and to be involved in cauliflower and broccoli curd formation. We also illustrate the loss of disease resistance genes during domestication selection. Our results provide deep insights into B. oleracea domestication and will facilitate Brassica crop genetic improvement.
Additional Links: PMID-38421062
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38421062,
year = {2024},
author = {Ji, G and Long, Y and Cai, G and Wang, A and Yan, G and Li, H and Gao, G and Xu, K and Huang, Q and Chen, B and Li, L and Li, F and Nishio, T and Shen, J and Wu, X},
title = {The chromosome-scale genome of wild Brassica oleracea provides insights into the domestication of Brassica plants.},
journal = {Journal of experimental botany},
volume = {},
number = {},
pages = {},
doi = {10.1093/jxb/erae079},
pmid = {38421062},
issn = {1460-2431},
abstract = {The cultivated diploid Brassica oleracea is an important vegetable crop, but the genetic basis of domestication remains largely unclear without high-quality reference genomes of wild B. oleracea. Here, we report the first chromosome-level assembly of the wild Brassica oleracea L. W03 genome, (total genome size, 630.7 Mb; scaffold N50, 64.6 Mb). Using newly assembled W03 genome, we constructed a gene-based B. oleracea pangenome and identified 29,744 core genes, 23,306 dispensable genes, and 1,896 private genes. We resequenced 53 accessions, which represent six potential wild B. oleracea progenitor species. The results of the population genomic analysis showed that wild B. oleracea population had the highest level of diversity and represented the more closely related population of horticultural B. oleracea. Additionally, the WUSCHEL gene was found to play a decisive role in domestication and to be involved in cauliflower and broccoli curd formation. We also illustrate the loss of disease resistance genes during domestication selection. Our results provide deep insights into B. oleracea domestication and will facilitate Brassica crop genetic improvement.},
}
RevDate: 2024-02-28
Proteomics-based vaccine targets annotation and design of multi-epitope vaccine against antibiotic-resistant Streptococcus gallolyticus.
Scientific reports, 14(1):4836.
Streptococcus gallolyticus is a non-motile, gram-positive bacterium that causes infective endocarditis. S. gallolyticus has developed resistance to existing antibiotics, and no vaccine is currently available. Therefore, it is essential to develop an effective S. gallolyticus vaccine. Core proteomics was used in this study together with subtractive proteomics and reverse vaccinology approach to find antigenic proteins that could be utilized for the design of the S. gallolyticus multi-epitope vaccine. The pipeline identified two antigenic proteins as potential vaccine targets: penicillin-binding protein and the ATP synthase subunit. T and B cell epitopes from the specific proteins were forecasted employing several immunoinformatics and bioinformatics resources. A vaccine (360 amino acids) was created using a combination of seven cytotoxic T cell lymphocyte (CTL), three helper T cell lymphocyte (HTL), and five linear B cell lymphocyte (LBL) epitopes. To increase immune responses, the vaccine was paired with a cholera enterotoxin subunit B (CTB) adjuvant. The developed vaccine was highly antigenic, non-allergenic, and stable for human use. The vaccine's binding affinity and molecular interactions with the human immunological receptor TLR4 were studied using molecular mechanics/generalized Born surface area (MMGBSA), molecular docking, and molecular dynamic (MD) simulation analyses. Escherichia coli (strain K12) plasmid vector pET-28a (+) was used to examine the ability of the vaccine to be expressed. According to the outcomes of these computer experiments, the vaccine is quite promising in terms of developing a protective immunity against diseases. However, in vitro and animal research are required to validate our findings.
Additional Links: PMID-38418560
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38418560,
year = {2024},
author = {Chao, P and Zhang, X and Zhang, L and Yang, A and Wang, Y and Chen, X},
title = {Proteomics-based vaccine targets annotation and design of multi-epitope vaccine against antibiotic-resistant Streptococcus gallolyticus.},
journal = {Scientific reports},
volume = {14},
number = {1},
pages = {4836},
pmid = {38418560},
issn = {2045-2322},
abstract = {Streptococcus gallolyticus is a non-motile, gram-positive bacterium that causes infective endocarditis. S. gallolyticus has developed resistance to existing antibiotics, and no vaccine is currently available. Therefore, it is essential to develop an effective S. gallolyticus vaccine. Core proteomics was used in this study together with subtractive proteomics and reverse vaccinology approach to find antigenic proteins that could be utilized for the design of the S. gallolyticus multi-epitope vaccine. The pipeline identified two antigenic proteins as potential vaccine targets: penicillin-binding protein and the ATP synthase subunit. T and B cell epitopes from the specific proteins were forecasted employing several immunoinformatics and bioinformatics resources. A vaccine (360 amino acids) was created using a combination of seven cytotoxic T cell lymphocyte (CTL), three helper T cell lymphocyte (HTL), and five linear B cell lymphocyte (LBL) epitopes. To increase immune responses, the vaccine was paired with a cholera enterotoxin subunit B (CTB) adjuvant. The developed vaccine was highly antigenic, non-allergenic, and stable for human use. The vaccine's binding affinity and molecular interactions with the human immunological receptor TLR4 were studied using molecular mechanics/generalized Born surface area (MMGBSA), molecular docking, and molecular dynamic (MD) simulation analyses. Escherichia coli (strain K12) plasmid vector pET-28a (+) was used to examine the ability of the vaccine to be expressed. According to the outcomes of these computer experiments, the vaccine is quite promising in terms of developing a protective immunity against diseases. However, in vitro and animal research are required to validate our findings.},
}
RevDate: 2024-02-28
Exploring the resistome and virulome in major sequence types of Acinetobacter baumannii genomes: Correlations with genome divergence and sequence types.
Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases pii:S1567-1348(24)00030-3 [Epub ahead of print].
The increasing global prevalence of antimicrobial resistance in Acinetobacter baumannii has led to concerns regarding the effectiveness of infection treatment. Moreover, the critical role of virulence factor genes in A. baumannii's pathogenesis and its propensity to cause severe disease is of particular importance. Comparative genomics, including multi-locus sequence typing (MLST), enhances our understanding of A. baumannii epidemiology. While there is substantial documentation on A. baumannii, a comprehensive study of the antibiotic-resistant mechanisms and the virulence factors contributing to pathogenesis, and their correlation with Sequence Types (STs) remains incompletely elucidated. In this study, we aim to explore the relationship between antimicrobial resistance genes, virulence factor genes, and STs using genomic data from 223 publicly available A. baumannii strains. The core phylogeny analysis revealed five predominant STs in A. baumannii genomes, linked to their geographical sources of isolation. Furthermore, the resistome and virulome of A. baumannii followed an evolutionary pattern consistent with their pan-genome evolution. Among the major STs, we observed significant variations in resistant genes against "aminoglycoside" and "sulphonamide" antibiotics, highlighting the role of genotypic variations in determining resistance profiles. Furthermore, the presence of virulence factor genes, particularly exotoxin and nutritional / metabolic factor genes, played a crucial role in distinguishing the major STs, suggesting a potential link between genetic makeup and pathogenicity. Understanding these associations can provide valuable insights into A. baumannii's virulence potential and clinical outcomes, enabling the development of effective strategies to combat infections caused by this opportunistic pathogen.
Additional Links: PMID-38417638
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38417638,
year = {2024},
author = {Banerjee, R and Robinson, SM and Lahiri, A and Verma, P and Banerjee, AK and Basak, S and Basak, K and Paul, S},
title = {Exploring the resistome and virulome in major sequence types of Acinetobacter baumannii genomes: Correlations with genome divergence and sequence types.},
journal = {Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases},
volume = {},
number = {},
pages = {105579},
doi = {10.1016/j.meegid.2024.105579},
pmid = {38417638},
issn = {1567-7257},
abstract = {The increasing global prevalence of antimicrobial resistance in Acinetobacter baumannii has led to concerns regarding the effectiveness of infection treatment. Moreover, the critical role of virulence factor genes in A. baumannii's pathogenesis and its propensity to cause severe disease is of particular importance. Comparative genomics, including multi-locus sequence typing (MLST), enhances our understanding of A. baumannii epidemiology. While there is substantial documentation on A. baumannii, a comprehensive study of the antibiotic-resistant mechanisms and the virulence factors contributing to pathogenesis, and their correlation with Sequence Types (STs) remains incompletely elucidated. In this study, we aim to explore the relationship between antimicrobial resistance genes, virulence factor genes, and STs using genomic data from 223 publicly available A. baumannii strains. The core phylogeny analysis revealed five predominant STs in A. baumannii genomes, linked to their geographical sources of isolation. Furthermore, the resistome and virulome of A. baumannii followed an evolutionary pattern consistent with their pan-genome evolution. Among the major STs, we observed significant variations in resistant genes against "aminoglycoside" and "sulphonamide" antibiotics, highlighting the role of genotypic variations in determining resistance profiles. Furthermore, the presence of virulence factor genes, particularly exotoxin and nutritional / metabolic factor genes, played a crucial role in distinguishing the major STs, suggesting a potential link between genetic makeup and pathogenicity. Understanding these associations can provide valuable insights into A. baumannii's virulence potential and clinical outcomes, enabling the development of effective strategies to combat infections caused by this opportunistic pathogen.},
}
RevDate: 2024-02-28
Genomic epidemiology of the primary methicillin-resistant Staphylococcus aureus clones causing invasive infections in Paraguayan children.
Microbiology spectrum [Epub ahead of print].
UNLABELLED: Methicillin-resistant Staphylococcus aureus (MRSA) is one of the major human pathogens. It could carry numerous resistance genes and virulence factors in its genome, some of which are related to the severity of the infection. An observational, descriptive, cross-sectional study was designed to molecularly analyze MRSA isolates that cause invasive infections in Paraguayan children from 2009 to 2013. Ten representative MRSA isolates of the main clonal complex identified were analyzed with short-read paired-end sequencing and assessed for the virulome, resistome, and phylogenetic relationships. All the genetically linked MRSA isolates were recovered from diverse clinical sources, patients, and hospitals at broad gap periods. The pan-genomic analysis of these clones revealed three major and different clonal complexes (CC30, CC5, and CC8), each composed of clones closely related to each other. The CC30 genomes prove to be a successful clone, strongly installed and disseminated throughout our country, and closely related to other CC30 public genomes from the region and the world. The CC5 shows the highest genetic variability, and the CC8 carried the complete arginine catabolic mobile element (ACME), closely related to the USA300-NAE-ACME+, identified as the major cause of CA-MRSA infections in North America. Multiple virulence and resistance genes were identified for the first time in this study, highlighting the complex virulence profiles of MRSA circulating in the country. This study opens a wide range of new possibilities for future projects and trials to improve the existing knowledge on the epidemiology of MRSA circulating in Paraguay.
IMPORTANCE: The increasing prevalence of methicillin-resistant Staphylococcus aureus (MRSA) is a public health problem worldwide. The most frequent MRSA clones identified in Paraguay in previous studies (including community and hospital acquired) were the Pediatric (CC5-ST5-IV), the Cordobes-Chilean (CC5-ST5-I), the SouthWest Pacific (CC30-ST30-IV), and the Brazilian (CC8-ST239-III) clones. In this study, the pan-genomic analysis of the most representative MRSA clones circulating in invasive infection in Paraguayan children over the years 2009-2013, such as the CC30-ST30-IV, CC5-ST5-IV, and CC8-ST8-IV, was carried out to evaluate their genetic diversity, their repertoire of virulence factors, and antimicrobial resistance determinants. This revealed multiple virulence and resistance genes, highlighting the complex virulence profiles of MRSA circulating in Paraguay. Our work is the first genomic study of MRSA in Paraguay and will contribute to the development of genomic surveillance in the region and our understanding of the global epidemiology of this pathogen.
Additional Links: PMID-38415665
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38415665,
year = {2024},
author = {Guillén, R and Salinas, C and Mendoza-Álvarez, A and Rubio Rodríguez, LA and Díaz-de Usera, A and Lorenzo-Salazar, JM and González-Montelongo, R and Flores, C and Rodríguez, F},
title = {Genomic epidemiology of the primary methicillin-resistant Staphylococcus aureus clones causing invasive infections in Paraguayan children.},
journal = {Microbiology spectrum},
volume = {},
number = {},
pages = {e0301223},
doi = {10.1128/spectrum.03012-23},
pmid = {38415665},
issn = {2165-0497},
abstract = {UNLABELLED: Methicillin-resistant Staphylococcus aureus (MRSA) is one of the major human pathogens. It could carry numerous resistance genes and virulence factors in its genome, some of which are related to the severity of the infection. An observational, descriptive, cross-sectional study was designed to molecularly analyze MRSA isolates that cause invasive infections in Paraguayan children from 2009 to 2013. Ten representative MRSA isolates of the main clonal complex identified were analyzed with short-read paired-end sequencing and assessed for the virulome, resistome, and phylogenetic relationships. All the genetically linked MRSA isolates were recovered from diverse clinical sources, patients, and hospitals at broad gap periods. The pan-genomic analysis of these clones revealed three major and different clonal complexes (CC30, CC5, and CC8), each composed of clones closely related to each other. The CC30 genomes prove to be a successful clone, strongly installed and disseminated throughout our country, and closely related to other CC30 public genomes from the region and the world. The CC5 shows the highest genetic variability, and the CC8 carried the complete arginine catabolic mobile element (ACME), closely related to the USA300-NAE-ACME+, identified as the major cause of CA-MRSA infections in North America. Multiple virulence and resistance genes were identified for the first time in this study, highlighting the complex virulence profiles of MRSA circulating in the country. This study opens a wide range of new possibilities for future projects and trials to improve the existing knowledge on the epidemiology of MRSA circulating in Paraguay.
IMPORTANCE: The increasing prevalence of methicillin-resistant Staphylococcus aureus (MRSA) is a public health problem worldwide. The most frequent MRSA clones identified in Paraguay in previous studies (including community and hospital acquired) were the Pediatric (CC5-ST5-IV), the Cordobes-Chilean (CC5-ST5-I), the SouthWest Pacific (CC30-ST30-IV), and the Brazilian (CC8-ST239-III) clones. In this study, the pan-genomic analysis of the most representative MRSA clones circulating in invasive infection in Paraguayan children over the years 2009-2013, such as the CC30-ST30-IV, CC5-ST5-IV, and CC8-ST8-IV, was carried out to evaluate their genetic diversity, their repertoire of virulence factors, and antimicrobial resistance determinants. This revealed multiple virulence and resistance genes, highlighting the complex virulence profiles of MRSA circulating in Paraguay. Our work is the first genomic study of MRSA in Paraguay and will contribute to the development of genomic surveillance in the region and our understanding of the global epidemiology of this pathogen.},
}
RevDate: 2024-02-27
Pangenome analysis of Shewanella xiamenensis revealed important genetic traits concerning genetic diversity, pathogenicity and antibiotic resistance.
BMC genomics, 25(1):216.
BACKGROUND: Shewanella xiamenensis, widely distributed in natural environments, has long been considered as opportunistic pathogen. Recently, significant changes in the resistance spectrum have been observed in S. xiamenensis, due to acquired antibiotic resistance genes. Therefore, a pan-genome analysis was conducted to illuminate the genomic changes in S. xiamenensis.
RESULTS: Phylogenetic analysis revealed three major clusters and three singletons, among which close relationship between several strains was discovered, regardless of their host and niches. The "open" genomes with diversity of accessory and strain-specific genomes took advantage towards diversity environments. The purifying selection pressure was the main force on genome evolution, especially in conservative genes. Only 53 gene families were under positive selection pressure. Phenotypic resistance analysis revealed 21 strains were classified as multi-drug resistance (MDR). Ten types of antibiotic resistance genes and two heavy metal resistance operons were discovered in S. xiamenensis. Mobile genetic elements and horizontal gene transfer increased genome diversity and were closely related to MDR strains. S. xiamenensis carried a variety of virulence genes and macromolecular secretion systems, indicating their important roles in pathogenicity and adaptability. Type IV secretion system was discovered in 15 genomes with various sequence structures, indicating it was originated from different donors through horizontal gene transfer.
CONCLUSIONS: This study provided with a detailed insight into the changes in the pan-genome of S. xiamenensis, highlighting its capability to acquire new mobile genetic elements and resistance genes for its adaptation to environment and pathogenicity to human and animals.
Additional Links: PMID-38413855
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38413855,
year = {2024},
author = {Wang, H and Xia, F and Xia, Y and Li, J and Hu, Y and Deng, Y and Zou, M},
title = {Pangenome analysis of Shewanella xiamenensis revealed important genetic traits concerning genetic diversity, pathogenicity and antibiotic resistance.},
journal = {BMC genomics},
volume = {25},
number = {1},
pages = {216},
pmid = {38413855},
issn = {1471-2164},
support = {No. 2023JJ30942//Natural Science Foundation of Hunan Province/ ; },
abstract = {BACKGROUND: Shewanella xiamenensis, widely distributed in natural environments, has long been considered as opportunistic pathogen. Recently, significant changes in the resistance spectrum have been observed in S. xiamenensis, due to acquired antibiotic resistance genes. Therefore, a pan-genome analysis was conducted to illuminate the genomic changes in S. xiamenensis.
RESULTS: Phylogenetic analysis revealed three major clusters and three singletons, among which close relationship between several strains was discovered, regardless of their host and niches. The "open" genomes with diversity of accessory and strain-specific genomes took advantage towards diversity environments. The purifying selection pressure was the main force on genome evolution, especially in conservative genes. Only 53 gene families were under positive selection pressure. Phenotypic resistance analysis revealed 21 strains were classified as multi-drug resistance (MDR). Ten types of antibiotic resistance genes and two heavy metal resistance operons were discovered in S. xiamenensis. Mobile genetic elements and horizontal gene transfer increased genome diversity and were closely related to MDR strains. S. xiamenensis carried a variety of virulence genes and macromolecular secretion systems, indicating their important roles in pathogenicity and adaptability. Type IV secretion system was discovered in 15 genomes with various sequence structures, indicating it was originated from different donors through horizontal gene transfer.
CONCLUSIONS: This study provided with a detailed insight into the changes in the pan-genome of S. xiamenensis, highlighting its capability to acquire new mobile genetic elements and resistance genes for its adaptation to environment and pathogenicity to human and animals.},
}
RevDate: 2024-02-27
Pan-chloroplast genomes for accession-specific marker development in Hibiscus syriacus.
Scientific data, 11(1):246.
Hibiscus syriacus L. is a renowned ornamental plant. We constructed 95 chloroplast genomes of H. syriacus L. cultivars using a short-read sequencing platform (Illumina) and a long-read sequencing platform (Oxford Nanopore Technology). The following genome assembly, we delineate quadripartite structures encompassing large single-copy, small single-copy, and inverted repeat (IRa and IRb) regions, from 160,231 bp to 161,041 bp. Our comprehensive analyses confirmed the presence of 79 protein-coding genes, 30 tRNA genes, and 4 rRNA genes in the pan-chloroplast genome, consistent with prior research on the H. syriacus chloroplast genome. Subsequent pangenome analysis unveiled widespread genome sequence conservation alongside unique cultivar-specific variant patterns consisting of 193 single-nucleotide polymorphisms and 61 insertions or deletions. The region containing intra-species variant patterns, as identified in this study, has the potential to develop accession-specific molecular markers, enhancing precision in cultivar classification. These findings are anticipated to drive advancements in breeding strategies, augment biodiversity, and unlock the agricultural potential inherent in H. syriacus.
Additional Links: PMID-38413611
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38413611,
year = {2024},
author = {Go, S and Koo, H and Jung, M and Hong, S and Yi, G and Kim, YM},
title = {Pan-chloroplast genomes for accession-specific marker development in Hibiscus syriacus.},
journal = {Scientific data},
volume = {11},
number = {1},
pages = {246},
pmid = {38413611},
issn = {2052-4463},
abstract = {Hibiscus syriacus L. is a renowned ornamental plant. We constructed 95 chloroplast genomes of H. syriacus L. cultivars using a short-read sequencing platform (Illumina) and a long-read sequencing platform (Oxford Nanopore Technology). The following genome assembly, we delineate quadripartite structures encompassing large single-copy, small single-copy, and inverted repeat (IRa and IRb) regions, from 160,231 bp to 161,041 bp. Our comprehensive analyses confirmed the presence of 79 protein-coding genes, 30 tRNA genes, and 4 rRNA genes in the pan-chloroplast genome, consistent with prior research on the H. syriacus chloroplast genome. Subsequent pangenome analysis unveiled widespread genome sequence conservation alongside unique cultivar-specific variant patterns consisting of 193 single-nucleotide polymorphisms and 61 insertions or deletions. The region containing intra-species variant patterns, as identified in this study, has the potential to develop accession-specific molecular markers, enhancing precision in cultivar classification. These findings are anticipated to drive advancements in breeding strategies, augment biodiversity, and unlock the agricultural potential inherent in H. syriacus.},
}
RevDate: 2024-02-27
Genomic revisitation and reclassification of the genus Providencia.
mSphere [Epub ahead of print].
Members of Providencia, although typically opportunistic, can cause severe infections in immunocompromised hosts. Recent advances in genome sequencing provide an opportunity for more precise study of this genus. In this study, we first identified and characterized a novel species named Providencia zhijiangensis sp. nov. It has ≤88.23% average nucleotide identity (ANI) and ≤31.8% in silico DNA-DNA hybridization (dDDH) values with all known Providencia species, which fall significantly below the species-defining thresholds. Interestingly, we found that Providencia stuartii and Providencia thailandensis actually fall under the same species, evidenced by an ANI of 98.59% and a dDDH value of 90.4%. By fusing ANI with phylogeny, we have reclassified 545 genomes within this genus into 20 species, including seven unnamed taxa (provisionally titled Taxon 1-7), which can be further subdivided into 23 lineages. Pangenomic analysis identified 1,550 genus-core genes in Providencia, with coenzymes being the predominant category at 10.56%, suggesting significant intermediate metabolism activity. Resistance analysis revealed that most lineages of the genus (82.61%, 19/23) carry a high number of antibiotic-resistance genes (ARGs) and display diverse resistance profiles. Notably, the majority of ARGs are located on plasmids, underscoring the significant role of plasmids in the resistance evolution within this genus. Three species or lineages (P. stuartii, Taxon 3, and Providencia hangzhouensis L12) that possess the highest number of carbapenem-resistance genes suggest their potential influence on clinical treatment. These findings underscore the need for continued surveillance and study of this genus, particularly due to their role in harboring antibiotic-resistance genes.IMPORTANCEThe Providencia genus, known to harbor opportunistic pathogens, has been a subject of interest due to its potential to cause severe infections, particularly in vulnerable individuals. Our research offers groundbreaking insights into this genus, unveiling a novel species, Providencia zhijiangensis sp. nov., and highlighting the need for a re-evaluation of existing classifications. Our comprehensive genomic assessment offers a detailed classification of 545 genomes into distinct species and lineages, revealing the rich biodiversity and intricate species diversity within the genus. The substantial presence of antibiotic-resistance genes in the Providencia genus underscores potential challenges for public health and clinical treatments. Our study highlights the pressing need for increased surveillance and research, enriching our understanding of antibiotic resistance in this realm.
Additional Links: PMID-38412041
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38412041,
year = {2024},
author = {Dong, X and Jia, H and Yu, Y and Xiang, Y and Zhang, Y},
title = {Genomic revisitation and reclassification of the genus Providencia.},
journal = {mSphere},
volume = {},
number = {},
pages = {e0073123},
doi = {10.1128/msphere.00731-23},
pmid = {38412041},
issn = {2379-5042},
abstract = {Members of Providencia, although typically opportunistic, can cause severe infections in immunocompromised hosts. Recent advances in genome sequencing provide an opportunity for more precise study of this genus. In this study, we first identified and characterized a novel species named Providencia zhijiangensis sp. nov. It has ≤88.23% average nucleotide identity (ANI) and ≤31.8% in silico DNA-DNA hybridization (dDDH) values with all known Providencia species, which fall significantly below the species-defining thresholds. Interestingly, we found that Providencia stuartii and Providencia thailandensis actually fall under the same species, evidenced by an ANI of 98.59% and a dDDH value of 90.4%. By fusing ANI with phylogeny, we have reclassified 545 genomes within this genus into 20 species, including seven unnamed taxa (provisionally titled Taxon 1-7), which can be further subdivided into 23 lineages. Pangenomic analysis identified 1,550 genus-core genes in Providencia, with coenzymes being the predominant category at 10.56%, suggesting significant intermediate metabolism activity. Resistance analysis revealed that most lineages of the genus (82.61%, 19/23) carry a high number of antibiotic-resistance genes (ARGs) and display diverse resistance profiles. Notably, the majority of ARGs are located on plasmids, underscoring the significant role of plasmids in the resistance evolution within this genus. Three species or lineages (P. stuartii, Taxon 3, and Providencia hangzhouensis L12) that possess the highest number of carbapenem-resistance genes suggest their potential influence on clinical treatment. These findings underscore the need for continued surveillance and study of this genus, particularly due to their role in harboring antibiotic-resistance genes.IMPORTANCEThe Providencia genus, known to harbor opportunistic pathogens, has been a subject of interest due to its potential to cause severe infections, particularly in vulnerable individuals. Our research offers groundbreaking insights into this genus, unveiling a novel species, Providencia zhijiangensis sp. nov., and highlighting the need for a re-evaluation of existing classifications. Our comprehensive genomic assessment offers a detailed classification of 545 genomes into distinct species and lineages, revealing the rich biodiversity and intricate species diversity within the genus. The substantial presence of antibiotic-resistance genes in the Providencia genus underscores potential challenges for public health and clinical treatments. Our study highlights the pressing need for increased surveillance and research, enriching our understanding of antibiotic resistance in this realm.},
}
RevDate: 2024-02-27
Lineage-specific evolution of Aquibium, a close relative of Mesorhizobium, during habitat adaptation.
Applied and environmental microbiology [Epub ahead of print].
The novel genus Aquibium that lacks nitrogenase was recently reclassified from the Mesorhizobium genus. The genomes of Aquibium species isolated from water were smaller and had higher GC contents than those of Mesorhizobium species. Six Mesorhizobium species lacking nitrogenase were found to exhibit low similarity in the average nucleotide identity values to the other 24 Mesorhizobium species. Therefore, they were classified as the non-N2-fixing Mesorhizobium lineage (N-ML), an evolutionary intermediate species. The results of our phylogenomic analyses and the loss of Rhizobiales-specific fur/mur indicated that Mesorhizobium species may have evolved from Aquibium species through an ecological transition. Halotolerant and alkali-resistant Aquibium and Mesorhizobium microcysteis belonging to N-ML possessed many tripartite ATP-independent periplasmic transporter and sodium/proton antiporter subunits composed of seven genes (mrpABCDEFG). These genes were not present in the N2-fixing Mesorhizobium lineage (ML), suggesting that genes acquired for adaptation to highly saline and alkaline environments were lost during the evolution of ML as the habitat changed to soil. Land-to-water habitat changes in Aquibium species, close relatives of Mesorhizobium species, could have influenced their genomic evolution by the gain and loss of genes. Our study indicated that lineage-specific evolution could have played a significant role in shaping their genome architecture and conferring their ability to thrive in different habitats.IMPORTANCEPhylogenetic analyses revealed that the Aquibium lineage (AL) and non-N2-fixing Mesorhizobium lineage (N-ML) were monophyletically grouped into distinct clusters separate from the N2-fixing Mesorhizobium lineage (ML). The N-ML, an evolutionary intermediate species having characteristics of both ancestral and descendant species, could provide a genomic snapshot of the genetic changes that occur during adaptation. Genomic analyses of AL, N-ML, and ML revealed that changes in the levels of genes related to transporters, chemotaxis, and nitrogen fixation likely reflect adaptations to different environmental conditions. Our study sheds light on the complex and dynamic nature of the evolution of rhizobia in response to changes in their environment and highlights the crucial role of genomic analysis in understanding these processes.
Additional Links: PMID-38412007
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38412007,
year = {2024},
author = {Kim, M and Kim, W and Park, Y and Jung, J and Park, W},
title = {Lineage-specific evolution of Aquibium, a close relative of Mesorhizobium, during habitat adaptation.},
journal = {Applied and environmental microbiology},
volume = {},
number = {},
pages = {e0209123},
doi = {10.1128/aem.02091-23},
pmid = {38412007},
issn = {1098-5336},
abstract = {The novel genus Aquibium that lacks nitrogenase was recently reclassified from the Mesorhizobium genus. The genomes of Aquibium species isolated from water were smaller and had higher GC contents than those of Mesorhizobium species. Six Mesorhizobium species lacking nitrogenase were found to exhibit low similarity in the average nucleotide identity values to the other 24 Mesorhizobium species. Therefore, they were classified as the non-N2-fixing Mesorhizobium lineage (N-ML), an evolutionary intermediate species. The results of our phylogenomic analyses and the loss of Rhizobiales-specific fur/mur indicated that Mesorhizobium species may have evolved from Aquibium species through an ecological transition. Halotolerant and alkali-resistant Aquibium and Mesorhizobium microcysteis belonging to N-ML possessed many tripartite ATP-independent periplasmic transporter and sodium/proton antiporter subunits composed of seven genes (mrpABCDEFG). These genes were not present in the N2-fixing Mesorhizobium lineage (ML), suggesting that genes acquired for adaptation to highly saline and alkaline environments were lost during the evolution of ML as the habitat changed to soil. Land-to-water habitat changes in Aquibium species, close relatives of Mesorhizobium species, could have influenced their genomic evolution by the gain and loss of genes. Our study indicated that lineage-specific evolution could have played a significant role in shaping their genome architecture and conferring their ability to thrive in different habitats.IMPORTANCEPhylogenetic analyses revealed that the Aquibium lineage (AL) and non-N2-fixing Mesorhizobium lineage (N-ML) were monophyletically grouped into distinct clusters separate from the N2-fixing Mesorhizobium lineage (ML). The N-ML, an evolutionary intermediate species having characteristics of both ancestral and descendant species, could provide a genomic snapshot of the genetic changes that occur during adaptation. Genomic analyses of AL, N-ML, and ML revealed that changes in the levels of genes related to transporters, chemotaxis, and nitrogen fixation likely reflect adaptations to different environmental conditions. Our study sheds light on the complex and dynamic nature of the evolution of rhizobia in response to changes in their environment and highlights the crucial role of genomic analysis in understanding these processes.},
}
RevDate: 2024-02-27
Strain-Specific Anti-Inflammatory Effects of Faecalibacterium prausnitzii Strain KBL1027 in Koreans.
Probiotics and antimicrobial proteins [Epub ahead of print].
Faecalibacterium prausnitzii is one of the most dominant commensal bacteria in the human gut, and certain anti-inflammatory functions have been attributed to a single microbial anti-inflammatory molecule (MAM). Simultaneously, substantial diversity among F. prausnitzii strains is acknowledged, emphasizing the need for strain-level functional studies aimed at developing innovative probiotics. Here, two distinct F. prausnitzii strains, KBL1026 and KBL1027, were isolated from Korean donors, exhibiting notable differences in the relative abundance of F. prausnitzii. Both strains were identified as the core Faecalibacterium amplicon sequence variant (ASV) within the healthy Korean cohort, and their MAM sequences showed a high similarity of 98.6%. However, when a single strain was introduced to mice with dextran sulfate sodium (DSS)-induced colitis, KBL1027 showed the most significant ameliorative effects, including alleviation of colonic inflammation and restoration of gut microbial dysbiosis. Moreover, the supernatant from KBL1027 elevated the secretion of IL-10 cytokine more than that of KBL1026 in mouse bone marrow-derived macrophage (BMDM) cells, suggesting that the strain-specific, anti-inflammatory efficacy of KBL1027 might involve effector compounds other than MAM. Through analysis of the Faecalibacterium pan-genome and comparative genomics, strain-specific functions related to extracellular polysaccharide biosynthesis were identified in KBL1027, which could contribute to the observed morphological disparities. Collectively, our findings highlight the strain-specific, anti-inflammatory functions of F. prausnitzii, even within the same core ASV, emphasizing the influence of their human origin.
Additional Links: PMID-38411865
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38411865,
year = {2024},
author = {Seo, B and Jeon, K and Kim, WK and Jang, YJ and Cha, KH and Ko, G},
title = {Strain-Specific Anti-Inflammatory Effects of Faecalibacterium prausnitzii Strain KBL1027 in Koreans.},
journal = {Probiotics and antimicrobial proteins},
volume = {},
number = {},
pages = {},
pmid = {38411865},
issn = {1867-1314},
support = {E0170600-07//Korea Food Research Institute/ ; RS-2023-00223831//National Research Foundation of Korea/ ; },
abstract = {Faecalibacterium prausnitzii is one of the most dominant commensal bacteria in the human gut, and certain anti-inflammatory functions have been attributed to a single microbial anti-inflammatory molecule (MAM). Simultaneously, substantial diversity among F. prausnitzii strains is acknowledged, emphasizing the need for strain-level functional studies aimed at developing innovative probiotics. Here, two distinct F. prausnitzii strains, KBL1026 and KBL1027, were isolated from Korean donors, exhibiting notable differences in the relative abundance of F. prausnitzii. Both strains were identified as the core Faecalibacterium amplicon sequence variant (ASV) within the healthy Korean cohort, and their MAM sequences showed a high similarity of 98.6%. However, when a single strain was introduced to mice with dextran sulfate sodium (DSS)-induced colitis, KBL1027 showed the most significant ameliorative effects, including alleviation of colonic inflammation and restoration of gut microbial dysbiosis. Moreover, the supernatant from KBL1027 elevated the secretion of IL-10 cytokine more than that of KBL1026 in mouse bone marrow-derived macrophage (BMDM) cells, suggesting that the strain-specific, anti-inflammatory efficacy of KBL1027 might involve effector compounds other than MAM. Through analysis of the Faecalibacterium pan-genome and comparative genomics, strain-specific functions related to extracellular polysaccharide biosynthesis were identified in KBL1027, which could contribute to the observed morphological disparities. Collectively, our findings highlight the strain-specific, anti-inflammatory functions of F. prausnitzii, even within the same core ASV, emphasizing the influence of their human origin.},
}
RevDate: 2024-02-27
Defense systems and horizontal gene transfer in bacteria.
bioRxiv : the preprint server for biology pii:2024.02.09.579689.
Horizontal gene transfer (HGT) is a fundamental process in the evolution of prokaryotes, making major contributions to diversification and adaptation. Typically, HGT is facilitated by mobile genetic elements (MGEs), such as conjugative plasmids and phages that generally impose fitness costs on their hosts. However, a substantial fraction of bacterial genes is involved in defense mechanisms that limit the propagation of MGEs, raising the possibility that they can actively restrict HGT. Here we examine whether defense systems curb HGT by exploring the connections between HGT rate and the presence of 73 defense systems in 12 bacterial species. We found that only 6 defense systems, 3 of which are different CRISPR-Cas subtypes, are associated with the reduced gene gain rate on the scale of species evolution. The hosts of such defense systems tend to have a smaller pangenome size and harbor fewer phage-related genes compared to genomes lacking these systems, suggesting that these defense mechanisms inhibit HGT by limiting the integration of prophages. We hypothesize that restriction of HGT by defense systems is species-specific and depends on various ecological and genetic factors, including the burden of MGEs and fitness effect of HGT in bacterial populations.
Additional Links: PMID-38410456
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38410456,
year = {2024},
author = {Kogay, R and Wolf, YI and Koonin, EV},
title = {Defense systems and horizontal gene transfer in bacteria.},
journal = {bioRxiv : the preprint server for biology},
volume = {},
number = {},
pages = {},
doi = {10.1101/2024.02.09.579689},
pmid = {38410456},
abstract = {Horizontal gene transfer (HGT) is a fundamental process in the evolution of prokaryotes, making major contributions to diversification and adaptation. Typically, HGT is facilitated by mobile genetic elements (MGEs), such as conjugative plasmids and phages that generally impose fitness costs on their hosts. However, a substantial fraction of bacterial genes is involved in defense mechanisms that limit the propagation of MGEs, raising the possibility that they can actively restrict HGT. Here we examine whether defense systems curb HGT by exploring the connections between HGT rate and the presence of 73 defense systems in 12 bacterial species. We found that only 6 defense systems, 3 of which are different CRISPR-Cas subtypes, are associated with the reduced gene gain rate on the scale of species evolution. The hosts of such defense systems tend to have a smaller pangenome size and harbor fewer phage-related genes compared to genomes lacking these systems, suggesting that these defense mechanisms inhibit HGT by limiting the integration of prophages. We hypothesize that restriction of HGT by defense systems is species-specific and depends on various ecological and genetic factors, including the burden of MGEs and fitness effect of HGT in bacterial populations.},
}
RevDate: 2024-02-26
Genomic insights into an extensively drug-resistant and hypervirulent Burkholderia dolosa N149 isolate of a novel sequence type (ST2237) from a Vietnamese patient hospitalized for stroke.
Journal of global antimicrobial resistance pii:S2213-7165(24)00036-5 [Epub ahead of print].
OBJECTIVES: Burkholderia dolosa is a clinically important opportunistic pathogen in inpatients. Here we characterized an extensively drug-resistant and hypervirulent B. dolosa isolate from a patient hospitalized for stroke.
METHODS: Resistance to 41 antibiotics was tested with the agar disc diffusion, minimum inhibitory concentration, or broth microdilution method. The complete genome was assembled using short-reads and long-reads and the hybrid de novo assembly method. Allelic profiles obtained by multilocus sequence typing were analyzed using the PubMLST database. Antibiotic-resistance and virulence genes were predicted in silico using public databases and the "baargin" workflow. B. dolosa N149 phylogenetic relationships with all available B. dolosa strains and Burkholderia cepacia complex strains were analyzed using the pangenome obtained with Roary.
RESULTS: B. dolosa N149 displayed extensive resistance to 31 antibiotics and intermediate resistance to 4 antibiotics. The complete genome included three circular chromosomes (6,338,630 bp in total) and one plasmid (167,591 bp). Genotypic analysis revealed various gene clusters (acr, amr, amp, emr, ade, bla and tet) associated with resistance to 35 antibiotic classes. The major intrinsic resistance mechanisms were multidrug efflux pump alterations, inactivation and reduced permeability of targeted antibiotics. Moreover, 91 virulence genes (encoding proteins involved in adherence, formation of capsule, biofilm and colony, motility, phagocytosis inhibition, secretion systems, protease secretion, transmission and quorum sensing) were identified. B. dolosa N149 was assigned to a novel sequence type (ST2237) and formed a mono-phylogenetic clade separated from other B. dolosa strains.
CONCLUSION: This study provided insights into the antimicrobial resistance and virulence mechanisms of B. dolosa.
Additional Links: PMID-38408562
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38408562,
year = {2024},
author = {Huy, NQ and Linh, NC and Son, NT and Ngoc, DB and Tam, TTT and Hang, LTT and Thuyet, BT and Song, LH and Van Quyen, D and Hayer, J and Bañuls, AL and Sy, BT},
title = {Genomic insights into an extensively drug-resistant and hypervirulent Burkholderia dolosa N149 isolate of a novel sequence type (ST2237) from a Vietnamese patient hospitalized for stroke.},
journal = {Journal of global antimicrobial resistance},
volume = {},
number = {},
pages = {},
doi = {10.1016/j.jgar.2024.02.009},
pmid = {38408562},
issn = {2213-7173},
abstract = {OBJECTIVES: Burkholderia dolosa is a clinically important opportunistic pathogen in inpatients. Here we characterized an extensively drug-resistant and hypervirulent B. dolosa isolate from a patient hospitalized for stroke.
METHODS: Resistance to 41 antibiotics was tested with the agar disc diffusion, minimum inhibitory concentration, or broth microdilution method. The complete genome was assembled using short-reads and long-reads and the hybrid de novo assembly method. Allelic profiles obtained by multilocus sequence typing were analyzed using the PubMLST database. Antibiotic-resistance and virulence genes were predicted in silico using public databases and the "baargin" workflow. B. dolosa N149 phylogenetic relationships with all available B. dolosa strains and Burkholderia cepacia complex strains were analyzed using the pangenome obtained with Roary.
RESULTS: B. dolosa N149 displayed extensive resistance to 31 antibiotics and intermediate resistance to 4 antibiotics. The complete genome included three circular chromosomes (6,338,630 bp in total) and one plasmid (167,591 bp). Genotypic analysis revealed various gene clusters (acr, amr, amp, emr, ade, bla and tet) associated with resistance to 35 antibiotic classes. The major intrinsic resistance mechanisms were multidrug efflux pump alterations, inactivation and reduced permeability of targeted antibiotics. Moreover, 91 virulence genes (encoding proteins involved in adherence, formation of capsule, biofilm and colony, motility, phagocytosis inhibition, secretion systems, protease secretion, transmission and quorum sensing) were identified. B. dolosa N149 was assigned to a novel sequence type (ST2237) and formed a mono-phylogenetic clade separated from other B. dolosa strains.
CONCLUSION: This study provided insights into the antimicrobial resistance and virulence mechanisms of B. dolosa.},
}
RevDate: 2024-02-26
Identification of a novel CG307 sub-clade in third-generation-cephalosporin-resistant Klebsiella pneumoniae causing invasive infections in the USA.
Microbial genomics, 10(2):.
Despite the notable clinical impact, recent molecular epidemiology regarding third-generation-cephalosporin-resistant (3GC-R) Klebsiella pneumoniae in the USA remains limited. We performed whole-genome sequencing of 3GC-R K. pneumoniae bacteraemia isolates collected from March 2016 to May 2022 at a tertiary care cancer centre in Houston, TX, USA, using Illumina and Oxford Nanopore Technologies platforms. A comprehensive comparative genomic analysis was performed to dissect population structure, transmission dynamics and pan-genomic signatures of our 3GC-R K. pneumoniae population. Of the 178 3GC-R K. pneumoniae bacteraemias that occurred during our study time frame, we were able to analyse 153 (86 %) bacteraemia isolates, 126 initial and 27 recurrent isolates. While isolates belonging to the widely prevalent clonal group (CG) 258 were rarely observed, the predominant CG, 307, accounted for 37 (29 %) index isolates and displayed a significant correlation (Pearson correlation test P value=0.03) with the annual frequency of 3GC-R K. pneumoniae bacteraemia. Interestingly, only 11 % (4/37) of CG307 isolates belonged to the commonly detected 'Texas-specific' clade that has been observed in previous Texas-based K. pneumoniae antimicrobial-resistance surveillance studies. We identified nearly half of our CG307 isolates (n=18) belonged to a novel, monophyletic CG307 sub-clade characterized by the chromosomally encoded bla SHV-205 and unique accessory genome content. This CG307 sub-clade was detected in various regions of the USA, with genome sequences from 24 additional strains becoming recently available in the National Center for Biotechnology Information (NCBI) SRA database. Collectively, this study underscores the emergence and dissemination of a distinct CG307 sub-clade that is a prevalent cause of 3GC-R K. pneumoniae bacteraemia among cancer patients seen in Houston, TX, and has recently been isolated throughout the USA.
Additional Links: PMID-38407244
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38407244,
year = {2024},
author = {Selvaraj Anand, S and Wu, CT and Bremer, J and Bhatti, M and Treangen, TJ and Kalia, A and Shelburne, SA and Shropshire, WC},
title = {Identification of a novel CG307 sub-clade in third-generation-cephalosporin-resistant Klebsiella pneumoniae causing invasive infections in the USA.},
journal = {Microbial genomics},
volume = {10},
number = {2},
pages = {},
doi = {10.1099/mgen.0.001201},
pmid = {38407244},
issn = {2057-5858},
abstract = {Despite the notable clinical impact, recent molecular epidemiology regarding third-generation-cephalosporin-resistant (3GC-R) Klebsiella pneumoniae in the USA remains limited. We performed whole-genome sequencing of 3GC-R K. pneumoniae bacteraemia isolates collected from March 2016 to May 2022 at a tertiary care cancer centre in Houston, TX, USA, using Illumina and Oxford Nanopore Technologies platforms. A comprehensive comparative genomic analysis was performed to dissect population structure, transmission dynamics and pan-genomic signatures of our 3GC-R K. pneumoniae population. Of the 178 3GC-R K. pneumoniae bacteraemias that occurred during our study time frame, we were able to analyse 153 (86 %) bacteraemia isolates, 126 initial and 27 recurrent isolates. While isolates belonging to the widely prevalent clonal group (CG) 258 were rarely observed, the predominant CG, 307, accounted for 37 (29 %) index isolates and displayed a significant correlation (Pearson correlation test P value=0.03) with the annual frequency of 3GC-R K. pneumoniae bacteraemia. Interestingly, only 11 % (4/37) of CG307 isolates belonged to the commonly detected 'Texas-specific' clade that has been observed in previous Texas-based K. pneumoniae antimicrobial-resistance surveillance studies. We identified nearly half of our CG307 isolates (n=18) belonged to a novel, monophyletic CG307 sub-clade characterized by the chromosomally encoded bla SHV-205 and unique accessory genome content. This CG307 sub-clade was detected in various regions of the USA, with genome sequences from 24 additional strains becoming recently available in the National Center for Biotechnology Information (NCBI) SRA database. Collectively, this study underscores the emergence and dissemination of a distinct CG307 sub-clade that is a prevalent cause of 3GC-R K. pneumoniae bacteraemia among cancer patients seen in Houston, TX, and has recently been isolated throughout the USA.},
}
RevDate: 2024-02-25
Segmental duplications drive the evolution of accessory regions in a major crop pathogen.
The New phytologist [Epub ahead of print].
Many pathogens evolved compartmentalized genomes with conserved core and variable accessory regions (ARs) that carry effector genes mediating virulence. The fungal plant pathogen Fusarium oxysporum has such ARs, often spanning entire chromosomes. The presence of specific ARs influences the host range, and horizontal transfer of ARs can modify the pathogenicity of the receiving strain. However, how these ARs evolve in strains that infect the same host remains largely unknown. We defined the pan-genome of 69 diverse F. oxysporum strains that cause Fusarium wilt of banana, a significant constraint to global banana production, and analyzed the diversity and evolution of the ARs. Accessory regions in F. oxysporum strains infecting the same banana cultivar are highly diverse, and we could not identify any shared genomic regions and in planta-induced effectors. We demonstrate that segmental duplications drive the evolution of ARs. Furthermore, we show that recent segmental duplications specifically in accessory chromosomes cause the expansion of ARs in F. oxysporum. Taken together, we conclude that extensive recent duplications drive the evolution of ARs in F. oxysporum, which contribute to the evolution of virulence.
Additional Links: PMID-38402521
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38402521,
year = {2024},
author = {van Westerhoven, AC and Aguilera-Galvez, C and Nakasato-Tagami, G and Shi-Kunne, X and Martinez de la Parte, E and Chavarro-Carrero, E and Meijer, HJG and Feurtey, A and Maryani, N and Ordóñez, N and Schneiders, H and Nijbroek, K and Wittenberg, AHJ and Hofstede, R and García-Bastidas, F and Sørensen, A and Swennen, R and Drenth, A and Stukenbrock, EH and Kema, GHJ and Seidl, MF},
title = {Segmental duplications drive the evolution of accessory regions in a major crop pathogen.},
journal = {The New phytologist},
volume = {},
number = {},
pages = {},
doi = {10.1111/nph.19604},
pmid = {38402521},
issn = {1469-8137},
support = {AG - 442//Bill and Melinda Gates Foundation/ ; 20 04 04 02//Stichting Dioraphte/ ; },
abstract = {Many pathogens evolved compartmentalized genomes with conserved core and variable accessory regions (ARs) that carry effector genes mediating virulence. The fungal plant pathogen Fusarium oxysporum has such ARs, often spanning entire chromosomes. The presence of specific ARs influences the host range, and horizontal transfer of ARs can modify the pathogenicity of the receiving strain. However, how these ARs evolve in strains that infect the same host remains largely unknown. We defined the pan-genome of 69 diverse F. oxysporum strains that cause Fusarium wilt of banana, a significant constraint to global banana production, and analyzed the diversity and evolution of the ARs. Accessory regions in F. oxysporum strains infecting the same banana cultivar are highly diverse, and we could not identify any shared genomic regions and in planta-induced effectors. We demonstrate that segmental duplications drive the evolution of ARs. Furthermore, we show that recent segmental duplications specifically in accessory chromosomes cause the expansion of ARs in F. oxysporum. Taken together, we conclude that extensive recent duplications drive the evolution of ARs in F. oxysporum, which contribute to the evolution of virulence.},
}
RevDate: 2024-02-24
Decoding the Genomic Profile of the Halomicroarcula Genus: Comparative Analysis and Characterization of Two Novel Species.
Microorganisms, 12(2):.
The genus Halomicroarcula, classified within the family Haloarculaceae, presently comprises eight haloarchaeal species isolated from diverse saline habitats, such as solar salterns, hypersaline soils, marine salt, and marine algae. Here, a detailed taxogenomic study and comparative genomic analysis of the genus Halomicroarcula was carried out. In addition, two strains, designated S1CR25-12[T] and S3CR25-11[T], that were isolated from hypersaline soils located in the Odiel Saltmarshes in Huelva (Spain) were included in this study. The 16S rRNA and rpoB' gene sequence analyses affiliated the two strains to the genus Halomicroarcula. Typically, the species of the genus Halomicroarcula possess multiple heterogeneous copies of the 16S rRNA gene, which can lead to misclassification of the taxa and overestimation of the prokaryotic diversity. In contrast, the application of overall genome relatedness indexes (OGRIs) augments the capacity for the precise taxonomic classification and categorization of prokaryotic organisms. The relatedness indexes of the two new isolates, particularly digital DNA-DNA hybridization (dDDH), orthologous average nucleotide identity (OrthoANI), and average amino acid identity (AAI), confirmed that strains S1CR25-12[T] (= CECT 30620[T] = CCM 9252[T]) and S3CR25-11[T] (= CECT 30621[T] = CCM 9254[T]) constitute two novel species of the genus Halomicroarcula. The names Halomicroarcula saliterrae sp. nov. and Halomicroarcula onubensis sp. nov. are proposed for S1CR25-12[T] and S3CR25-11[T], respectively. Metagenomic fragment recruitment analysis, conducted using seven shotgun metagenomic datasets, revealed that the species belonging to the genus Halomicroarcula were predominantly recruited from hypersaline soils found in the Odiel Saltmarshes and the ponds of salterns with high salt concentrations. This reinforces the understanding of the extreme halophilic characteristics associated with the genus Halomicroarcula. Finally, comparing pan-genomes across the twenty Halomicroarcula and Haloarcula species allowed for the identification of commonalities and differences between the species of these two related genera.
Additional Links: PMID-38399738
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38399738,
year = {2024},
author = {Straková, D and Sánchez-Porro, C and de la Haba, RR and Ventosa, A},
title = {Decoding the Genomic Profile of the Halomicroarcula Genus: Comparative Analysis and Characterization of Two Novel Species.},
journal = {Microorganisms},
volume = {12},
number = {2},
pages = {},
pmid = {38399738},
issn = {2076-2607},
support = {PID2020-118136GB-I00//MCIN/AEI/10.13039/501100011033/ ; P20_01066 and BIO-213//Junta de Andalucía/ ; },
abstract = {The genus Halomicroarcula, classified within the family Haloarculaceae, presently comprises eight haloarchaeal species isolated from diverse saline habitats, such as solar salterns, hypersaline soils, marine salt, and marine algae. Here, a detailed taxogenomic study and comparative genomic analysis of the genus Halomicroarcula was carried out. In addition, two strains, designated S1CR25-12[T] and S3CR25-11[T], that were isolated from hypersaline soils located in the Odiel Saltmarshes in Huelva (Spain) were included in this study. The 16S rRNA and rpoB' gene sequence analyses affiliated the two strains to the genus Halomicroarcula. Typically, the species of the genus Halomicroarcula possess multiple heterogeneous copies of the 16S rRNA gene, which can lead to misclassification of the taxa and overestimation of the prokaryotic diversity. In contrast, the application of overall genome relatedness indexes (OGRIs) augments the capacity for the precise taxonomic classification and categorization of prokaryotic organisms. The relatedness indexes of the two new isolates, particularly digital DNA-DNA hybridization (dDDH), orthologous average nucleotide identity (OrthoANI), and average amino acid identity (AAI), confirmed that strains S1CR25-12[T] (= CECT 30620[T] = CCM 9252[T]) and S3CR25-11[T] (= CECT 30621[T] = CCM 9254[T]) constitute two novel species of the genus Halomicroarcula. The names Halomicroarcula saliterrae sp. nov. and Halomicroarcula onubensis sp. nov. are proposed for S1CR25-12[T] and S3CR25-11[T], respectively. Metagenomic fragment recruitment analysis, conducted using seven shotgun metagenomic datasets, revealed that the species belonging to the genus Halomicroarcula were predominantly recruited from hypersaline soils found in the Odiel Saltmarshes and the ponds of salterns with high salt concentrations. This reinforces the understanding of the extreme halophilic characteristics associated with the genus Halomicroarcula. Finally, comparing pan-genomes across the twenty Halomicroarcula and Haloarcula species allowed for the identification of commonalities and differences between the species of these two related genera.},
}
RevDate: 2024-02-24
Molecular Genomic Analyses of Enterococcus cecorum from Sepsis Outbreaks in Broilers.
Microorganisms, 12(2): pii:microorganisms12020250.
Extensive genomic analyses of Enterococcus cecorum isolates from sepsis outbreaks in broilers suggest a polyphyletic origin, likely arising from core genome mutations rather than gene acquisition. This species is a normal intestinal flora of avian species with particular isolates associated with osteomyelitis. More recently, this species has been associated with sepsis outbreaks affecting broilers during the first 3 weeks post-hatch. Understanding the genetic and management basis of this new phenotype is critical for developing strategies to mitigate this emerging problem. Phylogenomic analyses of 227 genomes suggest that sepsis isolates are polyphyletic and closely related to both commensal and osteomyelitis isolate genomes. Pangenome analyses detect no gene acquisitions that distinguish all the sepsis isolates. Core genome single nucleotide polymorphism analyses have identified a number of mutations, affecting the protein-coding sequences, that are enriched in sepsis isolates. The analysis of the protein substitutions supports the mutational origins of sepsis isolates.
Additional Links: PMID-38399654
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38399654,
year = {2024},
author = {Rhoads, DD and Pummill, J and Alrubaye, AAK},
title = {Molecular Genomic Analyses of Enterococcus cecorum from Sepsis Outbreaks in Broilers.},
journal = {Microorganisms},
volume = {12},
number = {2},
pages = {},
doi = {10.3390/microorganisms12020250},
pmid = {38399654},
issn = {2076-2607},
support = {none//Arkansas Biosciences Institute/ ; },
abstract = {Extensive genomic analyses of Enterococcus cecorum isolates from sepsis outbreaks in broilers suggest a polyphyletic origin, likely arising from core genome mutations rather than gene acquisition. This species is a normal intestinal flora of avian species with particular isolates associated with osteomyelitis. More recently, this species has been associated with sepsis outbreaks affecting broilers during the first 3 weeks post-hatch. Understanding the genetic and management basis of this new phenotype is critical for developing strategies to mitigate this emerging problem. Phylogenomic analyses of 227 genomes suggest that sepsis isolates are polyphyletic and closely related to both commensal and osteomyelitis isolate genomes. Pangenome analyses detect no gene acquisitions that distinguish all the sepsis isolates. Core genome single nucleotide polymorphism analyses have identified a number of mutations, affecting the protein-coding sequences, that are enriched in sepsis isolates. The analysis of the protein substitutions supports the mutational origins of sepsis isolates.},
}
RevDate: 2024-02-24
In-Depth Genome Characterization and Pan-Genome Analysis of Strain KMM 296, a Producer of Highly Active Alkaline Phosphatase; Proposal for the Reclassification of Cobetia litoralis and Cobetia pacifica as the Later Heterotypic Synonyms of Cobetia amphilecti and Cobetia marina, and Emended Description of the Species Cobetia amphilecti and Cobetia marina.
Biomolecules, 14(2): pii:biom14020196.
A strictly aerobic, Gram-stain-negative, rod-shaped, and motile bacterium, designated strain KMM 296, isolated from the coelomic fluid of the mussel Crenomytilus grayanus, was investigated in detail due to its ability to produce a highly active alkaline phosphatase CmAP of the structural family PhoA. A previous taxonomic study allocated the strain to the species Cobetia marina, a member of the family Halomonadaceae of the class Gammaproteobacteria. However, 16S rRNA gene sequencing showed KMM 296's relatedness to Cobetia amphilecti NRIC 0815[T]. The isolate grew with 0.5-19% NaCl at 4-42 °C and hydrolyzed Tweens 20 and 40 and L-tyrosine. The DNA G+C content was 62.5 mol%. The prevalent fatty acids were C18:1 ω7c, C12:0 3-OH, C18:1 ω7c, C12:0, and C17:0 cyclo. The polar lipid profile was characterized by the presence of phosphatidylethanolamine, phosphatidylglycerol, phosphatidic acid, and also an unidentified aminolipid, phospholipid, and a few unidentified lipids. The major respiratory quinone was Q-8. According to phylogenomic and chemotaxonomic evidence, and the nearest neighbors, the strain KMM 296 represents a member of the species C. amphilecti. The genome-based analysis of C. amphilecti NRIC 0815[T] and C. litoralis NRIC 0814[T] showed their belonging to a single species. In addition, the high similarity between the C. pacifica NRIC 0813[T] and C. marina LMG 2217[T] genomes suggests their affiliation to one species. Based on the rules of priority, C. litoralis should be reclassified as a later heterotypic synonym of C. amphilecti, and C. pacifica is a later heterotypic synonym of C. marina. The emended descriptions of the species C. amphilecti and C. marina are also proposed.
Additional Links: PMID-38397433
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38397433,
year = {2024},
author = {Nedashkovskaya, O and Balabanova, L and Otstavnykh, N and Zhukova, N and Detkova, E and Seitkalieva, A and Bystritskaya, E and Noskova, Y and Tekutyeva, L and Isaeva, M},
title = {In-Depth Genome Characterization and Pan-Genome Analysis of Strain KMM 296, a Producer of Highly Active Alkaline Phosphatase; Proposal for the Reclassification of Cobetia litoralis and Cobetia pacifica as the Later Heterotypic Synonyms of Cobetia amphilecti and Cobetia marina, and Emended Description of the Species Cobetia amphilecti and Cobetia marina.},
journal = {Biomolecules},
volume = {14},
number = {2},
pages = {},
doi = {10.3390/biom14020196},
pmid = {38397433},
issn = {2218-273X},
support = {15.BRK.21.0004 (contract no. 075-15-2021-1052)//the Ministry of Science and Higher Education, Russian Federation/ ; },
abstract = {A strictly aerobic, Gram-stain-negative, rod-shaped, and motile bacterium, designated strain KMM 296, isolated from the coelomic fluid of the mussel Crenomytilus grayanus, was investigated in detail due to its ability to produce a highly active alkaline phosphatase CmAP of the structural family PhoA. A previous taxonomic study allocated the strain to the species Cobetia marina, a member of the family Halomonadaceae of the class Gammaproteobacteria. However, 16S rRNA gene sequencing showed KMM 296's relatedness to Cobetia amphilecti NRIC 0815[T]. The isolate grew with 0.5-19% NaCl at 4-42 °C and hydrolyzed Tweens 20 and 40 and L-tyrosine. The DNA G+C content was 62.5 mol%. The prevalent fatty acids were C18:1 ω7c, C12:0 3-OH, C18:1 ω7c, C12:0, and C17:0 cyclo. The polar lipid profile was characterized by the presence of phosphatidylethanolamine, phosphatidylglycerol, phosphatidic acid, and also an unidentified aminolipid, phospholipid, and a few unidentified lipids. The major respiratory quinone was Q-8. According to phylogenomic and chemotaxonomic evidence, and the nearest neighbors, the strain KMM 296 represents a member of the species C. amphilecti. The genome-based analysis of C. amphilecti NRIC 0815[T] and C. litoralis NRIC 0814[T] showed their belonging to a single species. In addition, the high similarity between the C. pacifica NRIC 0813[T] and C. marina LMG 2217[T] genomes suggests their affiliation to one species. Based on the rules of priority, C. litoralis should be reclassified as a later heterotypic synonym of C. amphilecti, and C. pacifica is a later heterotypic synonym of C. marina. The emended descriptions of the species C. amphilecti and C. marina are also proposed.},
}
RevDate: 2024-02-24
New Obolenskvirus Phages Brutus and Scipio: Biology, Evolution, and Phage-Host Interaction.
International journal of molecular sciences, 25(4): pii:ijms25042074.
Two novel virulent phages of the genus Obolenskvirus infecting Acinetobacter baumannii, a significant nosocomial pathogen, have been isolated and studied. Phages Brutus and Scipio were able to infect A. baumannii strains belonging to the K116 and K82 capsular types, respectively. The biological properties and genomic organization of the phages were characterized. Comparative genomic, phylogenetic, and pangenomic analyses were performed to investigate the relationship of Brutus and Scipio to other bacterial viruses and to trace the possible origin and evolutionary history of these phages and other representatives of the genus Obolenskvirus. The investigation of enzymatic activity of the tailspike depolymerase encoded in the genome of phage Scipio, the first reported virus infecting A. baumannii of the K82 capsular type, was performed. The study of new representatives of the genus Obolenskvirus and mechanisms of action of depolymerases encoded in their genomes expands knowledge about the diversity of viruses within this taxonomic group and strategies of Obolenskvirus-host bacteria interaction.
Additional Links: PMID-38396752
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38396752,
year = {2024},
author = {Evseev, PV and Shneider, MM and Kolupaeva, LV and Kasimova, AA and Timoshina, OY and Perepelov, AV and Shpirt, AM and Shelenkov, AA and Mikhailova, YV and Suzina, NE and Knirel, YA and Miroshnikov, KA and Popova, AV},
title = {New Obolenskvirus Phages Brutus and Scipio: Biology, Evolution, and Phage-Host Interaction.},
journal = {International journal of molecular sciences},
volume = {25},
number = {4},
pages = {},
doi = {10.3390/ijms25042074},
pmid = {38396752},
issn = {1422-0067},
support = {20-75-10113//Russian Science Foundation/ ; },
abstract = {Two novel virulent phages of the genus Obolenskvirus infecting Acinetobacter baumannii, a significant nosocomial pathogen, have been isolated and studied. Phages Brutus and Scipio were able to infect A. baumannii strains belonging to the K116 and K82 capsular types, respectively. The biological properties and genomic organization of the phages were characterized. Comparative genomic, phylogenetic, and pangenomic analyses were performed to investigate the relationship of Brutus and Scipio to other bacterial viruses and to trace the possible origin and evolutionary history of these phages and other representatives of the genus Obolenskvirus. The investigation of enzymatic activity of the tailspike depolymerase encoded in the genome of phage Scipio, the first reported virus infecting A. baumannii of the K82 capsular type, was performed. The study of new representatives of the genus Obolenskvirus and mechanisms of action of depolymerases encoded in their genomes expands knowledge about the diversity of viruses within this taxonomic group and strategies of Obolenskvirus-host bacteria interaction.},
}
RevDate: 2024-02-24
Robustness of cancer microbiome signals over a broad range of methodological variation.
Oncogene [Epub ahead of print].
In 2020, we identified cancer-specific microbial signals in The Cancer Genome Atlas (TCGA) [1]. Multiple peer-reviewed papers independently verified or extended our findings [2-12]. Given this impact, we carefully considered concerns by Gihawi et al. [13] that batch correction and database contamination with host sequences artificially created the appearance of cancer type-specific microbiomes. (1) We tested batch correction by comparing raw and Voom-SNM-corrected data per-batch, finding predictive equivalence and significantly similar features. We found consistent results with a modern microbiome-specific method (ConQuR [14]), and when restricting to taxa found in an independent, highly-decontaminated cohort. (2) Using Conterminator [15], we found low levels of human contamination in our original databases (~1% of genomes). We demonstrated that the increased detection of human reads in Gihawi et al. [13] was due to using a newer human genome reference. (3) We developed Exhaustive, a method twice as sensitive as Conterminator, to clean RefSeq. We comprehensively host-deplete TCGA with many human (pan)genome references. We repeated all analyses with this and the Gihawi et al. [13] pipeline, and found cancer type-specific microbiomes. These extensive re-analyses and updated methods validate our original conclusion that cancer type-specific microbial signatures exist in TCGA, and show they are robust to methodology.
Additional Links: PMID-38396294
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38396294,
year = {2024},
author = {Sepich-Poore, GD and McDonald, D and Kopylova, E and Guccione, C and Zhu, Q and Austin, G and Carpenter, C and Fraraccio, S and Wandro, S and Kosciolek, T and Janssen, S and Metcalf, JL and Song, SJ and Kanbar, J and Miller-Montgomery, S and Heaton, R and Mckay, R and Patel, SP and Swafford, AD and Korem, T and Knight, R},
title = {Robustness of cancer microbiome signals over a broad range of methodological variation.},
journal = {Oncogene},
volume = {},
number = {},
pages = {},
pmid = {38396294},
issn = {1476-5594},
support = {U24 CA248454/CA/NCI NIH HHS/United States ; },
abstract = {In 2020, we identified cancer-specific microbial signals in The Cancer Genome Atlas (TCGA) [1]. Multiple peer-reviewed papers independently verified or extended our findings [2-12]. Given this impact, we carefully considered concerns by Gihawi et al. [13] that batch correction and database contamination with host sequences artificially created the appearance of cancer type-specific microbiomes. (1) We tested batch correction by comparing raw and Voom-SNM-corrected data per-batch, finding predictive equivalence and significantly similar features. We found consistent results with a modern microbiome-specific method (ConQuR [14]), and when restricting to taxa found in an independent, highly-decontaminated cohort. (2) Using Conterminator [15], we found low levels of human contamination in our original databases (~1% of genomes). We demonstrated that the increased detection of human reads in Gihawi et al. [13] was due to using a newer human genome reference. (3) We developed Exhaustive, a method twice as sensitive as Conterminator, to clean RefSeq. We comprehensively host-deplete TCGA with many human (pan)genome references. We repeated all analyses with this and the Gihawi et al. [13] pipeline, and found cancer type-specific microbiomes. These extensive re-analyses and updated methods validate our original conclusion that cancer type-specific microbial signatures exist in TCGA, and show they are robust to methodology.},
}
RevDate: 2024-02-24
Whole genome sequencing and characterization of Pantoea agglomerans DBM 3797, endophyte, isolated from fresh hop (Humulus lupulus L.).
Frontiers in microbiology, 15:1305338.
BACKGROUND: This paper brings new information about the genome and phenotypic characteristics of Pantoea agglomerans strain DBM 3797, isolated from fresh Czech hop (Humulus lupulus) in the Saaz hop-growing region. Although P. agglomerans strains are frequently isolated from different materials, there are not usually thoroughly characterized even if they have versatile metabolism and those isolated from plants may have a considerable potential for application in agriculture as a support culture for plant growth.
METHODS: P. agglomerans DBM 3797 was cultured under aerobic and anaerobic conditions, its metabolites were analyzed by HPLC and it was tested for plant growth promotion abilities, such as phosphate solubilization, siderophore and indol-3-acetic acid productions. In addition, genomic DNA was extracted, sequenced and de novo assembly was performed. Further, genome annotation, pan-genome analysis and selected genome analyses, such as CRISPR arrays detection, antibiotic resistance and secondary metabolite genes identification were carried out.
RESULTS AND DISCUSSION: The typical appearance characteristics of the strain include the formation of symplasmata in submerged liquid culture and the formation of pale yellow colonies on agar. The genetic information of the strain (in total 4.8 Mb) is divided between a chromosome and two plasmids. The strain lacks any CRISPR-Cas system but is equipped with four restriction-modification systems. The phenotypic analysis focused on growth under both aerobic and anaerobic conditions, as well as traits associated with plant growth promotion. At both levels (genomic and phenotypic), the production of siderophores, indoleacetic acid-derived growth promoters, gluconic acid, and enzyme activities related to the degradation of complex organic compounds were found. Extracellular gluconic acid production under aerobic conditions (up to 8 g/l) is probably the result of glucose oxidation by the membrane-bound pyrroloquinoline quinone-dependent enzyme glucose dehydrogenase. The strain has a number of properties potentially beneficial to the hop plant and its closest relatives include the strains also isolated from the aerial parts of plants, yet its safety profile needs to be addressed in follow-up research.
Additional Links: PMID-38389535
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38389535,
year = {2024},
author = {Patakova, P and Vasylkivska, M and Sedlar, K and Jureckova, K and Bezdicek, M and Lovecka, P and Branska, B and Kastanek, P and Krofta, K},
title = {Whole genome sequencing and characterization of Pantoea agglomerans DBM 3797, endophyte, isolated from fresh hop (Humulus lupulus L.).},
journal = {Frontiers in microbiology},
volume = {15},
number = {},
pages = {1305338},
pmid = {38389535},
issn = {1664-302X},
abstract = {BACKGROUND: This paper brings new information about the genome and phenotypic characteristics of Pantoea agglomerans strain DBM 3797, isolated from fresh Czech hop (Humulus lupulus) in the Saaz hop-growing region. Although P. agglomerans strains are frequently isolated from different materials, there are not usually thoroughly characterized even if they have versatile metabolism and those isolated from plants may have a considerable potential for application in agriculture as a support culture for plant growth.
METHODS: P. agglomerans DBM 3797 was cultured under aerobic and anaerobic conditions, its metabolites were analyzed by HPLC and it was tested for plant growth promotion abilities, such as phosphate solubilization, siderophore and indol-3-acetic acid productions. In addition, genomic DNA was extracted, sequenced and de novo assembly was performed. Further, genome annotation, pan-genome analysis and selected genome analyses, such as CRISPR arrays detection, antibiotic resistance and secondary metabolite genes identification were carried out.
RESULTS AND DISCUSSION: The typical appearance characteristics of the strain include the formation of symplasmata in submerged liquid culture and the formation of pale yellow colonies on agar. The genetic information of the strain (in total 4.8 Mb) is divided between a chromosome and two plasmids. The strain lacks any CRISPR-Cas system but is equipped with four restriction-modification systems. The phenotypic analysis focused on growth under both aerobic and anaerobic conditions, as well as traits associated with plant growth promotion. At both levels (genomic and phenotypic), the production of siderophores, indoleacetic acid-derived growth promoters, gluconic acid, and enzyme activities related to the degradation of complex organic compounds were found. Extracellular gluconic acid production under aerobic conditions (up to 8 g/l) is probably the result of glucose oxidation by the membrane-bound pyrroloquinoline quinone-dependent enzyme glucose dehydrogenase. The strain has a number of properties potentially beneficial to the hop plant and its closest relatives include the strains also isolated from the aerial parts of plants, yet its safety profile needs to be addressed in follow-up research.},
}
RevDate: 2024-02-23
Pig pangenome graph reveals functional features of non-reference sequences.
Journal of animal science and biotechnology, 15(1):32.
BACKGROUND: The reliance on a solitary linear reference genome has imposed a significant constraint on our comprehensive understanding of genetic variation in animals. This constraint is particularly pronounced for non-reference sequences (NRSs), which have not been extensively studied.
RESULTS: In this study, we constructed a pig pangenome graph using 21 pig assemblies and identified 23,831 NRSs with a total length of 105 Mb. Our findings revealed that NRSs were more prevalent in breeds exhibiting greater genetic divergence from the reference genome. Furthermore, we observed that NRSs were rarely found within coding sequences, while NRS insertions were enriched in immune-related Gene Ontology terms. Notably, our investigation also unveiled a close association between novel genes and the immune capacity of pigs. We observed substantial differences in terms of frequencies of NRSs between Eastern and Western pigs, and the heat-resistant pigs exhibited a substantial number of NRS insertions in an 11.6 Mb interval on chromosome X. Additionally, we discovered a 665 bp insertion in the fourth intron of the TNFRSF19 gene that may be associated with the ability of heat tolerance in Southern Chinese pigs.
CONCLUSIONS: Our findings demonstrate the potential of a graph genome approach to reveal important functional features of NRSs in pig populations.
Additional Links: PMID-38389084
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38389084,
year = {2024},
author = {Miao, J and Wei, X and Cao, C and Sun, J and Xu, Y and Zhang, Z and Wang, Q and Pan, Y and Wang, Z},
title = {Pig pangenome graph reveals functional features of non-reference sequences.},
journal = {Journal of animal science and biotechnology},
volume = {15},
number = {1},
pages = {32},
pmid = {38389084},
issn = {1674-9782},
support = {2022YFF1000500//National Key Research and Development Program of China/ ; 31941007//National Natural Science Foundation of China/ ; 2016C02054-2//Zhejiang province agriculture (livestock) varieties breeding Key Technology R&D Program/ ; },
abstract = {BACKGROUND: The reliance on a solitary linear reference genome has imposed a significant constraint on our comprehensive understanding of genetic variation in animals. This constraint is particularly pronounced for non-reference sequences (NRSs), which have not been extensively studied.
RESULTS: In this study, we constructed a pig pangenome graph using 21 pig assemblies and identified 23,831 NRSs with a total length of 105 Mb. Our findings revealed that NRSs were more prevalent in breeds exhibiting greater genetic divergence from the reference genome. Furthermore, we observed that NRSs were rarely found within coding sequences, while NRS insertions were enriched in immune-related Gene Ontology terms. Notably, our investigation also unveiled a close association between novel genes and the immune capacity of pigs. We observed substantial differences in terms of frequencies of NRSs between Eastern and Western pigs, and the heat-resistant pigs exhibited a substantial number of NRS insertions in an 11.6 Mb interval on chromosome X. Additionally, we discovered a 665 bp insertion in the fourth intron of the TNFRSF19 gene that may be associated with the ability of heat tolerance in Southern Chinese pigs.
CONCLUSIONS: Our findings demonstrate the potential of a graph genome approach to reveal important functional features of NRSs in pig populations.},
}
RevDate: 2024-02-22
Comparative pangenomic analysis of Campylobacter fetus isolated from Spanish bulls and other mammalian species.
Scientific reports, 14(1):4347.
Campylobacter fetus comprises two closely related mammal-associated subspecies: Campylobacter fetus subsp. fetus (Cff) and Campylobacter fetus subsp. venerealis (Cfv). The latter causes bovine genital campylobacteriosis, a sexually-transmitted disease endemic in Spain that results in significant economic losses in the cattle industry. Here, 33 C. fetus Spanish isolates were whole-genome sequenced and compared with 62 publicly available C. fetus genomes from other countries. Genome-based taxonomic identification revealed high concordance with in silico PCR, confirming Spanish isolates as Cff (n = 4), Cfv (n = 9) and Cfv biovar intermedius (Cfvi, n = 20). MLST analysis assigned the Spanish isolates to 6 STs, including three novel: ST-76 and ST-77 for Cfv and ST-78 for Cff. Core genome SNP phylogenetic analysis of the 95 genomes identified multiple clusters, revealing associations at subspecies and biovar level between genomes with the same ST and separating the Cfvi genomes from Spain and other countries. A genome-wide association study identified pqqL as a Cfv-specific gene and a potential candidate for more accurate identification methods. Functionality analysis revealed variations in the accessory genome of C. fetus subspecies and biovars that deserve further studies. These results provide valuable information about the regional variants of C. fetus present in Spain and the genetic diversity and predicted functionality of the different subspecies.
Additional Links: PMID-38388650
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38388650,
year = {2024},
author = {Pena-Fernández, N and Ocejo, M and van der Graaf-van Bloois, L and Lavín, JL and Kortabarria, N and Collantes-Fernández, E and Hurtado, A and Aduriz, G},
title = {Comparative pangenomic analysis of Campylobacter fetus isolated from Spanish bulls and other mammalian species.},
journal = {Scientific reports},
volume = {14},
number = {1},
pages = {4347},
pmid = {38388650},
issn = {2045-2322},
support = {Pre2018-086113 funded by MCIN/AEI/ 10.13039/501100011033 and by "ESF Investing in your future"//Ministerio de Ciencia e Innovación/ ; },
abstract = {Campylobacter fetus comprises two closely related mammal-associated subspecies: Campylobacter fetus subsp. fetus (Cff) and Campylobacter fetus subsp. venerealis (Cfv). The latter causes bovine genital campylobacteriosis, a sexually-transmitted disease endemic in Spain that results in significant economic losses in the cattle industry. Here, 33 C. fetus Spanish isolates were whole-genome sequenced and compared with 62 publicly available C. fetus genomes from other countries. Genome-based taxonomic identification revealed high concordance with in silico PCR, confirming Spanish isolates as Cff (n = 4), Cfv (n = 9) and Cfv biovar intermedius (Cfvi, n = 20). MLST analysis assigned the Spanish isolates to 6 STs, including three novel: ST-76 and ST-77 for Cfv and ST-78 for Cff. Core genome SNP phylogenetic analysis of the 95 genomes identified multiple clusters, revealing associations at subspecies and biovar level between genomes with the same ST and separating the Cfvi genomes from Spain and other countries. A genome-wide association study identified pqqL as a Cfv-specific gene and a potential candidate for more accurate identification methods. Functionality analysis revealed variations in the accessory genome of C. fetus subspecies and biovars that deserve further studies. These results provide valuable information about the regional variants of C. fetus present in Spain and the genetic diversity and predicted functionality of the different subspecies.},
}
RevDate: 2024-02-22
Impact of homologous recombination on core genome evolution and host adaptation of Pectobacterium parmentieri.
Genome biology and evolution pii:7612553 [Epub ahead of print].
Homologous recombination is a major force mechanism driving bacterial evolution, host adaptability and acquisition of novel virulence traits. Pectobacterium parmentieri is a plant bacterial pathogen distributed worldwide, primarily affecting potatoes, by causing soft rot and blackleg diseases. The goal of this investigation was to understand the impact of homologous recombination on the genomic evolution of P. parmentieri. Analysis of P. parmentieri genomes using Roary revealed a dynamic pan-genome with 3,742 core genes and over 55% accessory genome variability. Bayesian population structure analysis identified seven lineages, indicating species heterogeneity. ClonalFrameML analysis displayed 5,125 recombination events, with the lineage 4 exhibiting the highest events. fastGEAR analysis identified 486 ancestral and 941 recent recombination events ranging 43 bp - 119 kb and 36 bp - 13.96 kb, respectively, suggesting ongoing adaptation. Notably, 11% (412 genes) of the core genome underwent recent recombination, with lineage 1 as the main donor. The prevalence of recent recombination (double compared to ancient) events implies continuous adaptation, possibly driven by global potato trade. Recombination events were found in genes involved in vital cellular processes (DNA replication, DNA repair, RNA processing, homeostasis, and metabolism), pathogenicity determinants (type secretion systems, cell-wall degrading enzymes, iron scavengers, lipopolysaccharides, flagellum, etc.), antimicrobial compounds (phenazine and colicin) and even CRISPR-Cas genes. Overall, these results emphasize the potential role of homologous recombination in P. parmentieri's evolutionary dynamics, influencing host colonization, pathogenicity, adaptive immunity, and ecological fitness.
Additional Links: PMID-38385549
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38385549,
year = {2024},
author = {Arizala, D and Arif, M},
title = {Impact of homologous recombination on core genome evolution and host adaptation of Pectobacterium parmentieri.},
journal = {Genome biology and evolution},
volume = {},
number = {},
pages = {},
doi = {10.1093/gbe/evae032},
pmid = {38385549},
issn = {1759-6653},
abstract = {Homologous recombination is a major force mechanism driving bacterial evolution, host adaptability and acquisition of novel virulence traits. Pectobacterium parmentieri is a plant bacterial pathogen distributed worldwide, primarily affecting potatoes, by causing soft rot and blackleg diseases. The goal of this investigation was to understand the impact of homologous recombination on the genomic evolution of P. parmentieri. Analysis of P. parmentieri genomes using Roary revealed a dynamic pan-genome with 3,742 core genes and over 55% accessory genome variability. Bayesian population structure analysis identified seven lineages, indicating species heterogeneity. ClonalFrameML analysis displayed 5,125 recombination events, with the lineage 4 exhibiting the highest events. fastGEAR analysis identified 486 ancestral and 941 recent recombination events ranging 43 bp - 119 kb and 36 bp - 13.96 kb, respectively, suggesting ongoing adaptation. Notably, 11% (412 genes) of the core genome underwent recent recombination, with lineage 1 as the main donor. The prevalence of recent recombination (double compared to ancient) events implies continuous adaptation, possibly driven by global potato trade. Recombination events were found in genes involved in vital cellular processes (DNA replication, DNA repair, RNA processing, homeostasis, and metabolism), pathogenicity determinants (type secretion systems, cell-wall degrading enzymes, iron scavengers, lipopolysaccharides, flagellum, etc.), antimicrobial compounds (phenazine and colicin) and even CRISPR-Cas genes. Overall, these results emphasize the potential role of homologous recombination in P. parmentieri's evolutionary dynamics, influencing host colonization, pathogenicity, adaptive immunity, and ecological fitness.},
}
RevDate: 2024-02-22
Pangenomic analyses of tuberculosis strains to identify resistomes using computational approaches.
JPMA. The Journal of the Pakistan Medical Association, 74(1 (Supple-2)):S74-S78.
OBJECTIVE: To locate resistomes in tuberculosis strains, to determine the severity of drug resistance, and to infer its implications with respect to high tuberculosis prevalence in a Third World setting.
METHODS: The pangenomic study was conducted from October 2022 to January 2023 in Sir Syed University of Engineering and Technology, Karachi, and comprised 2012-22 data on multiple sequence alignment to assess the genetic evolution of tuberculosis strains. Antibiotic resistance drug classes were identified using the Canadian Antibiotic Resistance Database, which entailed multidrug-resistant and extremely drug-resistant strains. Also, GenBank was used for tuberculosis genome FASTA (fast-all; nucleotide and protein sequence representation) files, prediction of resistome sequences on the basis of Canadian Antibiotic Resistance Database, and multiple sequence alignment was done in Mauve.
RESULTS: Evolutionarily, the 6 strains identified were structurally similar with polymorphisms in their core chromosomal regions. Their resistome genes showed perfect hits for isoniazid, rifamycin, cephalosporin, fluoroquinolone, aminoglycosides, penem, penam and cephamycin.
CONCLUSION: Drugs discovered in antibiotic resistance genes are now less effective in treatment, and have the potential to develop into more dangerous bacteria, if not monitored. For treatment, staying long durations in hospitals for quality healthcare and supervision in third world countries is unaffordable.
Additional Links: PMID-38385476
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38385476,
year = {2024},
author = {Tariq, DE},
title = {Pangenomic analyses of tuberculosis strains to identify resistomes using computational approaches.},
journal = {JPMA. The Journal of the Pakistan Medical Association},
volume = {74},
number = {1 (Supple-2)},
pages = {S74-S78},
doi = {10.47391/JPMA-DUHS-S15},
pmid = {38385476},
issn = {0030-9982},
abstract = {OBJECTIVE: To locate resistomes in tuberculosis strains, to determine the severity of drug resistance, and to infer its implications with respect to high tuberculosis prevalence in a Third World setting.
METHODS: The pangenomic study was conducted from October 2022 to January 2023 in Sir Syed University of Engineering and Technology, Karachi, and comprised 2012-22 data on multiple sequence alignment to assess the genetic evolution of tuberculosis strains. Antibiotic resistance drug classes were identified using the Canadian Antibiotic Resistance Database, which entailed multidrug-resistant and extremely drug-resistant strains. Also, GenBank was used for tuberculosis genome FASTA (fast-all; nucleotide and protein sequence representation) files, prediction of resistome sequences on the basis of Canadian Antibiotic Resistance Database, and multiple sequence alignment was done in Mauve.
RESULTS: Evolutionarily, the 6 strains identified were structurally similar with polymorphisms in their core chromosomal regions. Their resistome genes showed perfect hits for isoniazid, rifamycin, cephalosporin, fluoroquinolone, aminoglycosides, penem, penam and cephamycin.
CONCLUSION: Drugs discovered in antibiotic resistance genes are now less effective in treatment, and have the potential to develop into more dangerous bacteria, if not monitored. For treatment, staying long durations in hospitals for quality healthcare and supervision in third world countries is unaffordable.},
}
RevDate: 2024-02-22
High clonality of Mycobacterium avium subsp. paratuberculosis field isolates from red deer revealed by two different methodological approaches of comparative genomic analysis.
Frontiers in veterinary science, 11:1301667.
Mycobacterium avium subsp. paratuberculosis (MAP) is the aetiological agent of paratuberculosis (Johne's disease) in both domestic and wild ruminants. In the present study, using a whole-genome sequence (WGS) approach, we investigated the genetic diversity of 15 Mycobacterium avium field strains isolated in the last 10 years from red deer inhabiting the Stelvio National Park and affected by paratuberculosis. Combining de novo assembly and a reference-based method, followed by a pangenome analysis, we highlight a very close relationship among 13 MAP field isolates, suggesting that a single infecting event occurred in this population. Moreover, two isolates have been classified as Mycobacterium avium subsp. hominissuis, distinct from the other MAPs under comparison but close to each other. This is the first time that this subspecies has been found in Italy in samples without evident epidemiological correlations, having been isolated in two different locations of the Stelvio National Park and in different years. Our study highlights the importance of a multidisciplinary approach incorporating molecular epidemiology and ecology into traditional infectious disease knowledge in order to investigate the nature of infectious disease in wildlife populations.
Additional Links: PMID-38379925
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38379925,
year = {2024},
author = {Turco, S and Russo, S and Pietrucci, D and Filippi, A and Milanesi, M and Luzzago, C and Garbarino, C and Palladini, G and Chillemi, G and Ricchi, M},
title = {High clonality of Mycobacterium avium subsp. paratuberculosis field isolates from red deer revealed by two different methodological approaches of comparative genomic analysis.},
journal = {Frontiers in veterinary science},
volume = {11},
number = {},
pages = {1301667},
pmid = {38379925},
issn = {2297-1769},
abstract = {Mycobacterium avium subsp. paratuberculosis (MAP) is the aetiological agent of paratuberculosis (Johne's disease) in both domestic and wild ruminants. In the present study, using a whole-genome sequence (WGS) approach, we investigated the genetic diversity of 15 Mycobacterium avium field strains isolated in the last 10 years from red deer inhabiting the Stelvio National Park and affected by paratuberculosis. Combining de novo assembly and a reference-based method, followed by a pangenome analysis, we highlight a very close relationship among 13 MAP field isolates, suggesting that a single infecting event occurred in this population. Moreover, two isolates have been classified as Mycobacterium avium subsp. hominissuis, distinct from the other MAPs under comparison but close to each other. This is the first time that this subspecies has been found in Italy in samples without evident epidemiological correlations, having been isolated in two different locations of the Stelvio National Park and in different years. Our study highlights the importance of a multidisciplinary approach incorporating molecular epidemiology and ecology into traditional infectious disease knowledge in order to investigate the nature of infectious disease in wildlife populations.},
}
RevDate: 2024-02-20
Plant pangenomes for crop improvement, biodiversity and evolution.
Nature reviews. Genetics [Epub ahead of print].
Plant genome sequences catalogue genes and the genetic elements that regulate their expression. Such inventories further research aims as diverse as mapping the molecular basis of trait diversity in domesticated plants or inquiries into the origin of evolutionary innovations in flowering plants millions of years ago. The transformative technological progress of DNA sequencing in the past two decades has enabled researchers to sequence ever more genomes with greater ease. Pangenomes - complete sequences of multiple individuals of a species or higher taxonomic unit - have now entered the geneticists' toolkit. The genomes of crop plants and their wild relatives are being studied with translational applications in breeding in mind. But pangenomes are applicable also in ecological and evolutionary studies, as they help classify and monitor biodiversity across the tree of life, deepen our understanding of how plant species diverged and show how plants adapt to changing environments or new selection pressures exerted by human beings.
Additional Links: PMID-38378816
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38378816,
year = {2024},
author = {Schreiber, M and Jayakodi, M and Stein, N and Mascher, M},
title = {Plant pangenomes for crop improvement, biodiversity and evolution.},
journal = {Nature reviews. Genetics},
volume = {},
number = {},
pages = {},
pmid = {38378816},
issn = {1471-0064},
abstract = {Plant genome sequences catalogue genes and the genetic elements that regulate their expression. Such inventories further research aims as diverse as mapping the molecular basis of trait diversity in domesticated plants or inquiries into the origin of evolutionary innovations in flowering plants millions of years ago. The transformative technological progress of DNA sequencing in the past two decades has enabled researchers to sequence ever more genomes with greater ease. Pangenomes - complete sequences of multiple individuals of a species or higher taxonomic unit - have now entered the geneticists' toolkit. The genomes of crop plants and their wild relatives are being studied with translational applications in breeding in mind. But pangenomes are applicable also in ecological and evolutionary studies, as they help classify and monitor biodiversity across the tree of life, deepen our understanding of how plant species diverged and show how plants adapt to changing environments or new selection pressures exerted by human beings.},
}
RevDate: 2024-02-20
The evolutionary phylodynamics of human parechovirus A type 3 reveal multiple recombination events in South Korea.
Journal of medical virology, 96(2):e29477.
Human parechovirus A (HPeV-A) is a causative agent of respiratory and gastrointestinal illnesses, acute flaccid paralysis encephalitis, meningitis, and neonatal sepsis. To clarify the characteristics of HPeV-A infection in children, 391 fecal specimens were collected from January 2014 to October 2015 from patients with acute gastroenteritis in Seoul, South Korea. Of these, 221/391 (56.5%) HPeV-A positive samples were found in children less than 2 years old. Three HPeV-A genotypes HPeV-A1 (117/221; 52.94%), HPeV-A3 (100/221; 45.25%), and HPeV-A6 (4/221; 1.81%) were detected, among which HPeV-A3 was predominant with the highest recorded value of 58.6% in 2015. Moreover, recombination events in the Korean HPeV-A3 strains were detected. Phylogenetic analysis revealed that the capsid-encoding regions and noncapsid gene 2A of the four Korean HPeV-A3 strains are closely related to the HPeV-A3 strains isolated in Canada in 2007 (Can82853-01), Japan in 2008 (A308/99), and Taiwan in 2011 (TW-03067-2011) while noncapsid genes P2 (2B-2C) and P3 (3A-3D) are closely related to those of HPeV-A1 strains BNI-788St (Germany in 2008) and TW-71594-2010 (Taiwan in 2010). This first report on the whole-genome analysis of HPeV-A3 in Korea provides insight into the evolving status and pathogenesis of HPeVs in children.
Additional Links: PMID-38376942
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38376942,
year = {2024},
author = {Truong, TC and Park, H and Kim, JH and Tran, VT and Kim, W},
title = {The evolutionary phylodynamics of human parechovirus A type 3 reveal multiple recombination events in South Korea.},
journal = {Journal of medical virology},
volume = {96},
number = {2},
pages = {e29477},
doi = {10.1002/jmv.29477},
pmid = {38376942},
issn = {1096-9071},
support = {NRF-2021R1C1C2003223//National Research Foundation of Korea/ ; NRF-2022R1A2C2012209//National Research Foundation of Korea/ ; },
abstract = {Human parechovirus A (HPeV-A) is a causative agent of respiratory and gastrointestinal illnesses, acute flaccid paralysis encephalitis, meningitis, and neonatal sepsis. To clarify the characteristics of HPeV-A infection in children, 391 fecal specimens were collected from January 2014 to October 2015 from patients with acute gastroenteritis in Seoul, South Korea. Of these, 221/391 (56.5%) HPeV-A positive samples were found in children less than 2 years old. Three HPeV-A genotypes HPeV-A1 (117/221; 52.94%), HPeV-A3 (100/221; 45.25%), and HPeV-A6 (4/221; 1.81%) were detected, among which HPeV-A3 was predominant with the highest recorded value of 58.6% in 2015. Moreover, recombination events in the Korean HPeV-A3 strains were detected. Phylogenetic analysis revealed that the capsid-encoding regions and noncapsid gene 2A of the four Korean HPeV-A3 strains are closely related to the HPeV-A3 strains isolated in Canada in 2007 (Can82853-01), Japan in 2008 (A308/99), and Taiwan in 2011 (TW-03067-2011) while noncapsid genes P2 (2B-2C) and P3 (3A-3D) are closely related to those of HPeV-A1 strains BNI-788St (Germany in 2008) and TW-71594-2010 (Taiwan in 2010). This first report on the whole-genome analysis of HPeV-A3 in Korea provides insight into the evolving status and pathogenesis of HPeVs in children.},
}
RevDate: 2024-02-20
A validated pangenome-scale metabolic model for the Klebsiella pneumoniae species complex.
Microbial genomics, 10(2):.
The Klebsiella pneumoniae species complex (KpSC) is a major source of nosocomial infections globally with high rates of resistance to antimicrobials. Consequently, there is growing interest in understanding virulence factors and their association with cellular metabolic processes for developing novel anti-KpSC therapeutics. Phenotypic assays have revealed metabolic diversity within the KpSC, but metabolism research has been neglected due to experiments being difficult and cost-intensive. Genome-scale metabolic models (GSMMs) represent a rapid and scalable in silico approach for exploring metabolic diversity, which compile genomic and biochemical data to reconstruct the metabolic network of an organism. Here we use a diverse collection of 507 KpSC isolates, including representatives of globally distributed clinically relevant lineages, to construct the most comprehensive KpSC pan-metabolic model to date, KpSC pan v2. Candidate metabolic reactions were identified using gene orthology to known metabolic genes, prior to manual curation via extensive literature and database searches. The final model comprised a total of 3550 reactions, 2403 genes and can simulate growth on 360 unique substrates. We used KpSC pan v2 as a reference to derive strain-specific GSMMs for all 507 KpSC isolates, and compared these to GSMMs generated using a prior KpSC pan-reference (KpSC pan v1) and two single-strain references. We show that KpSC pan v2 includes a greater proportion of accessory reactions (8.8 %) than KpSC pan v1 (2.5 %). GSMMs derived from KpSC pan v2 also generate more accurate growth predictions, with high median accuracies of 95.4 % (aerobic, n=37 isolates) and 78.8 % (anaerobic, n=36 isolates) for 124 matched carbon substrates. KpSC pan v2 is freely available at https://github.com/kelwyres/KpSC-pan-metabolic-model, representing a valuable resource for the scientific community, both as a source of curated metabolic information and as a reference to derive accurate strain-specific GSMMs. The latter can be used to investigate the relationship between KpSC metabolism and traits of interest, such as reservoirs, epidemiology, drug resistance or virulence, and ultimately to inform novel KpSC control strategies.
Additional Links: PMID-38376382
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38376382,
year = {2024},
author = {Cooper, HB and Vezina, B and Hawkey, J and Passet, V and López-Fernández, S and Monk, JM and Brisse, S and Holt, KE and Wyres, KL},
title = {A validated pangenome-scale metabolic model for the Klebsiella pneumoniae species complex.},
journal = {Microbial genomics},
volume = {10},
number = {2},
pages = {},
doi = {10.1099/mgen.0.001206},
pmid = {38376382},
issn = {2057-5858},
abstract = {The Klebsiella pneumoniae species complex (KpSC) is a major source of nosocomial infections globally with high rates of resistance to antimicrobials. Consequently, there is growing interest in understanding virulence factors and their association with cellular metabolic processes for developing novel anti-KpSC therapeutics. Phenotypic assays have revealed metabolic diversity within the KpSC, but metabolism research has been neglected due to experiments being difficult and cost-intensive. Genome-scale metabolic models (GSMMs) represent a rapid and scalable in silico approach for exploring metabolic diversity, which compile genomic and biochemical data to reconstruct the metabolic network of an organism. Here we use a diverse collection of 507 KpSC isolates, including representatives of globally distributed clinically relevant lineages, to construct the most comprehensive KpSC pan-metabolic model to date, KpSC pan v2. Candidate metabolic reactions were identified using gene orthology to known metabolic genes, prior to manual curation via extensive literature and database searches. The final model comprised a total of 3550 reactions, 2403 genes and can simulate growth on 360 unique substrates. We used KpSC pan v2 as a reference to derive strain-specific GSMMs for all 507 KpSC isolates, and compared these to GSMMs generated using a prior KpSC pan-reference (KpSC pan v1) and two single-strain references. We show that KpSC pan v2 includes a greater proportion of accessory reactions (8.8 %) than KpSC pan v1 (2.5 %). GSMMs derived from KpSC pan v2 also generate more accurate growth predictions, with high median accuracies of 95.4 % (aerobic, n=37 isolates) and 78.8 % (anaerobic, n=36 isolates) for 124 matched carbon substrates. KpSC pan v2 is freely available at https://github.com/kelwyres/KpSC-pan-metabolic-model, representing a valuable resource for the scientific community, both as a source of curated metabolic information and as a reference to derive accurate strain-specific GSMMs. The latter can be used to investigate the relationship between KpSC metabolism and traits of interest, such as reservoirs, epidemiology, drug resistance or virulence, and ultimately to inform novel KpSC control strategies.},
}
RevDate: 2024-02-20
(Pan)genomic analysis of two Rhodococcus isolates and their role in phenolic compound degradation.
Microbiology spectrum [Epub ahead of print].
The genus Rhodococcus is recognized for its potential to degrade a large range of aromatic substances, including plant-derived phenolic compounds. We used comparative genomics in the context of the broader Rhodococcus pan-genome to study genomic traits of two newly described Rhodococcus strains (type-strain Rhodococcus pseudokoreensis R79[T] and Rhodococcus koreensis R85) isolated from apple rhizosphere. Of particular interest was their ability to degrade phenolic compounds as part of an integrated approach to treat apple replant disease (ARD) syndrome. The pan-genome of the genus Rhodococcus based on 109 high-quality genomes was open with a small core (1.3%) consisting of genes assigned to basic cell functioning. The range of genome sizes in Rhodococcus was high, from 3.7 to 10.9 Mbp. Genomes from host-associated strains were generally smaller compared to environmental isolates which were characterized by exceptionally large genome sizes. Due to large genomic differences, we propose the reclassification of distinct groups of rhodococci like the Rhodococcus equi cluster to new genera. Taxonomic species affiliation was the most important factor in predicting genetic content and clustering of the genomes. Additionally, we found genes that discriminated between the strains based on habitat. All members of the genus Rhodococcus had at least one gene involved in the pathway for the degradation of benzoate, while biphenyl degradation was mainly restricted to strains in close phylogenetic relationships with our isolates. The ~40% of genes still unclassified in larger Rhodococcus genomes, particularly those of environmental isolates, need more research to explore the metabolic potential of this genus.IMPORTANCERhodococcus is a diverse, metabolically powerful genus, with high potential to adapt to different habitats due to the linear plasmids and large genome sizes. The analysis of its pan-genome allowed us to separate host-associated from environmental strains, supporting taxonomic reclassification. It was shown which genes contribute to the differentiation of the genomes based on habitat, which can possibly be used for targeted isolation and screening for desired traits. With respect to apple replant disease (ARD), our isolates showed genome traits that suggest potential for application in reducing plant-derived phenolic substances in soil, which makes them good candidates for further testing against ARD.
Additional Links: PMID-38376357
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38376357,
year = {2024},
author = {Benning, S and Pritsch, K and Radl, V and Siani, R and Wang, Z and Schloter, M},
title = {(Pan)genomic analysis of two Rhodococcus isolates and their role in phenolic compound degradation.},
journal = {Microbiology spectrum},
volume = {},
number = {},
pages = {e0378323},
doi = {10.1128/spectrum.03783-23},
pmid = {38376357},
issn = {2165-0497},
abstract = {The genus Rhodococcus is recognized for its potential to degrade a large range of aromatic substances, including plant-derived phenolic compounds. We used comparative genomics in the context of the broader Rhodococcus pan-genome to study genomic traits of two newly described Rhodococcus strains (type-strain Rhodococcus pseudokoreensis R79[T] and Rhodococcus koreensis R85) isolated from apple rhizosphere. Of particular interest was their ability to degrade phenolic compounds as part of an integrated approach to treat apple replant disease (ARD) syndrome. The pan-genome of the genus Rhodococcus based on 109 high-quality genomes was open with a small core (1.3%) consisting of genes assigned to basic cell functioning. The range of genome sizes in Rhodococcus was high, from 3.7 to 10.9 Mbp. Genomes from host-associated strains were generally smaller compared to environmental isolates which were characterized by exceptionally large genome sizes. Due to large genomic differences, we propose the reclassification of distinct groups of rhodococci like the Rhodococcus equi cluster to new genera. Taxonomic species affiliation was the most important factor in predicting genetic content and clustering of the genomes. Additionally, we found genes that discriminated between the strains based on habitat. All members of the genus Rhodococcus had at least one gene involved in the pathway for the degradation of benzoate, while biphenyl degradation was mainly restricted to strains in close phylogenetic relationships with our isolates. The ~40% of genes still unclassified in larger Rhodococcus genomes, particularly those of environmental isolates, need more research to explore the metabolic potential of this genus.IMPORTANCERhodococcus is a diverse, metabolically powerful genus, with high potential to adapt to different habitats due to the linear plasmids and large genome sizes. The analysis of its pan-genome allowed us to separate host-associated from environmental strains, supporting taxonomic reclassification. It was shown which genes contribute to the differentiation of the genomes based on habitat, which can possibly be used for targeted isolation and screening for desired traits. With respect to apple replant disease (ARD), our isolates showed genome traits that suggest potential for application in reducing plant-derived phenolic substances in soil, which makes them good candidates for further testing against ARD.},
}
RevDate: 2024-02-20
Impressive pan-genomic diversity of E. coli from a wild animal community near urban development reflects human impacts.
iScience, 27(3):109072.
Human and domesticated animal waste infiltrates global freshwater, terrestrial, and marine environments, widely disseminating fecal microbes, antibiotics, and other chemical pollutants. Emerging evidence suggests that guts of wild animals are being invaded by our microbes, including Escherichia coli, which face anthropogenic selective pressures to gain antimicrobial resistance (AMR) and increase virulence. However, wild animal sources remain starkly under-represented among genomic sequence repositories. We sequenced whole genomes of 145 E. coli isolates from 55 wild and 13 domestic animal fecal samples, averaging 2 (ranging 1-7) isolates per sample, on a preserve imbedded in a human-dominated landscape in California Bay Area, USA, to assess AMR, virulence, and pan-genomic diversity. With single nucleotide polymorphism analyses we predict potential transmission routes. We illustrate the usefulness of E. coli to aid our understanding of and ability to surveil the emergence of zoonotic pathogens created by the mixing of human and wild bacteria in the environment.
Additional Links: PMID-38375235
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38375235,
year = {2024},
author = {Lagerstrom, KM and Scales, NC and Hadly, EA},
title = {Impressive pan-genomic diversity of E. coli from a wild animal community near urban development reflects human impacts.},
journal = {iScience},
volume = {27},
number = {3},
pages = {109072},
pmid = {38375235},
issn = {2589-0042},
abstract = {Human and domesticated animal waste infiltrates global freshwater, terrestrial, and marine environments, widely disseminating fecal microbes, antibiotics, and other chemical pollutants. Emerging evidence suggests that guts of wild animals are being invaded by our microbes, including Escherichia coli, which face anthropogenic selective pressures to gain antimicrobial resistance (AMR) and increase virulence. However, wild animal sources remain starkly under-represented among genomic sequence repositories. We sequenced whole genomes of 145 E. coli isolates from 55 wild and 13 domestic animal fecal samples, averaging 2 (ranging 1-7) isolates per sample, on a preserve imbedded in a human-dominated landscape in California Bay Area, USA, to assess AMR, virulence, and pan-genomic diversity. With single nucleotide polymorphism analyses we predict potential transmission routes. We illustrate the usefulness of E. coli to aid our understanding of and ability to surveil the emergence of zoonotic pathogens created by the mixing of human and wild bacteria in the environment.},
}
RevDate: 2024-02-19
Global diversity, recurrent evolution, and recent selection on amylase structural haplotypes in humans.
bioRxiv : the preprint server for biology pii:2024.02.07.579378.
The adoption of agriculture, first documented ∼12,000 years ago in the Fertile Crescent, triggered a rapid shift toward starch-rich diets in human populations. Amylase genes facilitate starch digestion and increased salivary amylase copy number has been observed in some modern human populations with high starch intake, though evidence of recent selection is lacking. Here, using 52 long-read diploid assemblies and short read data from ∼5,600 contemporary and ancient humans, we resolve the diversity, evolutionary history, and selective impact of structural variation at the amylase locus. We find that both salivary and pancreatic amylase genes have higher copy numbers in populations with agricultural subsistence compared to fishing, hunting, and pastoral groups. We identify 28 distinct amylase structural architectures and demonstrate that identical structures have arisen independently multiple times throughout recent human history. Using a pangenome graph-based approach to infer structural haplotypes across thousands of humans, we identify extensively duplicated haplotypes present at higher frequencies in modern agricultural populations. Leveraging 534 ancient human genomes we find that duplication-containing haplotypes have increased in frequency more than seven-fold over the last 12,000 years providing evidence for recent selection in Eurasians at this locus comparable in magnitude to that at lactase. Together, our study highlights the strong impact of the agricultural revolution on human genomes and the importance of long-read sequencing in identifying signatures of selection at structurally complex loci.
Additional Links: PMID-38370750
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38370750,
year = {2024},
author = {Bolognini, D and Halgren, A and Lou, RN and Raveane, A and Rocha, JL and Guarracino, A and Soranzo, N and Chin, J and Garrison, E and Sudmant, PH},
title = {Global diversity, recurrent evolution, and recent selection on amylase structural haplotypes in humans.},
journal = {bioRxiv : the preprint server for biology},
volume = {},
number = {},
pages = {},
doi = {10.1101/2024.02.07.579378},
pmid = {38370750},
abstract = {The adoption of agriculture, first documented ∼12,000 years ago in the Fertile Crescent, triggered a rapid shift toward starch-rich diets in human populations. Amylase genes facilitate starch digestion and increased salivary amylase copy number has been observed in some modern human populations with high starch intake, though evidence of recent selection is lacking. Here, using 52 long-read diploid assemblies and short read data from ∼5,600 contemporary and ancient humans, we resolve the diversity, evolutionary history, and selective impact of structural variation at the amylase locus. We find that both salivary and pancreatic amylase genes have higher copy numbers in populations with agricultural subsistence compared to fishing, hunting, and pastoral groups. We identify 28 distinct amylase structural architectures and demonstrate that identical structures have arisen independently multiple times throughout recent human history. Using a pangenome graph-based approach to infer structural haplotypes across thousands of humans, we identify extensively duplicated haplotypes present at higher frequencies in modern agricultural populations. Leveraging 534 ancient human genomes we find that duplication-containing haplotypes have increased in frequency more than seven-fold over the last 12,000 years providing evidence for recent selection in Eurasians at this locus comparable in magnitude to that at lactase. Together, our study highlights the strong impact of the agricultural revolution on human genomes and the importance of long-read sequencing in identifying signatures of selection at structurally complex loci.},
}
RevDate: 2024-02-19
Diversity of Vibrio cholerae O1 through the human gastrointestinal tract during cholera.
bioRxiv : the preprint server for biology pii:2024.02.08.579476.
UNLABELLED: Vibrio cholerae O1 causes the diarrheal disease cholera, and the small intestine is the site of active infection. During cholera, cholera toxin is secreted from V. cholerae and induces a massive fluid influx into the small intestine, which causes vomiting and diarrhea. Typically, V. cholerae genomes are sequenced from bacteria passed in stool, but rarely from vomit, a fluid that may more closely represents the site of active infection. We hypothesized that the V. cholerae O1 population bottlenecks along the gastrointestinal tract would result in reduced genetic variation in stool compared to vomit. To test this, we sequenced V. cholerae genomes from ten cholera patients with paired vomit and stool samples. Genetic diversity was low in both vomit and stool, consistent with a single infecting population rather than co-infection with divergent V. cholerae O1 lineages. The number of single nucleotide variants decreased between vomit and stool in four patients, increased in two, and remained unchanged in four. The number of genes encoded in the V. cholerae genome decreased between vomit and stool in eight patients and increased in two. Pangenome analysis of assembled short-read sequencing demonstrated that the toxin-coregulated pilus operon more frequently contained deletions in genomes from vomit compared to stool. However, these deletions were not detected by PCR or long-read sequencing, indicating that interpreting gene presence or absence patterns from short-read data alone may be incomplete. Overall, we found that V. cholerae O1 isolated from stool is genetically similar to V. cholerae recovered from the upper intestinal tract.
IMPORTANCE: Vibrio cholerae O1, the bacterium that causes cholera, is ingested in contaminated food or water and then colonizes the upper small intestine and is excreted in stool. Shed V. cholerae genomes are usually studied, but V. cholerae isolated from vomit may be more representative of where V. cholerae colonizes in the upper intestinal epithelium. V. cholerae may experience bottlenecks, or large reductions in bacterial population sizes or genetic diversity, as it passes through the gut. Passage through the gut may select for distinct V. cholerae mutants that are adapted for survival and gut colonization. We did not find strong evidence for such adaptive mutations, and instead observed that passage through the gut results in modest reductions in V. cholerae genetic diversity, and only in some patients. These results fill a gap in our understanding of the V. cholerae life cycle, transmission, and evolution.
Additional Links: PMID-38370713
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38370713,
year = {2024},
author = {Lypaczewski, P and Chac, D and Dunmire, CN and Tandoc, KM and Chowdhury, F and Khan, AI and Bhuiyan, T and Harris, JB and LaRocque, RC and Calderwood, SB and Ryan, ET and Qadri, F and Shapiro, BJ and Weil, AA},
title = {Diversity of Vibrio cholerae O1 through the human gastrointestinal tract during cholera.},
journal = {bioRxiv : the preprint server for biology},
volume = {},
number = {},
pages = {},
doi = {10.1101/2024.02.08.579476},
pmid = {38370713},
abstract = {UNLABELLED: Vibrio cholerae O1 causes the diarrheal disease cholera, and the small intestine is the site of active infection. During cholera, cholera toxin is secreted from V. cholerae and induces a massive fluid influx into the small intestine, which causes vomiting and diarrhea. Typically, V. cholerae genomes are sequenced from bacteria passed in stool, but rarely from vomit, a fluid that may more closely represents the site of active infection. We hypothesized that the V. cholerae O1 population bottlenecks along the gastrointestinal tract would result in reduced genetic variation in stool compared to vomit. To test this, we sequenced V. cholerae genomes from ten cholera patients with paired vomit and stool samples. Genetic diversity was low in both vomit and stool, consistent with a single infecting population rather than co-infection with divergent V. cholerae O1 lineages. The number of single nucleotide variants decreased between vomit and stool in four patients, increased in two, and remained unchanged in four. The number of genes encoded in the V. cholerae genome decreased between vomit and stool in eight patients and increased in two. Pangenome analysis of assembled short-read sequencing demonstrated that the toxin-coregulated pilus operon more frequently contained deletions in genomes from vomit compared to stool. However, these deletions were not detected by PCR or long-read sequencing, indicating that interpreting gene presence or absence patterns from short-read data alone may be incomplete. Overall, we found that V. cholerae O1 isolated from stool is genetically similar to V. cholerae recovered from the upper intestinal tract.
IMPORTANCE: Vibrio cholerae O1, the bacterium that causes cholera, is ingested in contaminated food or water and then colonizes the upper small intestine and is excreted in stool. Shed V. cholerae genomes are usually studied, but V. cholerae isolated from vomit may be more representative of where V. cholerae colonizes in the upper intestinal epithelium. V. cholerae may experience bottlenecks, or large reductions in bacterial population sizes or genetic diversity, as it passes through the gut. Passage through the gut may select for distinct V. cholerae mutants that are adapted for survival and gut colonization. We did not find strong evidence for such adaptive mutations, and instead observed that passage through the gut results in modest reductions in V. cholerae genetic diversity, and only in some patients. These results fill a gap in our understanding of the V. cholerae life cycle, transmission, and evolution.},
}
RevDate: 2024-02-19
Genomic analysis of Ralstonia pickettii reveals the genetic features for potential pathogenicity and adaptive evolution in drinking water.
Frontiers in microbiology, 14:1272636.
Ralstonia pickettii, the most critical clinical pathogen of the genus Ralstonia, has been identified as a causative agent of numerous harmful infections. Additionally, Ralstonia pickettii demonstrates adaptability to extreme environmental conditions, such as those found in drinking water. In this study, we conducted a comprehensive genomic analysis to investigate the genomic characteristics related to potential pathogenicity and adaptive evolution in drinking water environments of Ralstonia pickettii. Through phylogenetic analysis and population genetic analysis, we divided Ralstonia pickettii into five Groups, two of which were associated with drinking water environments. The open pan-genome with a large and flexible gene repertoire indicated a high genetic plasticity. Significant differences in functional enrichment were observed between the core- and pan-genome of different groups. Diverse mobile genetic elements (MGEs), extensive genomic rearrangements, and horizontal gene transfer (HGT) events played a crucial role in generating genetic diversity. In drinking water environments, Ralstonia pickettii exhibited strong adaptability, and the acquisition of specific adaptive genes was potentially facilitated by genomic islands (GIs) and HGT. Furthermore, environmental pressures drove the adaptive evolution of Ralstonia pickettii, leading to the accumulation of unique mutations in key genes. These mutations may have a significant impact on various physiological functions, particularly carbon metabolism and energy metabolism. The presence of virulence-related elements associated with macromolecular secretion systems, virulence factors, and antimicrobial resistance indicated the potential pathogenicity of Ralstonia pickettii, making it capable of causing multiple nosocomial infections. This study provides comprehensive insights into the potential pathogenicity and adaptive evolution of Ralstonia pickettii in drinking water environments from a genomic perspective.
Additional Links: PMID-38370577
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38370577,
year = {2023},
author = {Yuan, C and An, T and Li, X and Zou, J and Lin, Z and Gu, J and Hu, R and Fang, Z},
title = {Genomic analysis of Ralstonia pickettii reveals the genetic features for potential pathogenicity and adaptive evolution in drinking water.},
journal = {Frontiers in microbiology},
volume = {14},
number = {},
pages = {1272636},
pmid = {38370577},
issn = {1664-302X},
abstract = {Ralstonia pickettii, the most critical clinical pathogen of the genus Ralstonia, has been identified as a causative agent of numerous harmful infections. Additionally, Ralstonia pickettii demonstrates adaptability to extreme environmental conditions, such as those found in drinking water. In this study, we conducted a comprehensive genomic analysis to investigate the genomic characteristics related to potential pathogenicity and adaptive evolution in drinking water environments of Ralstonia pickettii. Through phylogenetic analysis and population genetic analysis, we divided Ralstonia pickettii into five Groups, two of which were associated with drinking water environments. The open pan-genome with a large and flexible gene repertoire indicated a high genetic plasticity. Significant differences in functional enrichment were observed between the core- and pan-genome of different groups. Diverse mobile genetic elements (MGEs), extensive genomic rearrangements, and horizontal gene transfer (HGT) events played a crucial role in generating genetic diversity. In drinking water environments, Ralstonia pickettii exhibited strong adaptability, and the acquisition of specific adaptive genes was potentially facilitated by genomic islands (GIs) and HGT. Furthermore, environmental pressures drove the adaptive evolution of Ralstonia pickettii, leading to the accumulation of unique mutations in key genes. These mutations may have a significant impact on various physiological functions, particularly carbon metabolism and energy metabolism. The presence of virulence-related elements associated with macromolecular secretion systems, virulence factors, and antimicrobial resistance indicated the potential pathogenicity of Ralstonia pickettii, making it capable of causing multiple nosocomial infections. This study provides comprehensive insights into the potential pathogenicity and adaptive evolution of Ralstonia pickettii in drinking water environments from a genomic perspective.},
}
RevDate: 2024-02-16
Genomic basis of environmental adaptation in the widespread poly-extremophilic Exiguobacterium group.
The ISME journal, 18(1):.
Delineating cohesive ecological units and determining the genetic basis for their environmental adaptation are among the most important objectives in microbiology. In the last decade, many studies have been devoted to characterizing the genetic diversity in microbial populations to address these issues. However, the impact of extreme environmental conditions, such as temperature and salinity, on microbial ecology and evolution remains unclear so far. In order to better understand the mechanisms of adaptation, we studied the (pan)genome of Exiguobacterium, a poly-extremophile bacterium able to grow in a wide range of environments, from permafrost to hot springs. To have the genome for all known Exiguobacterium type strains, we first sequenced those that were not yet available. Using a reverse-ecology approach, we showed how the integration of phylogenomic information, genomic features, gene and pathway enrichment data, regulatory element analyses, protein amino acid composition, and protein structure analyses of the entire Exiguobacterium pangenome allows to sharply delineate ecological units consisting of mesophilic, psychrophilic, halophilic-mesophilic, and halophilic-thermophilic ecotypes. This in-depth study clarified the genetic basis of the defined ecotypes and identified some key mechanisms driving the environmental adaptation to extreme environments. Our study points the way to organizing the vast microbial diversity into meaningful ecologically units, which, in turn, provides insight into how microbial communities adapt and respond to different environmental conditions in a changing world.
Additional Links: PMID-38365240
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38365240,
year = {2024},
author = {Shen, L and Liu, Y and Chen, L and Lei, T and Ren, P and Ji, M and Song, W and Lin, H and Su, W and Wang, S and Rooman, M and Pucci, F},
title = {Genomic basis of environmental adaptation in the widespread poly-extremophilic Exiguobacterium group.},
journal = {The ISME journal},
volume = {18},
number = {1},
pages = {},
doi = {10.1093/ismejo/wrad020},
pmid = {38365240},
issn = {1751-7370},
support = {U21A20176//National Natural Science Foundation of China/ ; 2019QZKK0503//Second Tibetan Plateau Scientific Expedition and Research/ ; 92251304//National Natural Science Foundation of China/ ; swzy202008//Open Project Fund of Anhui Provincial Key Laboratory of Protection and Utilization of Important Biological Resources/ ; 2022AH010012//Anhui Provincial Engineering Research Centre for Molecular Detection and Diagnostics/ ; },
abstract = {Delineating cohesive ecological units and determining the genetic basis for their environmental adaptation are among the most important objectives in microbiology. In the last decade, many studies have been devoted to characterizing the genetic diversity in microbial populations to address these issues. However, the impact of extreme environmental conditions, such as temperature and salinity, on microbial ecology and evolution remains unclear so far. In order to better understand the mechanisms of adaptation, we studied the (pan)genome of Exiguobacterium, a poly-extremophile bacterium able to grow in a wide range of environments, from permafrost to hot springs. To have the genome for all known Exiguobacterium type strains, we first sequenced those that were not yet available. Using a reverse-ecology approach, we showed how the integration of phylogenomic information, genomic features, gene and pathway enrichment data, regulatory element analyses, protein amino acid composition, and protein structure analyses of the entire Exiguobacterium pangenome allows to sharply delineate ecological units consisting of mesophilic, psychrophilic, halophilic-mesophilic, and halophilic-thermophilic ecotypes. This in-depth study clarified the genetic basis of the defined ecotypes and identified some key mechanisms driving the environmental adaptation to extreme environments. Our study points the way to organizing the vast microbial diversity into meaningful ecologically units, which, in turn, provides insight into how microbial communities adapt and respond to different environmental conditions in a changing world.},
}
RevDate: 2024-02-16
Human pangenome analysis of sequences missing from the reference genome reveals their widespread evolutionary, phenotypic, and functional roles.
Nucleic acids research pii:7607875 [Epub ahead of print].
Nonreference sequences (NRSs) are DNA sequences present in global populations but absent in the current human reference genome. However, the extent and functional significance of NRSs in the human genomes and populations remains unclear. Here, we de novo assembled 539 genomes from five genetically divergent human populations using long-read sequencing technology, resulting in the identification of 5.1 million NRSs. These were merged into 45284 unique NRSs, with 29.7% being novel discoveries. Among these NRSs, 38.7% were common across the five populations, and 35.6% were population specific. The use of a graph-based pangenome approach allowed for the detection of 565 transcript expression quantitative trait loci on NRSs, with 426 of these being novel findings. Moreover, 26 NRS candidates displayed evidence of adaptive selection within human populations. Genes situated in close proximity to or intersecting with these candidates may be associated with metabolism and type 2 diabetes. Genome-wide association studies revealed 14 NRSs to be significantly associated with eight phenotypes. Additionally, 154 NRSs were found to be in strong linkage disequilibrium with 258 phenotype-associated SNPs in the GWAS catalogue. Our work expands the understanding of human NRSs and provides novel insights into their functions, facilitating evolutionary and biomedical researches.
Additional Links: PMID-38364871
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38364871,
year = {2024},
author = {Wu, Z and Li, T and Jiang, Z and Zheng, J and Gu, Y and Liu, Y and Liu, Y and Xie, Z},
title = {Human pangenome analysis of sequences missing from the reference genome reveals their widespread evolutionary, phenotypic, and functional roles.},
journal = {Nucleic acids research},
volume = {},
number = {},
pages = {},
doi = {10.1093/nar/gkae086},
pmid = {38364871},
issn = {1362-4962},
support = {2019YFA0904400//National Key Research and Development Program of China/ ; 202201020336//Science and Technology Program of Guangzhou, China/ ; },
abstract = {Nonreference sequences (NRSs) are DNA sequences present in global populations but absent in the current human reference genome. However, the extent and functional significance of NRSs in the human genomes and populations remains unclear. Here, we de novo assembled 539 genomes from five genetically divergent human populations using long-read sequencing technology, resulting in the identification of 5.1 million NRSs. These were merged into 45284 unique NRSs, with 29.7% being novel discoveries. Among these NRSs, 38.7% were common across the five populations, and 35.6% were population specific. The use of a graph-based pangenome approach allowed for the detection of 565 transcript expression quantitative trait loci on NRSs, with 426 of these being novel findings. Moreover, 26 NRS candidates displayed evidence of adaptive selection within human populations. Genes situated in close proximity to or intersecting with these candidates may be associated with metabolism and type 2 diabetes. Genome-wide association studies revealed 14 NRSs to be significantly associated with eight phenotypes. Additionally, 154 NRSs were found to be in strong linkage disequilibrium with 258 phenotype-associated SNPs in the GWAS catalogue. Our work expands the understanding of human NRSs and provides novel insights into their functions, facilitating evolutionary and biomedical researches.},
}
RevDate: 2024-02-16
DandD: Efficient measurement of sequence growth and similarity.
iScience, 27(3):109054 pii:S2589-0042(24)00275-X.
Genome assembly databases are growing rapidly. The redundancy of sequence content between a new assembly and previous ones is neither conceptually nor algorithmically easy to measure. We introduce pertinent methods and DandD, a tool addressing how much new sequence is gained when a sequence collection grows. DandD can describe how much structural variation is discovered in each new human genome assembly and when discoveries will level off in the future. DandD uses a measure called δ ("delta"), developed initially for data compression and chiefly dependent on k-mer counts. DandD rapidly estimates δ using genomic sketches. We propose δ as an alternative to k-mer-specific cardinalities when computing the Jaccard coefficient, thereby avoiding the pitfalls of a poor choice of k. We demonstrate the utility of DandD's functions for estimating δ, characterizing the rate of pangenome growth, and computing all-pairs similarities using k-independent Jaccard.
Additional Links: PMID-38361606
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38361606,
year = {2024},
author = {Bonnie, JK and Ahmed, OY and Langmead, B},
title = {DandD: Efficient measurement of sequence growth and similarity.},
journal = {iScience},
volume = {27},
number = {3},
pages = {109054},
doi = {10.1016/j.isci.2024.109054},
pmid = {38361606},
issn = {2589-0042},
abstract = {Genome assembly databases are growing rapidly. The redundancy of sequence content between a new assembly and previous ones is neither conceptually nor algorithmically easy to measure. We introduce pertinent methods and DandD, a tool addressing how much new sequence is gained when a sequence collection grows. DandD can describe how much structural variation is discovered in each new human genome assembly and when discoveries will level off in the future. DandD uses a measure called δ ("delta"), developed initially for data compression and chiefly dependent on k-mer counts. DandD rapidly estimates δ using genomic sketches. We propose δ as an alternative to k-mer-specific cardinalities when computing the Jaccard coefficient, thereby avoiding the pitfalls of a poor choice of k. We demonstrate the utility of DandD's functions for estimating δ, characterizing the rate of pangenome growth, and computing all-pairs similarities using k-independent Jaccard.},
}
RevDate: 2024-02-15
Advance typing of Vibrio parahaemolyticus through the mtlA and aer gene: A high-resolution, cost-effective approach.
Heliyon, 10(3):e25642.
Vibrio parahaemolyticus is a significant cause of foodborne illness, and its incidence worldwide is on the rise. It is thus imperative to develop a straightforward and efficient method for typing strains of this pathogen. In this study, we conducted a pangenome analysis of 75 complete genomes of V. parahaemolyticus and identified the core gene mtlA with the highest degree of variation, which distinguished 44 strains and outperformed traditional seven-gene-based MLST when combined with aer, another core gene with high degree of variation. The mtlA gene had higher resolution to type strains with a close relationship compared to the traditional MLST genes in the phylogenetic tree built by core genomes. Strong positive selection was also detected in the gene mtlA (ω > 1), representing adaptive and evolution in response to the environment. Therefore, the panel of gene mtlA and aer may serve as a tool for the typing of V. parahaemolyticus, potentially contributing to the prevention and control of this foodborne disease.
Additional Links: PMID-38356529
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38356529,
year = {2024},
author = {Zhou, L and Liu, D and Zhu, Y and Zhang, Z and Chen, S and Zhao, G and Zheng, H},
title = {Advance typing of Vibrio parahaemolyticus through the mtlA and aer gene: A high-resolution, cost-effective approach.},
journal = {Heliyon},
volume = {10},
number = {3},
pages = {e25642},
pmid = {38356529},
issn = {2405-8440},
abstract = {Vibrio parahaemolyticus is a significant cause of foodborne illness, and its incidence worldwide is on the rise. It is thus imperative to develop a straightforward and efficient method for typing strains of this pathogen. In this study, we conducted a pangenome analysis of 75 complete genomes of V. parahaemolyticus and identified the core gene mtlA with the highest degree of variation, which distinguished 44 strains and outperformed traditional seven-gene-based MLST when combined with aer, another core gene with high degree of variation. The mtlA gene had higher resolution to type strains with a close relationship compared to the traditional MLST genes in the phylogenetic tree built by core genomes. Strong positive selection was also detected in the gene mtlA (ω > 1), representing adaptive and evolution in response to the environment. Therefore, the panel of gene mtlA and aer may serve as a tool for the typing of V. parahaemolyticus, potentially contributing to the prevention and control of this foodborne disease.},
}
RevDate: 2024-02-14
Pangenome genotyped structural variation improves molecular phenotype mapping in cattle.
Genome research pii:gr.278267.123 [Epub ahead of print].
Expression and splicing quantitative trait loci (e/sQTL) are large contributors to phenotypic variability. Achieving sufficient statistical power for e/sQTL mapping requires large cohorts with both genotypes and molecular phenotypes, and so the genomic variation is often called from short-read alignments which are unable to comprehensively resolve structural variation. Here we build a pangenome from 16 HiFi haplotype-resolved assemblies to identify small and structural variation and genotype them with PanGenie in 307 short-read samples. We find high (>90%) concordance of PanGenie-genotyped and DeepVariant-called small variation, and confidently genotype close to 21M small and 43k structural variants in the larger population. We validate 85% of these structural variants (with MAF>0.1) directly with a subset of 25 short-read samples that also have medium coverage HiFi reads. We then conduct e/sQTL mapping with this comprehensive variant set in a subset of 117 cattle that have testis transcriptome data and find 92 structural variants as causal candidates for eQTL and 73 for sQTL. We find that roughly half of top associated structural variants affecting expression or splicing are transposable elements, such as SV-eQTLs for STN1 and MYH7 and SV-sQTLs for CEP89 and ASAH2 Extensive linkage disequilibrium between small and structural variation results in only 28 additional eQTL and 17 sQTL discovered when including SVs, although many top associated SVs are compelling candidates.
Additional Links: PMID-38355307
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38355307,
year = {2024},
author = {Leonard, AS and Mapel, XM and Pausch, H},
title = {Pangenome genotyped structural variation improves molecular phenotype mapping in cattle.},
journal = {Genome research},
volume = {},
number = {},
pages = {},
doi = {10.1101/gr.278267.123},
pmid = {38355307},
issn = {1549-5469},
abstract = {Expression and splicing quantitative trait loci (e/sQTL) are large contributors to phenotypic variability. Achieving sufficient statistical power for e/sQTL mapping requires large cohorts with both genotypes and molecular phenotypes, and so the genomic variation is often called from short-read alignments which are unable to comprehensively resolve structural variation. Here we build a pangenome from 16 HiFi haplotype-resolved assemblies to identify small and structural variation and genotype them with PanGenie in 307 short-read samples. We find high (>90%) concordance of PanGenie-genotyped and DeepVariant-called small variation, and confidently genotype close to 21M small and 43k structural variants in the larger population. We validate 85% of these structural variants (with MAF>0.1) directly with a subset of 25 short-read samples that also have medium coverage HiFi reads. We then conduct e/sQTL mapping with this comprehensive variant set in a subset of 117 cattle that have testis transcriptome data and find 92 structural variants as causal candidates for eQTL and 73 for sQTL. We find that roughly half of top associated structural variants affecting expression or splicing are transposable elements, such as SV-eQTLs for STN1 and MYH7 and SV-sQTLs for CEP89 and ASAH2 Extensive linkage disequilibrium between small and structural variation results in only 28 additional eQTL and 17 sQTL discovered when including SVs, although many top associated SVs are compelling candidates.},
}
RevDate: 2024-02-14
Average Nucleotide Identity based Staphylococcus aureus strain grouping allows identification of strain-specific genes in the pangenome.
bioRxiv : the preprint server for biology pii:2024.01.29.577756.
UNLABELLED: Staphylococcus aureus causes both hospital and community acquired infections in humans worldwide. Due to the high incidence of infection S. aureus is also one of the most sampled and sequenced pathogens today, providing an outstanding resource to understand variation at the bacterial subspecies level. We processed and downsampled 83,383 public S. aureus Illumina whole genome shotgun sequences and 1,263 complete genomes to produce 7,954 representative substrains. Pairwise comparison of core gene Average Nucleotide Identity (ANI) revealed a natural boundary of 99.5% that could be used to define 145 distinct strains within the species. We found that intermediate frequency genes in the pangenome (present in 10-95% of genomes) could be divided into those closely linked to strain background ("strain-concentrated") and those highly variable within strains ("strain-diffuse"). Non-core genes had different patterns of chromosome location; notably, strain-diffuse associated with prophages, strain-concentrated with the vSaβ genome island and rare genes (<10% frequency) concentrated near the origin of replication. Antibiotic genes were enriched in the strain-diffuse class, while virulence genes were distributed between strain-diffuse, strain-concentrated, core and rare classes. This study shows how different patterns of gene movement help create strains as distinct subspecies entities and provide insight into the diverse histories of important S. aureus functions.
IMPORTANCE: We analyzed the genomic diversity of Staphylococcus aureus , a globally prevalent bacterial species that causes serious infections in humans. Our goal was to build a genetic picture of the different strains of S. aureus and which genes may be associated with them. We used a large public dataset (>84,000 genomes) that was re-processed and subsampled to remove redundancy. We found that individual genomes could be grouped into strains by sharing > 99.5% identical nucleotide sequence of the core part of their genome. We also showed that a portion of genes that are present in intermediate frequency in the species are strongly associated with some strains but completely absent from others, suggesting a role in strain-specificity. This work lays the foundation for understanding individual gene histories of the S. aureus species and also outlines strategies for processing large bacterial genomic datasets.
Additional Links: PMID-38352482
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38352482,
year = {2024},
author = {Raghuram, V and Petit, RA and Karol, Z and Mehta, R and Weissman, DB and Read, TD},
title = {Average Nucleotide Identity based Staphylococcus aureus strain grouping allows identification of strain-specific genes in the pangenome.},
journal = {bioRxiv : the preprint server for biology},
volume = {},
number = {},
pages = {},
doi = {10.1101/2024.01.29.577756},
pmid = {38352482},
abstract = {UNLABELLED: Staphylococcus aureus causes both hospital and community acquired infections in humans worldwide. Due to the high incidence of infection S. aureus is also one of the most sampled and sequenced pathogens today, providing an outstanding resource to understand variation at the bacterial subspecies level. We processed and downsampled 83,383 public S. aureus Illumina whole genome shotgun sequences and 1,263 complete genomes to produce 7,954 representative substrains. Pairwise comparison of core gene Average Nucleotide Identity (ANI) revealed a natural boundary of 99.5% that could be used to define 145 distinct strains within the species. We found that intermediate frequency genes in the pangenome (present in 10-95% of genomes) could be divided into those closely linked to strain background ("strain-concentrated") and those highly variable within strains ("strain-diffuse"). Non-core genes had different patterns of chromosome location; notably, strain-diffuse associated with prophages, strain-concentrated with the vSaβ genome island and rare genes (<10% frequency) concentrated near the origin of replication. Antibiotic genes were enriched in the strain-diffuse class, while virulence genes were distributed between strain-diffuse, strain-concentrated, core and rare classes. This study shows how different patterns of gene movement help create strains as distinct subspecies entities and provide insight into the diverse histories of important S. aureus functions.
IMPORTANCE: We analyzed the genomic diversity of Staphylococcus aureus , a globally prevalent bacterial species that causes serious infections in humans. Our goal was to build a genetic picture of the different strains of S. aureus and which genes may be associated with them. We used a large public dataset (>84,000 genomes) that was re-processed and subsampled to remove redundancy. We found that individual genomes could be grouped into strains by sharing > 99.5% identical nucleotide sequence of the core part of their genome. We also showed that a portion of genes that are present in intermediate frequency in the species are strongly associated with some strains but completely absent from others, suggesting a role in strain-specificity. This work lays the foundation for understanding individual gene histories of the S. aureus species and also outlines strategies for processing large bacterial genomic datasets.},
}
RevDate: 2024-02-14
Large-scale gene expression alterations introduced by structural variation drive morphotype diversification in Brassica oleracea.
Nature genetics [Epub ahead of print].
Brassica oleracea, globally cultivated for its vegetable crops, consists of very diverse morphotypes, characterized by specialized enlarged organs as harvested products. This makes B. oleracea an ideal model for studying rapid evolution and domestication. We constructed a B. oleracea pan-genome from 27 high-quality genomes representing all morphotypes and their wild relatives. We identified structural variations (SVs) among these genomes and characterized these in 704 B. oleracea accessions using graph-based genome tools. We show that SVs exert bidirectional effects on the expression of numerous genes, either suppressing through DNA methylation or promoting probably by harboring transcription factor-binding elements. The following examples illustrate the role of SVs modulating gene expression: SVs promoting BoPNY and suppressing BoCKX3 in cauliflower/broccoli, suppressing BoKAN1 and BoACS4 in cabbage and promoting BoMYBtf in ornamental kale. These results provide solid evidence for the role of SVs as dosage regulators of gene expression, driving B. oleracea domestication and diversification.
Additional Links: PMID-38351383
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38351383,
year = {2024},
author = {Li, X and Wang, Y and Cai, C and Ji, J and Han, F and Zhang, L and Chen, S and Zhang, L and Yang, Y and Tang, Q and Bucher, J and Wang, X and Yang, L and Zhuang, M and Zhang, K and Lv, H and Bonnema, G and Zhang, Y and Cheng, F},
title = {Large-scale gene expression alterations introduced by structural variation drive morphotype diversification in Brassica oleracea.},
journal = {Nature genetics},
volume = {},
number = {},
pages = {},
pmid = {38351383},
issn = {1546-1718},
support = {31972411//National Natural Science Foundation of China (National Science Foundation of China)/ ; 31722048//National Natural Science Foundation of China (National Science Foundation of China)/ ; 32172578//National Natural Science Foundation of China (National Science Foundation of China)/ ; 201809110159//China Scholarship Council (CSC)/ ; },
abstract = {Brassica oleracea, globally cultivated for its vegetable crops, consists of very diverse morphotypes, characterized by specialized enlarged organs as harvested products. This makes B. oleracea an ideal model for studying rapid evolution and domestication. We constructed a B. oleracea pan-genome from 27 high-quality genomes representing all morphotypes and their wild relatives. We identified structural variations (SVs) among these genomes and characterized these in 704 B. oleracea accessions using graph-based genome tools. We show that SVs exert bidirectional effects on the expression of numerous genes, either suppressing through DNA methylation or promoting probably by harboring transcription factor-binding elements. The following examples illustrate the role of SVs modulating gene expression: SVs promoting BoPNY and suppressing BoCKX3 in cauliflower/broccoli, suppressing BoKAN1 and BoACS4 in cabbage and promoting BoMYBtf in ornamental kale. These results provide solid evidence for the role of SVs as dosage regulators of gene expression, driving B. oleracea domestication and diversification.},
}
RevDate: 2024-02-12
Genomic analysis and experimental pathogenic characterization of Riemerella anatipestifer isolates from chickens in China.
Poultry science, 103(4):103497 pii:S0032-5791(24)00076-2 [Epub ahead of print].
Waterfowl have a high likelihood of being infected with Riemerella anatipestifer. Although the pathogen is found in domestic ducks, turkeys, geese, and wild birds, there is little information available about the consequences of infection during egg laying and hatching in chickens. Here, we present the first report of a novel sequence type of R. anatipestifer S63 isolated from chickens in China. On the basis of pan-genome analysis, we showed S63's genome occupies a distinct branch with other R. anatipestifer isolates from other hosts. Galleria mellonella larval tests indicated that S63 is less virulent than R. anatipestifer Ra36 isolated from ducks. Ducks and hens are susceptible to S63 infection. There is no mortality rate for chickens or ducks, but adult chickens experience neurological symptoms that reduce egg production and hatching rates. In chickens, S63 might be passed vertically from parents to offspring, resulting in "jelly-like" lifeless embryos. Using quantitative PCR, S63 was detected in the brain, liver, reproductive organs, and embryos. As far as we know, this is the first report of R. anatipestifer in hens, a disease that can reduce egg productivity, lower hatching rates, and produce jelly-like lifeless embryos, and the first report to raise the possibility that hens can be infected by roosters via semen.
Additional Links: PMID-38346372
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38346372,
year = {2024},
author = {Chen, Y and Li, X and Liu, Z and Hu, M and Ma, J and Luo, Y and Zhang, Q and Li, L and Zhao, X and Zhao, M and Liu, W and Liu, Y},
title = {Genomic analysis and experimental pathogenic characterization of Riemerella anatipestifer isolates from chickens in China.},
journal = {Poultry science},
volume = {103},
number = {4},
pages = {103497},
doi = {10.1016/j.psj.2024.103497},
pmid = {38346372},
issn = {1525-3171},
abstract = {Waterfowl have a high likelihood of being infected with Riemerella anatipestifer. Although the pathogen is found in domestic ducks, turkeys, geese, and wild birds, there is little information available about the consequences of infection during egg laying and hatching in chickens. Here, we present the first report of a novel sequence type of R. anatipestifer S63 isolated from chickens in China. On the basis of pan-genome analysis, we showed S63's genome occupies a distinct branch with other R. anatipestifer isolates from other hosts. Galleria mellonella larval tests indicated that S63 is less virulent than R. anatipestifer Ra36 isolated from ducks. Ducks and hens are susceptible to S63 infection. There is no mortality rate for chickens or ducks, but adult chickens experience neurological symptoms that reduce egg production and hatching rates. In chickens, S63 might be passed vertically from parents to offspring, resulting in "jelly-like" lifeless embryos. Using quantitative PCR, S63 was detected in the brain, liver, reproductive organs, and embryos. As far as we know, this is the first report of R. anatipestifer in hens, a disease that can reduce egg productivity, lower hatching rates, and produce jelly-like lifeless embryos, and the first report to raise the possibility that hens can be infected by roosters via semen.},
}
RevDate: 2024-02-10
Comparative Analysis of Chloroplast Pan-Genomes and Transcriptomics Reveals Cold Adaptation in Medicago sativa.
International journal of molecular sciences, 25(3): pii:ijms25031776.
Alfalfa (Medicago sativa) is a perennial forage legume that is widely distributed all over the world; therefore, it has an extremely complex genetic background. Though population structure and phylogenetic studies have been conducted on a large group of alfalfa nuclear genomes, information about the chloroplast genomes is still lacking. Chloroplast genomes are generally considered to be conservative and play an important role in population diversity analysis and species adaptation in plants. Here, 231 complete alfalfa chloroplast genomes were successfully assembled from 359 alfalfa resequencing data, on the basis of which the alfalfa chloroplast pan-genome was constructed. We investigated the genetic variations of the alfalfa chloroplast genome through comparative genomic, genetic diversity, phylogenetic, population genetic structure, and haplotype analysis. Meanwhile, the expression of alfalfa chloroplast genes under cold stress was explored through transcriptome analysis. As a result, chloroplast genomes of 231 alfalfa lack an IR region, and the size of the chloroplast genome ranges from 125,192 bp to 126,105 bp. Using population structure, haplotypes, and construction of a phylogenetic tree, it was found that alfalfa populations could be divided into four groups, and multiple highly variable regions were found in the alfalfa chloroplast genome. Transcriptome analysis showed that tRNA genes were significantly up-regulated in the cold-sensitive varieties, while rps7, rpl32, and ndhB were down-regulated, and the editing efficiency of ycf1, ycf2, and ndhF was decreased in the cold-tolerant varieties, which may be due to the fact that chloroplasts store nutrients through photosynthesis to resist cold. The huge number of genetic variants in this study provide powerful resources for molecular markers.
Additional Links: PMID-38339052
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38339052,
year = {2024},
author = {Zhang, T and Chen, X and Yan, W and Li, M and Huang, W and Liu, Q and Li, Y and Guo, C and Shu, Y},
title = {Comparative Analysis of Chloroplast Pan-Genomes and Transcriptomics Reveals Cold Adaptation in Medicago sativa.},
journal = {International journal of molecular sciences},
volume = {25},
number = {3},
pages = {},
doi = {10.3390/ijms25031776},
pmid = {38339052},
issn = {1422-0067},
support = {LH2022C050//Natural Science Foundation of Heilongjiang Province/ ; HSDSSCX2023-42//the Innovative Project for Postgraduate Students of Harbin Normal University/ ; FKL-202203//the Open Fund of Yunnan Province Flower Breeding Key Laboratory/ ; 202301BD070001-208//Agriculture Joint Special Project of Science and Technology Plan Project of Yunnan Science and Technology Department/ ; 530000210000000013742//the Green Food Brand Build a Special Project (Floriculture) supported by Yunnan Provincial Fi-nance Department/ ; U21A20182//the Natural and Science Foundation of China/ ; Qian Liu//Construction of Tengchong Rural Revitalization Technological Innovation County/ ; },
abstract = {Alfalfa (Medicago sativa) is a perennial forage legume that is widely distributed all over the world; therefore, it has an extremely complex genetic background. Though population structure and phylogenetic studies have been conducted on a large group of alfalfa nuclear genomes, information about the chloroplast genomes is still lacking. Chloroplast genomes are generally considered to be conservative and play an important role in population diversity analysis and species adaptation in plants. Here, 231 complete alfalfa chloroplast genomes were successfully assembled from 359 alfalfa resequencing data, on the basis of which the alfalfa chloroplast pan-genome was constructed. We investigated the genetic variations of the alfalfa chloroplast genome through comparative genomic, genetic diversity, phylogenetic, population genetic structure, and haplotype analysis. Meanwhile, the expression of alfalfa chloroplast genes under cold stress was explored through transcriptome analysis. As a result, chloroplast genomes of 231 alfalfa lack an IR region, and the size of the chloroplast genome ranges from 125,192 bp to 126,105 bp. Using population structure, haplotypes, and construction of a phylogenetic tree, it was found that alfalfa populations could be divided into four groups, and multiple highly variable regions were found in the alfalfa chloroplast genome. Transcriptome analysis showed that tRNA genes were significantly up-regulated in the cold-sensitive varieties, while rps7, rpl32, and ndhB were down-regulated, and the editing efficiency of ycf1, ycf2, and ndhF was decreased in the cold-tolerant varieties, which may be due to the fact that chloroplasts store nutrients through photosynthesis to resist cold. The huge number of genetic variants in this study provide powerful resources for molecular markers.},
}
RevDate: 2024-02-10
PanEffect: a pan-genome visualization tool for variant effects in maize.
Bioinformatics (Oxford, England) pii:7604577 [Epub ahead of print].
UNLABELLED: Understanding the effects of genetic variants is crucial for accurately predicting traits and functional outcomes. Recent approaches have utilized artificial intelligence and protein language models to score all possible missense variant effects at the proteome level for a single genome, but a reliable tool is needed to explore these effects at the pan-genome level. To address this gap, we introduce a new tool called PanEffect. We implemented PanEffect at MaizeGDB to enable a comprehensive examination of the potential effects of coding variants across 50 maize genomes. The tool allows users to visualize over 550 million possible amino acid substitutions in the B73 maize reference genome and to observe the effects of the 2.3 million natural variations in the maize pan-genome. Each variant effect score, calculated from the Evolutionary Scale Modeling (ESM) protein language model, shows the log-likelihood ratio difference between B73 and all variants in the pan-genome. These scores are shown using heatmaps spanning benign outcomes to potential functional consequences. Additionally, PanEffect displays secondary structures and functional domains along with the variant effects, offering additional functional and structural context. Using PanEffect, researchers now have a platform to explore protein variants and identify genetic targets for crop enhancement.
AVAILABILITY: The PanEffect code is freely available on GitHub (https://github.com/Maize-Genetics-and-Genomics-Database/PanEffect). A maize implementation of PanEffect and underlying datasets are available at MaizeGDB (https://www.maizegdb.org/effect/maize/).
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Additional Links: PMID-38337024
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38337024,
year = {2024},
author = {Andorf, CM and Haley, OC and Hayford, RK and Portwood, JL and Harding, S and Sen, S and Cannon, EK and Gardiner, JM and Kim, HS and Woodhouse, MR},
title = {PanEffect: a pan-genome visualization tool for variant effects in maize.},
journal = {Bioinformatics (Oxford, England)},
volume = {},
number = {},
pages = {},
doi = {10.1093/bioinformatics/btae073},
pmid = {38337024},
issn = {1367-4811},
abstract = {UNLABELLED: Understanding the effects of genetic variants is crucial for accurately predicting traits and functional outcomes. Recent approaches have utilized artificial intelligence and protein language models to score all possible missense variant effects at the proteome level for a single genome, but a reliable tool is needed to explore these effects at the pan-genome level. To address this gap, we introduce a new tool called PanEffect. We implemented PanEffect at MaizeGDB to enable a comprehensive examination of the potential effects of coding variants across 50 maize genomes. The tool allows users to visualize over 550 million possible amino acid substitutions in the B73 maize reference genome and to observe the effects of the 2.3 million natural variations in the maize pan-genome. Each variant effect score, calculated from the Evolutionary Scale Modeling (ESM) protein language model, shows the log-likelihood ratio difference between B73 and all variants in the pan-genome. These scores are shown using heatmaps spanning benign outcomes to potential functional consequences. Additionally, PanEffect displays secondary structures and functional domains along with the variant effects, offering additional functional and structural context. Using PanEffect, researchers now have a platform to explore protein variants and identify genetic targets for crop enhancement.
AVAILABILITY: The PanEffect code is freely available on GitHub (https://github.com/Maize-Genetics-and-Genomics-Database/PanEffect). A maize implementation of PanEffect and underlying datasets are available at MaizeGDB (https://www.maizegdb.org/effect/maize/).
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.},
}
RevDate: 2024-02-09
Evaluating the Mechanism of Cell Death in Melanoma Induced by the Cannabis Extract PHEC-66.
Cells, 13(3): pii:cells13030268.
Research suggests the potential of using cannabinoid-derived compounds to function as anticancer agents against melanoma cells. Our recent study highlighted the remarkable in vitro anticancer effects of PHEC-66, an extract from Cannabis sativa, on the MM418-C1, MM329, and MM96L melanoma cell lines. However, the complete molecular mechanism behind this action remains to be elucidated. This study aims to unravel how PHEC-66 brings about its antiproliferative impact on these cell lines, utilising diverse techniques such as real-time polymerase chain reaction (qPCR), assays to assess the inhibition of CB1 and CB2 receptors, measurement of reactive oxygen species (ROS), apoptosis assays, and fluorescence-activated cell sorting (FACS) for apoptosis and cell cycle analysis. The outcomes obtained from this study suggest that PHEC-66 triggers apoptosis in these melanoma cell lines by increasing the expression of pro-apoptotic markers (BAX mRNA) while concurrently reducing the expression of anti-apoptotic markers (Bcl-2 mRNA). Additionally, PHEC-66 induces DNA fragmentation, halting cell progression at the G1 cell cycle checkpoint and substantially elevating intracellular ROS levels. These findings imply that PHEC-66 might have potential as an adjuvant therapy in the treatment of malignant melanoma. However, it is essential to conduct further preclinical investigations to delve deeper into its potential and efficacy.
Additional Links: PMID-38334660
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38334660,
year = {2024},
author = {Bachari, A and Nassar, N and Telukutla, S and Zomer, R and Piva, TJ and Mantri, N},
title = {Evaluating the Mechanism of Cell Death in Melanoma Induced by the Cannabis Extract PHEC-66.},
journal = {Cells},
volume = {13},
number = {3},
pages = {},
doi = {10.3390/cells13030268},
pmid = {38334660},
issn = {2073-4409},
support = {Not Applicable//MGC Pharmaceuticals Ltd/ ; },
abstract = {Research suggests the potential of using cannabinoid-derived compounds to function as anticancer agents against melanoma cells. Our recent study highlighted the remarkable in vitro anticancer effects of PHEC-66, an extract from Cannabis sativa, on the MM418-C1, MM329, and MM96L melanoma cell lines. However, the complete molecular mechanism behind this action remains to be elucidated. This study aims to unravel how PHEC-66 brings about its antiproliferative impact on these cell lines, utilising diverse techniques such as real-time polymerase chain reaction (qPCR), assays to assess the inhibition of CB1 and CB2 receptors, measurement of reactive oxygen species (ROS), apoptosis assays, and fluorescence-activated cell sorting (FACS) for apoptosis and cell cycle analysis. The outcomes obtained from this study suggest that PHEC-66 triggers apoptosis in these melanoma cell lines by increasing the expression of pro-apoptotic markers (BAX mRNA) while concurrently reducing the expression of anti-apoptotic markers (Bcl-2 mRNA). Additionally, PHEC-66 induces DNA fragmentation, halting cell progression at the G1 cell cycle checkpoint and substantially elevating intracellular ROS levels. These findings imply that PHEC-66 might have potential as an adjuvant therapy in the treatment of malignant melanoma. However, it is essential to conduct further preclinical investigations to delve deeper into its potential and efficacy.},
}
RevDate: 2024-02-09
Taxonomic classification of genus Aeromonas using open reading frame-based binarized structure network analysis.
Fujita medical journal, 10(1):8-15.
OBJECTIVES: Taxonomic assignment based on whole-genome sequencing data facilitates clear demarcation of species within a complex genus. Here, we applied a unique pan-genome phylogenetic method, open reading frame (ORF)-based binarized structure network analysis (OSNA), for taxonomic inference of Aeromonas spp., a complex taxonomic group consisting of 30 species.
METHODS: Data from 335 publicly available Aeromonas genomes, including the reference genomes of 30 species, were used to build a phylogenetic tree using OSNA. In OSNA, whole-genome structures are expressed as binary sequences based on the presence or absence of ORFs, and a tree is generated using neighbor-net, a distance-based method for constructing phylogenetic networks from binary sequences. The tree built by OSNA was compared to that constructed by a core-genome single-nucleotide polymorphism (SNP)-based analysis. Furthermore, the orthologous average nucleotide identity (OrthoANI) values of the sequences that clustered in a single clade in the OSNA-based tree were calculated.
RESULTS: The phylogenetic tree constructed with OSNA successfully delineated the majority of species of the genus Aeromonas forming conspecific clades for individual species, which was corroborated by OrthoANI values. Moreover, the OSNA-based phylogenetic tree demonstrated high compositional similarity to the core-genome SNP-based phylogenetic tree, supported by the Fowlkes-Mallows index.
CONCLUSIONS: We propose that OSNA is a useful tool in predicting the taxonomic classification of complex bacterial genera.
Additional Links: PMID-38332778
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38332778,
year = {2024},
author = {Sakurai, A and Suzuki, M and Hayashi, K and Doi, Y},
title = {Taxonomic classification of genus Aeromonas using open reading frame-based binarized structure network analysis.},
journal = {Fujita medical journal},
volume = {10},
number = {1},
pages = {8-15},
doi = {10.20407/fmj.2023-007},
pmid = {38332778},
issn = {2189-7255},
abstract = {OBJECTIVES: Taxonomic assignment based on whole-genome sequencing data facilitates clear demarcation of species within a complex genus. Here, we applied a unique pan-genome phylogenetic method, open reading frame (ORF)-based binarized structure network analysis (OSNA), for taxonomic inference of Aeromonas spp., a complex taxonomic group consisting of 30 species.
METHODS: Data from 335 publicly available Aeromonas genomes, including the reference genomes of 30 species, were used to build a phylogenetic tree using OSNA. In OSNA, whole-genome structures are expressed as binary sequences based on the presence or absence of ORFs, and a tree is generated using neighbor-net, a distance-based method for constructing phylogenetic networks from binary sequences. The tree built by OSNA was compared to that constructed by a core-genome single-nucleotide polymorphism (SNP)-based analysis. Furthermore, the orthologous average nucleotide identity (OrthoANI) values of the sequences that clustered in a single clade in the OSNA-based tree were calculated.
RESULTS: The phylogenetic tree constructed with OSNA successfully delineated the majority of species of the genus Aeromonas forming conspecific clades for individual species, which was corroborated by OrthoANI values. Moreover, the OSNA-based phylogenetic tree demonstrated high compositional similarity to the core-genome SNP-based phylogenetic tree, supported by the Fowlkes-Mallows index.
CONCLUSIONS: We propose that OSNA is a useful tool in predicting the taxonomic classification of complex bacterial genera.},
}
RevDate: 2024-02-08
Genomic surveillance of Clostridioides difficile transmission and virulence in a healthcare setting.
mBio [Epub ahead of print].
Clostridioides difficile infection (CDI) is a major cause of healthcare-associated diarrhea, despite the widespread implementation of contact precautions for patients with CDI. Here, we investigate strain contamination in a hospital setting and the genomic determinants of disease outcomes. Across two wards over 6 months, we selectively cultured C. difficile from patients (n = 384) and their environments. Whole-genome sequencing (WGS) of 146 isolates revealed that most C. difficile isolates were from clade 1 (131/146, 89.7%), while only one isolate of the hypervirulent ST1 was recovered. Of culture-positive admissions (n = 79), 19 (24%) patients were colonized with toxigenic C. difficile on admission to the hospital. We defined 25 strain networks at ≤2 core gene single nucleotide polymorphisms; two of these networks contain strains from different patients. Strain networks were temporally linked (P < 0.0001). To understand the genomic correlates of the disease, we conducted WGS on an additional cohort of C. difficile (n = 102 isolates) from the same hospital and confirmed that clade 1 isolates are responsible for most CDI cases. We found that while toxigenic C. difficile isolates are associated with the presence of cdtR, nontoxigenic isolates have an increased abundance of prophages. Our pangenomic analysis of clade 1 isolates suggests that while toxin genes (tcdABER and cdtR) were associated with CDI symptoms, they are dispensable for patient colonization. These data indicate that toxigenic and nontoxigenic C. difficile contamination persist in a hospital setting and highlight further investigation into how accessory genomic repertoires contribute to C. difficile colonization and disease.IMPORTANCEClostridioides difficile infection remains a leading cause of hospital-associated diarrhea, despite increased antibiotic stewardship and transmission prevention strategies. This suggests a changing genomic landscape of C. difficile. Our study provides insight into the nature of prevalent C. difficile strains in a hospital setting and transmission patterns among carriers. Longitudinal sampling of surfaces and patient stool revealed that both toxigenic and nontoxigenic strains of C. difficile clade 1 dominate these two wards. Moreover, quantification of transmission in carriers of these clade 1 isolates underscores the need to revisit infection prevention measures in this patient group. We identified unique genetic signatures associated with virulence in this clade. Our data highlight the complexities of preventing transmission of this pathogen in a hospital setting and the need to investigate the mechanisms of in vivo persistence and virulence of prevalent lineages in the host gut microbiome.
Additional Links: PMID-38329369
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38329369,
year = {2024},
author = {Newcomer, EP and Fishbein, SRS and Zhang, K and Hink, T and Reske, KA and Cass, C and Iqbal, ZH and Struttmann, EL and Burnham, C-AD and Dubberke, ER and Dantas, G},
title = {Genomic surveillance of Clostridioides difficile transmission and virulence in a healthcare setting.},
journal = {mBio},
volume = {},
number = {},
pages = {e0330023},
doi = {10.1128/mbio.03300-23},
pmid = {38329369},
issn = {2150-7511},
abstract = {Clostridioides difficile infection (CDI) is a major cause of healthcare-associated diarrhea, despite the widespread implementation of contact precautions for patients with CDI. Here, we investigate strain contamination in a hospital setting and the genomic determinants of disease outcomes. Across two wards over 6 months, we selectively cultured C. difficile from patients (n = 384) and their environments. Whole-genome sequencing (WGS) of 146 isolates revealed that most C. difficile isolates were from clade 1 (131/146, 89.7%), while only one isolate of the hypervirulent ST1 was recovered. Of culture-positive admissions (n = 79), 19 (24%) patients were colonized with toxigenic C. difficile on admission to the hospital. We defined 25 strain networks at ≤2 core gene single nucleotide polymorphisms; two of these networks contain strains from different patients. Strain networks were temporally linked (P < 0.0001). To understand the genomic correlates of the disease, we conducted WGS on an additional cohort of C. difficile (n = 102 isolates) from the same hospital and confirmed that clade 1 isolates are responsible for most CDI cases. We found that while toxigenic C. difficile isolates are associated with the presence of cdtR, nontoxigenic isolates have an increased abundance of prophages. Our pangenomic analysis of clade 1 isolates suggests that while toxin genes (tcdABER and cdtR) were associated with CDI symptoms, they are dispensable for patient colonization. These data indicate that toxigenic and nontoxigenic C. difficile contamination persist in a hospital setting and highlight further investigation into how accessory genomic repertoires contribute to C. difficile colonization and disease.IMPORTANCEClostridioides difficile infection remains a leading cause of hospital-associated diarrhea, despite increased antibiotic stewardship and transmission prevention strategies. This suggests a changing genomic landscape of C. difficile. Our study provides insight into the nature of prevalent C. difficile strains in a hospital setting and transmission patterns among carriers. Longitudinal sampling of surfaces and patient stool revealed that both toxigenic and nontoxigenic strains of C. difficile clade 1 dominate these two wards. Moreover, quantification of transmission in carriers of these clade 1 isolates underscores the need to revisit infection prevention measures in this patient group. We identified unique genetic signatures associated with virulence in this clade. Our data highlight the complexities of preventing transmission of this pathogen in a hospital setting and the need to investigate the mechanisms of in vivo persistence and virulence of prevalent lineages in the host gut microbiome.},
}
RevDate: 2024-02-07
Comparative genomics analysis reveals genetic characteristics and nitrogen fixation profile of Bradyrhizobium.
iScience, 27(2):108948 pii:S2589-0042(24)00169-X.
Bradyrhizobium is a genus of nitrogen-fixing bacteria, with some species producing nodules in leguminous plants. Investigations into Bradyrhizobium have recently revealed its substantial genetic resources and agricultural benefits, but a comprehensive survey of its genetic diversity and functional properties is lacking. Using a panel of various strains (N = 278), this study performed a comparative genomics analysis to anticipate genes linked with symbiotic nitrogen fixation. Bradyrhizobium's pan-genome consisted of 84,078 gene families, containing 824 core genes and 42,409 accessory genes. Core genes were mainly involved in crucial cell processes, while accessory genes served diverse functions, including nitrogen fixation and nodulation. Three distinct genetic profiles were identified based on the presence/absence of gene clusters related to nodulation, nitrogen fixation, and secretion systems. Most Bradyrhizobium strains from soil and non-leguminous plants lacked major nif/nod genes and were evolutionarily more closely related. These findings shed light on Bradyrhizobium's genetic features for symbiotic nitrogen fixation.
Additional Links: PMID-38322985
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38322985,
year = {2024},
author = {Zhong, C and Hu, G and Hu, C and Xu, C and Zhang, Z and Ning, K},
title = {Comparative genomics analysis reveals genetic characteristics and nitrogen fixation profile of Bradyrhizobium.},
journal = {iScience},
volume = {27},
number = {2},
pages = {108948},
doi = {10.1016/j.isci.2024.108948},
pmid = {38322985},
issn = {2589-0042},
abstract = {Bradyrhizobium is a genus of nitrogen-fixing bacteria, with some species producing nodules in leguminous plants. Investigations into Bradyrhizobium have recently revealed its substantial genetic resources and agricultural benefits, but a comprehensive survey of its genetic diversity and functional properties is lacking. Using a panel of various strains (N = 278), this study performed a comparative genomics analysis to anticipate genes linked with symbiotic nitrogen fixation. Bradyrhizobium's pan-genome consisted of 84,078 gene families, containing 824 core genes and 42,409 accessory genes. Core genes were mainly involved in crucial cell processes, while accessory genes served diverse functions, including nitrogen fixation and nodulation. Three distinct genetic profiles were identified based on the presence/absence of gene clusters related to nodulation, nitrogen fixation, and secretion systems. Most Bradyrhizobium strains from soil and non-leguminous plants lacked major nif/nod genes and were evolutionarily more closely related. These findings shed light on Bradyrhizobium's genetic features for symbiotic nitrogen fixation.},
}
RevDate: 2024-02-02
A sequence-aware merger of genomic structural variations at population scale.
Nature communications, 15(1):960.
Merging structural variations (SVs) at the population level presents a significant challenge, yet it is essential for conducting comprehensive genotypic analyses, especially in the era of pangenomics. Here, we introduce PanPop, a tool that utilizes an advanced sequence-aware SV merging algorithm to efficiently merge SVs of various types. We demonstrate that PanPop can merge and optimize the majority of multiallelic SVs into informative biallelic variants. We show its superior precision and lower rates of missing data compared to alternative software solutions. Our approach not only enables the filtering of SVs by leveraging multiple SV callers for enhanced accuracy but also facilitates the accurate merging of large-scale population SVs. These capabilities of PanPop will help to accelerate future SV-related studies.
Additional Links: PMID-38307885
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38307885,
year = {2024},
author = {Zheng, Z and Zhu, M and Zhang, J and Liu, X and Hou, L and Liu, W and Yuan, S and Luo, C and Yao, X and Liu, J and Yang, Y},
title = {A sequence-aware merger of genomic structural variations at population scale.},
journal = {Nature communications},
volume = {15},
number = {1},
pages = {960},
pmid = {38307885},
issn = {2041-1723},
abstract = {Merging structural variations (SVs) at the population level presents a significant challenge, yet it is essential for conducting comprehensive genotypic analyses, especially in the era of pangenomics. Here, we introduce PanPop, a tool that utilizes an advanced sequence-aware SV merging algorithm to efficiently merge SVs of various types. We demonstrate that PanPop can merge and optimize the majority of multiallelic SVs into informative biallelic variants. We show its superior precision and lower rates of missing data compared to alternative software solutions. Our approach not only enables the filtering of SVs by leveraging multiple SV callers for enhanced accuracy but also facilitates the accurate merging of large-scale population SVs. These capabilities of PanPop will help to accelerate future SV-related studies.},
}
RevDate: 2024-02-02
Comparative genomic analyses of Cutibacterium granulosum provide insights into genomic diversity.
Frontiers in microbiology, 15:1343227.
Cutibacterium granulosum, a commensal bacterium found on human skin, formerly known as Propionibacterium granulosum, rarely causes infections and is generally considered non-pathogenic. Recent research has revealed the transferability of the multidrug-resistant plasmid pTZC1 between C. granulosum and Cutibacterium acnes, the latter being an opportunistic pathogen in surgical site infections. However, there is a noticeable lack of research on the genome of C. granulosum, and the genetic landscape of this species remains largely uncharted. We investigated the genomic features and evolutionary structure of C. granulosum by analyzing a total of 30 Metagenome-Assembled Genomes (MAGs) and isolate genomes retrieved from public databases, as well as those generated in this study. A pan-genome of 6,077 genes was identified for C. granulosum. Remarkably, the 'cloud genes' constituted 62.38% of the pan-genome. Genes associated with mobilome: prophages, transposons [X], defense mechanisms [V] and replication, recombination and repair [L] were enriched in the cloud genome. Phylogenomic analysis revealed two distinct mono-clades, highlighting the genomic diversity of C. granulosum. The genomic diversity was further confirmed by the distribution of Average Nucleotide Identity (ANI) values. The functional profiles analysis of C. granulosum unveiled a wide range of potential Antibiotic Resistance Genes (ARGs) and virulence factors, suggesting its potential tolerance to various environmental challenges. Subtype I-E of the CRISPR-Cas system was the most abundant in these genomes, a feature also detected in C. acnes genomes. Given the widespread distribution of C. granulosum strains within skin microbiome, our findings make a substantial contribution to our broader understanding of the genetic diversity, which may open new avenues for investigating the mechanisms and treatment of conditions such as acne vulgaris.
Additional Links: PMID-38304712
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38304712,
year = {2024},
author = {Chen, P and Wang, S and Li, H and Qi, X and Hou, Y and Ma, T},
title = {Comparative genomic analyses of Cutibacterium granulosum provide insights into genomic diversity.},
journal = {Frontiers in microbiology},
volume = {15},
number = {},
pages = {1343227},
doi = {10.3389/fmicb.2024.1343227},
pmid = {38304712},
issn = {1664-302X},
abstract = {Cutibacterium granulosum, a commensal bacterium found on human skin, formerly known as Propionibacterium granulosum, rarely causes infections and is generally considered non-pathogenic. Recent research has revealed the transferability of the multidrug-resistant plasmid pTZC1 between C. granulosum and Cutibacterium acnes, the latter being an opportunistic pathogen in surgical site infections. However, there is a noticeable lack of research on the genome of C. granulosum, and the genetic landscape of this species remains largely uncharted. We investigated the genomic features and evolutionary structure of C. granulosum by analyzing a total of 30 Metagenome-Assembled Genomes (MAGs) and isolate genomes retrieved from public databases, as well as those generated in this study. A pan-genome of 6,077 genes was identified for C. granulosum. Remarkably, the 'cloud genes' constituted 62.38% of the pan-genome. Genes associated with mobilome: prophages, transposons [X], defense mechanisms [V] and replication, recombination and repair [L] were enriched in the cloud genome. Phylogenomic analysis revealed two distinct mono-clades, highlighting the genomic diversity of C. granulosum. The genomic diversity was further confirmed by the distribution of Average Nucleotide Identity (ANI) values. The functional profiles analysis of C. granulosum unveiled a wide range of potential Antibiotic Resistance Genes (ARGs) and virulence factors, suggesting its potential tolerance to various environmental challenges. Subtype I-E of the CRISPR-Cas system was the most abundant in these genomes, a feature also detected in C. acnes genomes. Given the widespread distribution of C. granulosum strains within skin microbiome, our findings make a substantial contribution to our broader understanding of the genetic diversity, which may open new avenues for investigating the mechanisms and treatment of conditions such as acne vulgaris.},
}
RevDate: 2024-02-01
The Impact of Patterns in Linkage Disequilibrium and Sequencing Quality on the Imprint of Balancing Selection.
Genome biology and evolution pii:7596324 [Epub ahead of print].
Regions under balancing selection are characterized by dense polymorphisms and multiple persistent haplotypes, along with other sequence complexities. Successful identification of these patterns depends on both the statistical approach and the quality of sequencing. To address this challenge, at first, a new statistical method called LD-ABF was developed, employing efficient Bayesian techniques to effectively test for balancing selection. LD-ABF demonstrated the most robust detection of selection in a variety of simulation scenarios, compared against a range of existing tests/tools (Tajima's D, HKA, Dng, BetaScan, and BalLerMix). Furthermore, the impact of the quality of sequencing on detection of balancing selection was explored, as well, using: 1) SNP genotyping and exome data, 2) targeted high-resolution HLA genotyping (IHIW), and 3) whole-genome long-read sequencing data (Pangenome). In the analysis of SNP genotyping and exome data, we identified known targets and 38 new selection signatures in genes not previously linked to balancing selection. To further investigate the impact of sequencing quality on detection of balancing selection, a detailed investigation of the MHC was performed with high-resolution HLA typing data. Higher quality sequencing revealed the HLA-DQ genes consistently demonstrated strong selection signatures otherwise not observed from the sparser SNP array and exome data. The HLA-DQ selection signature was also replicated in the Pangenome samples using considerably less samples but, with high quality long-read sequence data. The improved statistical method, coupled with higher quality sequencing, leads to more consistent identification of selection and enhanced localization of variants under selection, particularly in complex regions.
Additional Links: PMID-38302106
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38302106,
year = {2024},
author = {Hayeck, TJ and Li, Y and Mosbruger, TL and Bradfield, JP and Gleason, AG and Damianos, G and Shaw, GT and Duke, JL and Conlin, LK and Turner, TN and Fernández-Viña, MA and Sarmady, M and Monos, DS},
title = {The Impact of Patterns in Linkage Disequilibrium and Sequencing Quality on the Imprint of Balancing Selection.},
journal = {Genome biology and evolution},
volume = {},
number = {},
pages = {},
doi = {10.1093/gbe/evae009},
pmid = {38302106},
issn = {1759-6653},
abstract = {Regions under balancing selection are characterized by dense polymorphisms and multiple persistent haplotypes, along with other sequence complexities. Successful identification of these patterns depends on both the statistical approach and the quality of sequencing. To address this challenge, at first, a new statistical method called LD-ABF was developed, employing efficient Bayesian techniques to effectively test for balancing selection. LD-ABF demonstrated the most robust detection of selection in a variety of simulation scenarios, compared against a range of existing tests/tools (Tajima's D, HKA, Dng, BetaScan, and BalLerMix). Furthermore, the impact of the quality of sequencing on detection of balancing selection was explored, as well, using: 1) SNP genotyping and exome data, 2) targeted high-resolution HLA genotyping (IHIW), and 3) whole-genome long-read sequencing data (Pangenome). In the analysis of SNP genotyping and exome data, we identified known targets and 38 new selection signatures in genes not previously linked to balancing selection. To further investigate the impact of sequencing quality on detection of balancing selection, a detailed investigation of the MHC was performed with high-resolution HLA typing data. Higher quality sequencing revealed the HLA-DQ genes consistently demonstrated strong selection signatures otherwise not observed from the sparser SNP array and exome data. The HLA-DQ selection signature was also replicated in the Pangenome samples using considerably less samples but, with high quality long-read sequence data. The improved statistical method, coupled with higher quality sequencing, leads to more consistent identification of selection and enhanced localization of variants under selection, particularly in complex regions.},
}
RevDate: 2024-02-01
Complete genome sequence and potential pathogenic assessment of Flavobacterium plurextorum RSG-18 isolated from the gut of Schlegel's black rockfish, Sebastes schlegelii.
Environmental microbiology reports [Epub ahead of print].
Flavobacterium plurextorum is a potential fish pathogen of interest, previously isolated from diseased rainbow trout (Oncorhynchus mykiss) and oomycete-infected chum salmon (Oncorhynchus keta) eggs. We report here the first complete genome sequence of F. plurextorum RSG-18 isolated from the gut of Schlegel's black rockfish (Sebastes schlegelii). The genome of RSG-18 consists of a circular chromosome of 5,610,911 bp with a 33.57% GC content, containing 4858 protein-coding genes, 18 rRNAs, 63 tRNAs and 1 tmRNA. A comparative analysis was conducted on 11 Flavobacterium species previously reported as pathogens or isolated from diseased fish to confirm the potential pathogenicity of RSG-18. In the SEED classification, RSG-18 was found to have 36 genes categorized in 'Virulence, Disease and Defense'. Across all Flavobacterium species, a total of 16 antibiotic resistance genes and 61 putative virulence factors were identified. All species had at least one phage region and type I, III and IX secretion systems. In pan-genomic analysis, core genes consist of genes linked to phages, integrases and matrix-tolerated elements associated with pathology. The complete genome sequence of F. plurextorum RSG-18 will serve as a foundation for future research, enhancing our understanding of Flavobacterium pathogenicity in fish and contributing to the development of effective prevention strategies.
Additional Links: PMID-38298071
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38298071,
year = {2024},
author = {Lee, J and Cha, IT and Lee, KE and Son, YK and Cho, S and Seol, D},
title = {Complete genome sequence and potential pathogenic assessment of Flavobacterium plurextorum RSG-18 isolated from the gut of Schlegel's black rockfish, Sebastes schlegelii.},
journal = {Environmental microbiology reports},
volume = {},
number = {},
pages = {},
doi = {10.1111/1758-2229.13226},
pmid = {38298071},
issn = {1758-2229},
support = {NIBR202134204//National Institute of Biological Resources, Ministry of Environment/ ; },
abstract = {Flavobacterium plurextorum is a potential fish pathogen of interest, previously isolated from diseased rainbow trout (Oncorhynchus mykiss) and oomycete-infected chum salmon (Oncorhynchus keta) eggs. We report here the first complete genome sequence of F. plurextorum RSG-18 isolated from the gut of Schlegel's black rockfish (Sebastes schlegelii). The genome of RSG-18 consists of a circular chromosome of 5,610,911 bp with a 33.57% GC content, containing 4858 protein-coding genes, 18 rRNAs, 63 tRNAs and 1 tmRNA. A comparative analysis was conducted on 11 Flavobacterium species previously reported as pathogens or isolated from diseased fish to confirm the potential pathogenicity of RSG-18. In the SEED classification, RSG-18 was found to have 36 genes categorized in 'Virulence, Disease and Defense'. Across all Flavobacterium species, a total of 16 antibiotic resistance genes and 61 putative virulence factors were identified. All species had at least one phage region and type I, III and IX secretion systems. In pan-genomic analysis, core genes consist of genes linked to phages, integrases and matrix-tolerated elements associated with pathology. The complete genome sequence of F. plurextorum RSG-18 will serve as a foundation for future research, enhancing our understanding of Flavobacterium pathogenicity in fish and contributing to the development of effective prevention strategies.},
}
RevDate: 2024-01-31
Prevalence and Molecular Characteristics of Ceftazidime-avibactam Resistance among carbapenem-resistant Pseudomonas aeruginosa Clinical Isolates.
Journal of global antimicrobial resistance pii:S2213-7165(24)00014-6 [Epub ahead of print].
BACKGROUND: Resistance against ceftazidime-avibactam (CZA) in carbapenem-resistant Pseudomonas aeruginosa (CRPA) is emerging. This study was aimed at detecting the prevalence and molecular characteristics of CZA-resistant CRPA clinical isolates in Guangdong Province, China.
METHODS: The antimicrobial susceptibility profile of these strains was determined. A subset of sixteen CZA-resistant CRPA isolates was analyzed by whole genome sequencing (WGS). Genetic surroundings of carbapenem resistance genes and pan-genome-wide association analysis were further studied.
RESULTS: Of the 250 CRPA isolates, CZA resistance rate was 6.4% (16/250). The minimum inhibitory concentration (MIC) of CZA range was from 0.25 to >256 mg/L. MIC50 and MIC90 were 2/4 and 8/4 mg/L, respectively. Among the sixteen CZA-resistant CRPA strains, 31.3% (5/16) of them carried class B carbapenem resistance genes including blaIMP-4, blaIMP-45 and blaVIM-2, located on IncP-2 megaplasmids or chromosome, respectively. Pan-genome-wide association analysis of accessory genes for CZA-susceptible or -resistant CRPA isolates showed that PA1874, a hypothetical protein containing BapA prefix-like domain, was enriched in CZA-resistant group significantly.
CONCLUSIONS: Class B carbapenem resistance genes play important roles in CZA resistance. Meanwhile, PA1874 gene may be a novel mechanism involving in CZA resistance. It is necessary to continually monitor CZA-resistant CRPA isolates.
Additional Links: PMID-38295902
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38295902,
year = {2024},
author = {Chen, Y and Xiang, G and Liu, P and Zhou, X and Guo, P and Wu, Z and Yang, J and Chen, P and Huang, J and Liao, K},
title = {Prevalence and Molecular Characteristics of Ceftazidime-avibactam Resistance among carbapenem-resistant Pseudomonas aeruginosa Clinical Isolates.},
journal = {Journal of global antimicrobial resistance},
volume = {},
number = {},
pages = {},
doi = {10.1016/j.jgar.2024.01.014},
pmid = {38295902},
issn = {2213-7173},
abstract = {BACKGROUND: Resistance against ceftazidime-avibactam (CZA) in carbapenem-resistant Pseudomonas aeruginosa (CRPA) is emerging. This study was aimed at detecting the prevalence and molecular characteristics of CZA-resistant CRPA clinical isolates in Guangdong Province, China.
METHODS: The antimicrobial susceptibility profile of these strains was determined. A subset of sixteen CZA-resistant CRPA isolates was analyzed by whole genome sequencing (WGS). Genetic surroundings of carbapenem resistance genes and pan-genome-wide association analysis were further studied.
RESULTS: Of the 250 CRPA isolates, CZA resistance rate was 6.4% (16/250). The minimum inhibitory concentration (MIC) of CZA range was from 0.25 to >256 mg/L. MIC50 and MIC90 were 2/4 and 8/4 mg/L, respectively. Among the sixteen CZA-resistant CRPA strains, 31.3% (5/16) of them carried class B carbapenem resistance genes including blaIMP-4, blaIMP-45 and blaVIM-2, located on IncP-2 megaplasmids or chromosome, respectively. Pan-genome-wide association analysis of accessory genes for CZA-susceptible or -resistant CRPA isolates showed that PA1874, a hypothetical protein containing BapA prefix-like domain, was enriched in CZA-resistant group significantly.
CONCLUSIONS: Class B carbapenem resistance genes play important roles in CZA resistance. Meanwhile, PA1874 gene may be a novel mechanism involving in CZA resistance. It is necessary to continually monitor CZA-resistant CRPA isolates.},
}
RevDate: 2024-01-31
Insights into group-specific pattern of secondary metabolite gene cluster in Burkholderia genus.
Frontiers in microbiology, 14:1302236.
Burkholderia is a versatile strain that has expanded into several genera. It has been steadily reported that the genome features of Burkholderia exhibit activities ranging from plant growth promotion to pathogenicity across various isolation areas. The objective of this study was to investigate the secondary metabolite patterns of 366 Burkholderia species through comparative genomics. Samples were selected based on assembly quality assessment and similarity below 80% in average nucleotide identity. Duplicate samples were excluded. Samples were divided into two groups using FastANI analysis. Group A included B. pseudomallei complex. Group B included B. cepacia complex. The limitations of MLST were proposed. The detection of genes was performed, including environmental and virulence-related genes. In the pan-genome analysis, each complex possessed a similar pattern of cluster for orthologous groups. Group A (n = 185) had 14,066 cloud genes, 2,465 shell genes, 682 soft-core genes, and 2,553 strict-core genes. Group B (n = 181) had 39,867 cloud genes, 4,986 shell genes, 324 soft-core genes, 222 core genes, and 2,949 strict-core genes. AntiSMASH was employed to analyze the biosynthetic gene cluster (BGC). The results were then utilized for network analysis using BiG-SCAPE and CORASON. Principal component analysis was conducted and a table was constructed using the results obtained from antiSMASH. The results were divided into Group A and Group B. We expected the various species to show similar patterns of secondary metabolite gene clusters. For in-depth analysis, a network analysis of secondary metabolite gene clusters was conducted, exemplified by BiG-SCAPE analysis. Depending on the species and complex, Burkholderia possessed several kinds of siderophore. Among them, ornibactin was possessed in most Burkholderia and was clustered into 4,062 clans. There was a similar pattern of gene clusters depending on the species. NRPS_04014 belonged to siderophore BGCs including ornibactin and indigoidine. However, it was observed that each family included a similar species. This suggests that, besides siderophores being species-specific, the ornibactin gene cluster itself might also be species-specific. The results suggest that siderophores are associated with environmental adaptation, possessing a similar pattern of siderophore gene clusters among species, which could provide another perspective on species-specific environmental adaptation mechanisms.
Additional Links: PMID-38293557
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38293557,
year = {2023},
author = {Kim, B and Han, SR and Lee, H and Oh, TJ},
title = {Insights into group-specific pattern of secondary metabolite gene cluster in Burkholderia genus.},
journal = {Frontiers in microbiology},
volume = {14},
number = {},
pages = {1302236},
doi = {10.3389/fmicb.2023.1302236},
pmid = {38293557},
issn = {1664-302X},
abstract = {Burkholderia is a versatile strain that has expanded into several genera. It has been steadily reported that the genome features of Burkholderia exhibit activities ranging from plant growth promotion to pathogenicity across various isolation areas. The objective of this study was to investigate the secondary metabolite patterns of 366 Burkholderia species through comparative genomics. Samples were selected based on assembly quality assessment and similarity below 80% in average nucleotide identity. Duplicate samples were excluded. Samples were divided into two groups using FastANI analysis. Group A included B. pseudomallei complex. Group B included B. cepacia complex. The limitations of MLST were proposed. The detection of genes was performed, including environmental and virulence-related genes. In the pan-genome analysis, each complex possessed a similar pattern of cluster for orthologous groups. Group A (n = 185) had 14,066 cloud genes, 2,465 shell genes, 682 soft-core genes, and 2,553 strict-core genes. Group B (n = 181) had 39,867 cloud genes, 4,986 shell genes, 324 soft-core genes, 222 core genes, and 2,949 strict-core genes. AntiSMASH was employed to analyze the biosynthetic gene cluster (BGC). The results were then utilized for network analysis using BiG-SCAPE and CORASON. Principal component analysis was conducted and a table was constructed using the results obtained from antiSMASH. The results were divided into Group A and Group B. We expected the various species to show similar patterns of secondary metabolite gene clusters. For in-depth analysis, a network analysis of secondary metabolite gene clusters was conducted, exemplified by BiG-SCAPE analysis. Depending on the species and complex, Burkholderia possessed several kinds of siderophore. Among them, ornibactin was possessed in most Burkholderia and was clustered into 4,062 clans. There was a similar pattern of gene clusters depending on the species. NRPS_04014 belonged to siderophore BGCs including ornibactin and indigoidine. However, it was observed that each family included a similar species. This suggests that, besides siderophores being species-specific, the ornibactin gene cluster itself might also be species-specific. The results suggest that siderophores are associated with environmental adaptation, possessing a similar pattern of siderophore gene clusters among species, which could provide another perspective on species-specific environmental adaptation mechanisms.},
}
RevDate: 2024-01-30
Distinct genomic contexts predict gene presence-absence variation in different pathotypes of Magnaporthe oryzae.
Genetics pii:7593594 [Epub ahead of print].
Fungi use the accessory gene content of their pangenomes to adapt to their environments. While gene presence-absence variation (PAV) contributes to shaping accessory gene reservoirs, the genomic contexts that shape these events remain unclear. Since pangenome studies are typically species-wide and do not analyze different populations separately, it is yet to be uncovered whether PAV patterns and mechanisms are consistent across populations. Fungal plant pathogens are useful models for studying PAV because they rely on it to adapt to their hosts, and members of a species often infect distinct hosts. We analyzed gene PAV in the blast fungus, Magnaporthe oryzae (syn. Pyricularia oryzae), and found that PAV genes involved in host-pathogen and microbe-microbe interactions may drive the adaptation of the fungus to its environment. We then analyzed genomic and epigenomic features of PAV and observed that proximity to transposable elements, gene GC content, gene length, expression level in the host, and histone H3K27me3 marks were different between PAV genes and conserved genes. We used these features to construct a model that was able to predict whether a gene is likely to experience PAV with high precision (86.06%) and recall (92.88%) in M. oryzae. Finally, we found that PAV genes in the rice and wheat pathotypes of M. oryzae differed in their number and their genomic context. Our results suggest that genomic and epigenomic features of gene PAV can be used to better understand and predict fungal pangenome evolution. We also show that substantial intra-species variation can exist in these features.
Additional Links: PMID-38290434
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38290434,
year = {2024},
author = {Joubert, PM and Krasileva, KV},
title = {Distinct genomic contexts predict gene presence-absence variation in different pathotypes of Magnaporthe oryzae.},
journal = {Genetics},
volume = {},
number = {},
pages = {},
doi = {10.1093/genetics/iyae012},
pmid = {38290434},
issn = {1943-2631},
abstract = {Fungi use the accessory gene content of their pangenomes to adapt to their environments. While gene presence-absence variation (PAV) contributes to shaping accessory gene reservoirs, the genomic contexts that shape these events remain unclear. Since pangenome studies are typically species-wide and do not analyze different populations separately, it is yet to be uncovered whether PAV patterns and mechanisms are consistent across populations. Fungal plant pathogens are useful models for studying PAV because they rely on it to adapt to their hosts, and members of a species often infect distinct hosts. We analyzed gene PAV in the blast fungus, Magnaporthe oryzae (syn. Pyricularia oryzae), and found that PAV genes involved in host-pathogen and microbe-microbe interactions may drive the adaptation of the fungus to its environment. We then analyzed genomic and epigenomic features of PAV and observed that proximity to transposable elements, gene GC content, gene length, expression level in the host, and histone H3K27me3 marks were different between PAV genes and conserved genes. We used these features to construct a model that was able to predict whether a gene is likely to experience PAV with high precision (86.06%) and recall (92.88%) in M. oryzae. Finally, we found that PAV genes in the rice and wheat pathotypes of M. oryzae differed in their number and their genomic context. Our results suggest that genomic and epigenomic features of gene PAV can be used to better understand and predict fungal pangenome evolution. We also show that substantial intra-species variation can exist in these features.},
}
RevDate: 2024-01-28
Analysis of five near-complete genome assemblies of the tomato pathogen Cladosporium fulvum uncovers additional accessory chromosomes and structural variations induced by transposable elements effecting the loss of avirulence genes.
BMC biology, 22(1):25.
BACKGROUND: Fungal plant pathogens have dynamic genomes that allow them to rapidly adapt to adverse conditions and overcome host resistance. One way by which this dynamic genome plasticity is expressed is through effector gene loss, which enables plant pathogens to overcome recognition by cognate resistance genes in the host. However, the exact nature of these loses remains elusive in many fungi. This includes the tomato pathogen Cladosporium fulvum, which is the first fungal plant pathogen from which avirulence (Avr) genes were ever cloned and in which loss of Avr genes is often reported as a means of overcoming recognition by cognate tomato Cf resistance genes. A recent near-complete reference genome assembly of C. fulvum isolate Race 5 revealed a compartmentalized genome architecture and the presence of an accessory chromosome, thereby creating a basis for studying genome plasticity in fungal plant pathogens and its impact on avirulence genes.
RESULTS: Here, we obtained near-complete genome assemblies of four additional C. fulvum isolates. The genome assemblies had similar sizes (66.96 to 67.78 Mb), number of predicted genes (14,895 to 14,981), and estimated completeness (98.8 to 98.9%). Comparative analysis that included the genome of isolate Race 5 revealed high levels of synteny and colinearity, which extended to the density and distribution of repetitive elements and of repeat-induced point (RIP) mutations across homologous chromosomes. Nonetheless, structural variations, likely mediated by transposable elements and effecting the deletion of the avirulence genes Avr4E, Avr5, and Avr9, were also identified. The isolates further shared a core set of 13 chromosomes, but two accessory chromosomes were identified as well. Accessory chromosomes were significantly smaller in size, and one carried pseudogenized copies of two effector genes. Whole-genome alignments further revealed genomic islands of near-zero nucleotide diversity interspersed with islands of high nucleotide diversity that co-localized with repeat-rich regions. These regions were likely generated by RIP, which generally asymmetrically affected the genome of C. fulvum.
CONCLUSIONS: Our results reveal new evolutionary aspects of the C. fulvum genome and provide new insights on the importance of genomic structural variations in overcoming host resistance in fungal plant pathogens.
Additional Links: PMID-38281938
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38281938,
year = {2024},
author = {Zaccaron, AZ and Stergiopoulos, I},
title = {Analysis of five near-complete genome assemblies of the tomato pathogen Cladosporium fulvum uncovers additional accessory chromosomes and structural variations induced by transposable elements effecting the loss of avirulence genes.},
journal = {BMC biology},
volume = {22},
number = {1},
pages = {25},
pmid = {38281938},
issn = {1741-7007},
support = {1557995//Directorate for Biological Sciences/ ; CA-D-PPA-2185-H//National Institute of Food and Agriculture/ ; },
abstract = {BACKGROUND: Fungal plant pathogens have dynamic genomes that allow them to rapidly adapt to adverse conditions and overcome host resistance. One way by which this dynamic genome plasticity is expressed is through effector gene loss, which enables plant pathogens to overcome recognition by cognate resistance genes in the host. However, the exact nature of these loses remains elusive in many fungi. This includes the tomato pathogen Cladosporium fulvum, which is the first fungal plant pathogen from which avirulence (Avr) genes were ever cloned and in which loss of Avr genes is often reported as a means of overcoming recognition by cognate tomato Cf resistance genes. A recent near-complete reference genome assembly of C. fulvum isolate Race 5 revealed a compartmentalized genome architecture and the presence of an accessory chromosome, thereby creating a basis for studying genome plasticity in fungal plant pathogens and its impact on avirulence genes.
RESULTS: Here, we obtained near-complete genome assemblies of four additional C. fulvum isolates. The genome assemblies had similar sizes (66.96 to 67.78 Mb), number of predicted genes (14,895 to 14,981), and estimated completeness (98.8 to 98.9%). Comparative analysis that included the genome of isolate Race 5 revealed high levels of synteny and colinearity, which extended to the density and distribution of repetitive elements and of repeat-induced point (RIP) mutations across homologous chromosomes. Nonetheless, structural variations, likely mediated by transposable elements and effecting the deletion of the avirulence genes Avr4E, Avr5, and Avr9, were also identified. The isolates further shared a core set of 13 chromosomes, but two accessory chromosomes were identified as well. Accessory chromosomes were significantly smaller in size, and one carried pseudogenized copies of two effector genes. Whole-genome alignments further revealed genomic islands of near-zero nucleotide diversity interspersed with islands of high nucleotide diversity that co-localized with repeat-rich regions. These regions were likely generated by RIP, which generally asymmetrically affected the genome of C. fulvum.
CONCLUSIONS: Our results reveal new evolutionary aspects of the C. fulvum genome and provide new insights on the importance of genomic structural variations in overcoming host resistance in fungal plant pathogens.},
}
RevDate: 2024-01-26
Co-linear chaining on pangenome graphs.
Algorithms for molecular biology : AMB, 19(1):4.
Pangenome reference graphs are useful in genomics because they compactly represent the genetic diversity within a species, a capability that linear references lack. However, efficiently aligning sequences to these graphs with complex topology and cycles can be challenging. The seed-chain-extend based alignment algorithms use co-linear chaining as a standard technique to identify a good cluster of exact seed matches that can be combined to form an alignment. Recent works show how the co-linear chaining problem can be efficiently solved for acyclic pangenome graphs by exploiting their small width and how incorporating gap cost in the scoring function improves alignment accuracy. However, it remains open on how to effectively generalize these techniques for general pangenome graphs which contain cycles. Here we present the first practical formulation and an exact algorithm for co-linear chaining on cyclic pangenome graphs. We rigorously prove the correctness and computational complexity of the proposed algorithm. We evaluate the empirical performance of our algorithm by aligning simulated long reads from the human genome to a cyclic pangenome graph constructed from 95 publicly available haplotype-resolved human genome assemblies. While the existing heuristic-based algorithms are faster, the proposed algorithm provides a significant advantage in terms of accuracy. Implementation (https://github.com/at-cg/PanAligner).
Additional Links: PMID-38279113
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38279113,
year = {2024},
author = {Rajput, J and Chandra, G and Jain, C},
title = {Co-linear chaining on pangenome graphs.},
journal = {Algorithms for molecular biology : AMB},
volume = {19},
number = {1},
pages = {4},
pmid = {38279113},
issn = {1748-7188},
abstract = {Pangenome reference graphs are useful in genomics because they compactly represent the genetic diversity within a species, a capability that linear references lack. However, efficiently aligning sequences to these graphs with complex topology and cycles can be challenging. The seed-chain-extend based alignment algorithms use co-linear chaining as a standard technique to identify a good cluster of exact seed matches that can be combined to form an alignment. Recent works show how the co-linear chaining problem can be efficiently solved for acyclic pangenome graphs by exploiting their small width and how incorporating gap cost in the scoring function improves alignment accuracy. However, it remains open on how to effectively generalize these techniques for general pangenome graphs which contain cycles. Here we present the first practical formulation and an exact algorithm for co-linear chaining on cyclic pangenome graphs. We rigorously prove the correctness and computational complexity of the proposed algorithm. We evaluate the empirical performance of our algorithm by aligning simulated long reads from the human genome to a cyclic pangenome graph constructed from 95 publicly available haplotype-resolved human genome assemblies. While the existing heuristic-based algorithms are faster, the proposed algorithm provides a significant advantage in terms of accuracy. Implementation (https://github.com/at-cg/PanAligner).},
}
RevDate: 2024-01-26
Genomic landscape of NDM-1 producing multidrug-resistant Providencia stuartii causing burn wound infections in Bangladesh.
Scientific reports, 14(1):2246.
The increasing antimicrobial resistance in Providencia stuartii (P. stuartii) worldwide, particularly concerning for immunocompromised and burn patients, has raised concern in Bangladesh, where the significance of this infectious opportunistic pathogen had been previously overlooked, prompting a need for investigation. The two strains of P. stuartii (P. stuartii SHNIBPS63 and P. stuartii SHNIBPS71) isolated from wound swab of two critically injured burn patients were found to be multidrug-resistant and P. stuartii SHNIBPS63 showed resistance to all the 22 antibiotics tested as well as revealed the co-existence of blaVEB-6 (Class A), blaNDM-1 (Class B), blaOXA-10 (Class D) beta lactamase genes. Complete resistance to carbapenems through the production of NDM-1, is indicative of an alarming situation as carbapenems are considered to be the last line antibiotic to combat this pathogen. Both isolates displayed strong biofilm-forming abilities and exhibited resistance to copper, zinc, and iron, in addition to carrying multiple genes associated with metal resistance and the formation of biofilms. The study also encompassed a pangenome analysis utilizing a dataset of eighty-six publicly available P. stuartii genomes (n = 86), revealing evidence of an open or expanding pangenome for P. stuartii. Also, an extensive genome-wide analysis of all the P. stuartii genomes revealed a concerning global prevalence of diverse antimicrobial resistance genes, with a particular alarm raised over the abundance of carbapenem resistance gene blaNDM-1. Additionally, this study highlighted the notable genetic diversity within P. stuartii, significant informations about phylogenomic relationships and ancestry, as well as potential for cross-species transmission, raising important implications for public health and microbial adaptation across different environments.
Additional Links: PMID-38278862
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38278862,
year = {2024},
author = {Mondol, SM and Islam, I and Islam, MR and Shakil, SK and Rakhi, NN and Mustary, JF and Amiruzzaman, and Gomes, DJ and Shahjalal, HM and Rahaman, MM},
title = {Genomic landscape of NDM-1 producing multidrug-resistant Providencia stuartii causing burn wound infections in Bangladesh.},
journal = {Scientific reports},
volume = {14},
number = {1},
pages = {2246},
pmid = {38278862},
issn = {2045-2322},
support = {LS2019935//Ministry of Education, Government of the People's Republic of Bangladesh/ ; },
abstract = {The increasing antimicrobial resistance in Providencia stuartii (P. stuartii) worldwide, particularly concerning for immunocompromised and burn patients, has raised concern in Bangladesh, where the significance of this infectious opportunistic pathogen had been previously overlooked, prompting a need for investigation. The two strains of P. stuartii (P. stuartii SHNIBPS63 and P. stuartii SHNIBPS71) isolated from wound swab of two critically injured burn patients were found to be multidrug-resistant and P. stuartii SHNIBPS63 showed resistance to all the 22 antibiotics tested as well as revealed the co-existence of blaVEB-6 (Class A), blaNDM-1 (Class B), blaOXA-10 (Class D) beta lactamase genes. Complete resistance to carbapenems through the production of NDM-1, is indicative of an alarming situation as carbapenems are considered to be the last line antibiotic to combat this pathogen. Both isolates displayed strong biofilm-forming abilities and exhibited resistance to copper, zinc, and iron, in addition to carrying multiple genes associated with metal resistance and the formation of biofilms. The study also encompassed a pangenome analysis utilizing a dataset of eighty-six publicly available P. stuartii genomes (n = 86), revealing evidence of an open or expanding pangenome for P. stuartii. Also, an extensive genome-wide analysis of all the P. stuartii genomes revealed a concerning global prevalence of diverse antimicrobial resistance genes, with a particular alarm raised over the abundance of carbapenem resistance gene blaNDM-1. Additionally, this study highlighted the notable genetic diversity within P. stuartii, significant informations about phylogenomic relationships and ancestry, as well as potential for cross-species transmission, raising important implications for public health and microbial adaptation across different environments.},
}
RevDate: 2024-01-25
Bioinformatics of germline variant discovery for rare disease diagnostics: current approaches and remaining challenges.
Briefings in bioinformatics, 25(2):.
Next-generation sequencing (NGS) has revolutionized the field of rare disease diagnostics. Whole exome and whole genome sequencing are now routinely used for diagnostic purposes; however, the overall diagnosis rate remains lower than expected. In this work, we review current approaches used for calling and interpretation of germline genetic variants in the human genome, and discuss the most important challenges that persist in the bioinformatic analysis of NGS data in medical genetics. We describe and attempt to quantitatively assess the remaining problems, such as the quality of the reference genome sequence, reproducible coverage biases, or variant calling accuracy in complex regions of the genome. We also discuss the prospects of switching to the complete human genome assembly or the human pan-genome and important caveats associated with such a switch. We touch on arguably the hardest problem of NGS data analysis for medical genomics, namely, the annotation of genetic variants and their subsequent interpretation. We highlight the most challenging aspects of annotation and prioritization of both coding and non-coding variants. Finally, we demonstrate the persistent prevalence of pathogenic variants in the coding genome, and outline research directions that may enhance the efficiency of NGS-based disease diagnostics.
Additional Links: PMID-38271481
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38271481,
year = {2024},
author = {Barbitoff, YA and Ushakov, MO and Lazareva, TE and Nasykhova, YA and Glotov, AS and Predeus, AV},
title = {Bioinformatics of germline variant discovery for rare disease diagnostics: current approaches and remaining challenges.},
journal = {Briefings in bioinformatics},
volume = {25},
number = {2},
pages = {},
doi = {10.1093/bib/bbad508},
pmid = {38271481},
issn = {1477-4054},
support = {075-15-2021-1058//Ministry of Science and Higher Education of Russian Federation/ ; },
abstract = {Next-generation sequencing (NGS) has revolutionized the field of rare disease diagnostics. Whole exome and whole genome sequencing are now routinely used for diagnostic purposes; however, the overall diagnosis rate remains lower than expected. In this work, we review current approaches used for calling and interpretation of germline genetic variants in the human genome, and discuss the most important challenges that persist in the bioinformatic analysis of NGS data in medical genetics. We describe and attempt to quantitatively assess the remaining problems, such as the quality of the reference genome sequence, reproducible coverage biases, or variant calling accuracy in complex regions of the genome. We also discuss the prospects of switching to the complete human genome assembly or the human pan-genome and important caveats associated with such a switch. We touch on arguably the hardest problem of NGS data analysis for medical genomics, namely, the annotation of genetic variants and their subsequent interpretation. We highlight the most challenging aspects of annotation and prioritization of both coding and non-coding variants. Finally, we demonstrate the persistent prevalence of pathogenic variants in the coding genome, and outline research directions that may enhance the efficiency of NGS-based disease diagnostics.},
}
RevDate: 2024-01-25
Genomics empowering conservation action and improvement of celery in the face of climate change.
Planta, 259(2):42.
Integration of genomic approaches like whole genome sequencing, functional genomics, evolutionary genomics, and CRISPR/Cas9-based genome editing has accelerated the improvement of crop plants including leafy vegetables like celery in the face of climate change. The anthropogenic climate change is a real peril to the existence of life forms on our planet, including human and plant life. Climate change is predicted to be a significant threat to biodiversity and food security in the coming decades and is rapidly transforming global farming systems. To avoid the ghastly future in the face of climate change, the elucidation of shifts in the geographical range of plant species, species adaptation, and evolution is necessary for plant scientists to develop climate-resilient strategies. In the post-genomics era, the increasing availability of genomic resources and integration of multifaceted genomics elements is empowering biodiversity conservation action, restoration efforts, and identification of genomic regions adaptive to climate change. Genomics has accelerated the true characterization of crop wild relatives, genomic variations, and the development of climate-resilient varieties to ensure food security for 10 billion people by 2050. In this review, we have summarized the applications of multifaceted genomic tools, like conservation genomics, whole genome sequencing, functional genomics, genome editing, pangenomics, in the conservation and adaptation of plant species with a focus on celery, an aromatic and medicinal Apiaceae vegetable. We focus on how conservation scientists can utilize genomics and genomic data in conservation and improvement.
Additional Links: PMID-38270699
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38270699,
year = {2024},
author = {Singh, S and Singh, R and Priyadarsini, S and Ola, AL},
title = {Genomics empowering conservation action and improvement of celery in the face of climate change.},
journal = {Planta},
volume = {259},
number = {2},
pages = {42},
pmid = {38270699},
issn = {1432-2048},
abstract = {Integration of genomic approaches like whole genome sequencing, functional genomics, evolutionary genomics, and CRISPR/Cas9-based genome editing has accelerated the improvement of crop plants including leafy vegetables like celery in the face of climate change. The anthropogenic climate change is a real peril to the existence of life forms on our planet, including human and plant life. Climate change is predicted to be a significant threat to biodiversity and food security in the coming decades and is rapidly transforming global farming systems. To avoid the ghastly future in the face of climate change, the elucidation of shifts in the geographical range of plant species, species adaptation, and evolution is necessary for plant scientists to develop climate-resilient strategies. In the post-genomics era, the increasing availability of genomic resources and integration of multifaceted genomics elements is empowering biodiversity conservation action, restoration efforts, and identification of genomic regions adaptive to climate change. Genomics has accelerated the true characterization of crop wild relatives, genomic variations, and the development of climate-resilient varieties to ensure food security for 10 billion people by 2050. In this review, we have summarized the applications of multifaceted genomic tools, like conservation genomics, whole genome sequencing, functional genomics, genome editing, pangenomics, in the conservation and adaptation of plant species with a focus on celery, an aromatic and medicinal Apiaceae vegetable. We focus on how conservation scientists can utilize genomics and genomic data in conservation and improvement.},
}
RevDate: 2024-01-24
Genomic and phenotypic analysis of invasive Streptococcus suis isolated in Spain reveals genetic diversification and associated virulence traits.
Veterinary research, 55(1):11.
Streptococcus suis is a zoonotic pathogen that causes a major health problem in the pig production industry worldwide. Spain is one of the largest pig producers in the world. This work aimed to investigate the genetic and phenotypic features of invasive S. suis isolates recovered in Spain. A panel of 156 clinical isolates recovered from 13 Autonomous Communities, representing the major pig producers, were analysed. MLST and serotyping analysis revealed that most isolates (61.6%) were assigned to ST1 (26.3%), ST123 (18.6%), ST29 (9.6%), and ST3 (7.1%). Interestingly, 34 new STs were identified, indicating the emergence of novel genetic lineages. Serotypes 9 (27.6%) and 1 (21.8%) prevailed, followed by serotypes 7 (12.8%) and 2 (12.2%). Analysis of 13 virulence-associated genes showed significant associations between ST, serotype, virulence patterns, and clinical features, evidencing particular virulence traits associated with genetic clusters. The pangenome was generated, and the core genome was distributed in 7 Bayesian groups where each group included a variable set of over- and under-represented genes of different categories. The study provides comprehensive data and knowledge to improve the design of new vaccines, antimicrobial treatments, and bacterial typing approaches.
Additional Links: PMID-38268053
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38268053,
year = {2024},
author = {Uruén, C and Fernandez, A and Arnal, JL and Del Pozo, M and Amoribieta, MC and de Blas, I and Jurado, P and Calvo, JH and Gottschalk, M and González-Vázquez, LD and Arenas, M and Marín, CM and Arenas, J},
title = {Genomic and phenotypic analysis of invasive Streptococcus suis isolated in Spain reveals genetic diversification and associated virulence traits.},
journal = {Veterinary research},
volume = {55},
number = {1},
pages = {11},
pmid = {38268053},
issn = {1297-9716},
support = {PID2020-114617RB-100//Ministerio de Ciencia e Innovación/Agencia Española de investigación/ ; LMP58_21//Departamento de Educación, Cultura y Deporte, Gobierno de Aragón/ ; },
abstract = {Streptococcus suis is a zoonotic pathogen that causes a major health problem in the pig production industry worldwide. Spain is one of the largest pig producers in the world. This work aimed to investigate the genetic and phenotypic features of invasive S. suis isolates recovered in Spain. A panel of 156 clinical isolates recovered from 13 Autonomous Communities, representing the major pig producers, were analysed. MLST and serotyping analysis revealed that most isolates (61.6%) were assigned to ST1 (26.3%), ST123 (18.6%), ST29 (9.6%), and ST3 (7.1%). Interestingly, 34 new STs were identified, indicating the emergence of novel genetic lineages. Serotypes 9 (27.6%) and 1 (21.8%) prevailed, followed by serotypes 7 (12.8%) and 2 (12.2%). Analysis of 13 virulence-associated genes showed significant associations between ST, serotype, virulence patterns, and clinical features, evidencing particular virulence traits associated with genetic clusters. The pangenome was generated, and the core genome was distributed in 7 Bayesian groups where each group included a variable set of over- and under-represented genes of different categories. The study provides comprehensive data and knowledge to improve the design of new vaccines, antimicrobial treatments, and bacterial typing approaches.},
}
RevDate: 2024-01-24
Halomonas citrativorans sp. nov., Halomonas casei sp. nov. and Halomonas colorata sp. nov., isolated from French cheese rinds.
International journal of systematic and evolutionary microbiology, 74(1):.
Eight Gram-stain-negative bacterial strains were isolated from cheese rinds sampled in France. On the basis of 16S rRNA gene sequence analysis, all isolates were assigned to the genus Halomonas. Phylogenetic investigations, including 16S rRNA gene studies, multilocus sequence analysis, reconstruction of a pan-genome phylogenetic tree with the concatenated core-genome content and average nucleotide identity (ANI) calculations, revealed that they constituted three novel and well-supported clusters. The closest relative species, determined using the whole-genome sequences of the strains, were Halomonas zhanjiangensis for two groups of cheese strains, sharing 82.4 and 93.1 % ANI, and another cluster sharing 92.2 % ANI with the Halomonas profundi type strain. The strains isolated herein differed from the previously described species by ANI values <95 % and several biochemical, enzymatic and colony characteristics. The results of phenotypic, phylogenetic and chemotaxonomic analyses indicated that the isolates belonged to three novel Halomonas species, for which the names Halomonas citrativorans sp. nov., Halomonas casei sp. nov. and Halomonas colorata sp. nov. are proposed, with isolates FME63[T] (=DSM 113315[T]=CIRM-BIA2430[T]=CIP 111880[T]=LMG 32013[T]), FME64[T] (=DSM 113316[T]=CIRM-BIA2431[T]=CIP 111877[T]=LMG 32015[T]) and FME66[T] (=DSM 113318[T]=CIRM-BIA2433[T]=CIP 111876[T]=LMG 32014[T]) as type strains, respectively.
Additional Links: PMID-38265421
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38265421,
year = {2024},
author = {Kothe, CI and Monnet, C and Irlinger, F and Virsolvy, M and Frühling, A and Neumann-Schaal, M and Wolf, J and Renault, P},
title = {Halomonas citrativorans sp. nov., Halomonas casei sp. nov. and Halomonas colorata sp. nov., isolated from French cheese rinds.},
journal = {International journal of systematic and evolutionary microbiology},
volume = {74},
number = {1},
pages = {},
doi = {10.1099/ijsem.0.006234},
pmid = {38265421},
issn = {1466-5034},
abstract = {Eight Gram-stain-negative bacterial strains were isolated from cheese rinds sampled in France. On the basis of 16S rRNA gene sequence analysis, all isolates were assigned to the genus Halomonas. Phylogenetic investigations, including 16S rRNA gene studies, multilocus sequence analysis, reconstruction of a pan-genome phylogenetic tree with the concatenated core-genome content and average nucleotide identity (ANI) calculations, revealed that they constituted three novel and well-supported clusters. The closest relative species, determined using the whole-genome sequences of the strains, were Halomonas zhanjiangensis for two groups of cheese strains, sharing 82.4 and 93.1 % ANI, and another cluster sharing 92.2 % ANI with the Halomonas profundi type strain. The strains isolated herein differed from the previously described species by ANI values <95 % and several biochemical, enzymatic and colony characteristics. The results of phenotypic, phylogenetic and chemotaxonomic analyses indicated that the isolates belonged to three novel Halomonas species, for which the names Halomonas citrativorans sp. nov., Halomonas casei sp. nov. and Halomonas colorata sp. nov. are proposed, with isolates FME63[T] (=DSM 113315[T]=CIRM-BIA2430[T]=CIP 111880[T]=LMG 32013[T]), FME64[T] (=DSM 113316[T]=CIRM-BIA2431[T]=CIP 111877[T]=LMG 32015[T]) and FME66[T] (=DSM 113318[T]=CIRM-BIA2433[T]=CIP 111876[T]=LMG 32014[T]) as type strains, respectively.},
}
RevDate: 2024-01-23
Translation variation across genetic backgrounds reveals a post-transcriptional buffering signature in yeast.
Nucleic acids research pii:7585675 [Epub ahead of print].
Gene expression is known to vary among individuals, and this variability can impact the phenotypic diversity observed in natural populations. While the transcriptome and proteome have been extensively studied, little is known about the translation process itself. Here, we therefore performed ribosome and transcriptomic profiling on a genetically and ecologically diverse set of natural isolates of the Saccharomyces cerevisiae yeast. Interestingly, we found that the Euclidean distances between each profile and the expression fold changes in each pairwise isolate comparison were higher at the transcriptomic level. This observation clearly indicates that the transcriptional variation observed in the different isolates is buffered through a phenomenon known as post-transcriptional buffering at the translation level. Furthermore, this phenomenon seemed to have a specific signature by preferentially affecting essential genes as well as genes involved in complex-forming proteins, and low transcribed genes. We also explored the translation of the S. cerevisiae pangenome and found that the accessory genes related to introgression events displayed similar transcription and translation levels as the core genome. By contrast, genes acquired through horizontal gene transfer events tended to be less efficiently translated. Together, our results highlight both the extent and signature of the post-transcriptional buffering.
Additional Links: PMID-38261993
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38261993,
year = {2024},
author = {Teyssonniere, EM and Shichino, Y and Mito, M and Friedrich, A and Iwasaki, S and Schacherer, J},
title = {Translation variation across genetic backgrounds reveals a post-transcriptional buffering signature in yeast.},
journal = {Nucleic acids research},
volume = {},
number = {},
pages = {},
doi = {10.1093/nar/gkae030},
pmid = {38261993},
issn = {1362-4962},
support = {772505/ERC_/European Research Council/International ; S10 OD018174/CD/ODCDC CDC HHS/United States ; },
abstract = {Gene expression is known to vary among individuals, and this variability can impact the phenotypic diversity observed in natural populations. While the transcriptome and proteome have been extensively studied, little is known about the translation process itself. Here, we therefore performed ribosome and transcriptomic profiling on a genetically and ecologically diverse set of natural isolates of the Saccharomyces cerevisiae yeast. Interestingly, we found that the Euclidean distances between each profile and the expression fold changes in each pairwise isolate comparison were higher at the transcriptomic level. This observation clearly indicates that the transcriptional variation observed in the different isolates is buffered through a phenomenon known as post-transcriptional buffering at the translation level. Furthermore, this phenomenon seemed to have a specific signature by preferentially affecting essential genes as well as genes involved in complex-forming proteins, and low transcribed genes. We also explored the translation of the S. cerevisiae pangenome and found that the accessory genes related to introgression events displayed similar transcription and translation levels as the core genome. By contrast, genes acquired through horizontal gene transfer events tended to be less efficiently translated. Together, our results highlight both the extent and signature of the post-transcriptional buffering.},
}
RevDate: 2024-01-23
Pangenome reconstruction in rats enhances genotype-phenotype mapping and novel variant discovery.
bioRxiv : the preprint server for biology pii:2024.01.10.575041.
The HXB/BXH family of recombinant inbred rat strains is a unique genetic resource that has been extensively phenotyped over 25 years, resulting in a vast dataset of quantitative molecular and physiological phenotypes. We built a pangenome graph from 10x Genomics linked-read data for 31 recombinant inbred rats to study genetic variation and association mapping. The pangenome length was on average 2.4 times greater than the corresponding length of the reference mRatBN7.2, confirming the capture of substantial additional variation. We validated variants in challenging regions, including complex structural variants resolving into multiple haplotypes. Phenome-wide association analysis of validated SNPs uncovered variants associated with glucose/insulin levels and hippocampal gene expression. We propose an interaction between Pirl1l1, Cromogranine expression, TNF-α levels, and insulin regulation. This study demonstrates the utility of linked-read pangenomes for comprehensive variant detection and mapping phenotypic diversity in a widely used rat genetic reference panel.
Additional Links: PMID-38260597
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38260597,
year = {2024},
author = {Villani, F and Guarracino, A and Ward, RR and Green, T and Emms, M and Pravenec, M and Prins, P and Garrison, E and Williams, RW and Chen, H and Colonna, V},
title = {Pangenome reconstruction in rats enhances genotype-phenotype mapping and novel variant discovery.},
journal = {bioRxiv : the preprint server for biology},
volume = {},
number = {},
pages = {},
doi = {10.1101/2024.01.10.575041},
pmid = {38260597},
abstract = {The HXB/BXH family of recombinant inbred rat strains is a unique genetic resource that has been extensively phenotyped over 25 years, resulting in a vast dataset of quantitative molecular and physiological phenotypes. We built a pangenome graph from 10x Genomics linked-read data for 31 recombinant inbred rats to study genetic variation and association mapping. The pangenome length was on average 2.4 times greater than the corresponding length of the reference mRatBN7.2, confirming the capture of substantial additional variation. We validated variants in challenging regions, including complex structural variants resolving into multiple haplotypes. Phenome-wide association analysis of validated SNPs uncovered variants associated with glucose/insulin levels and hippocampal gene expression. We propose an interaction between Pirl1l1, Cromogranine expression, TNF-α levels, and insulin regulation. This study demonstrates the utility of linked-read pangenomes for comprehensive variant detection and mapping phenotypic diversity in a widely used rat genetic reference panel.},
}
RevDate: 2024-01-23
Global genetic diversity and Asian clades evolution: a phylogeographic study of Staphylococcus aureus sequence type 5.
Antimicrobial agents and chemotherapy [Epub ahead of print].
Staphylococcus aureus sequence type (ST) 5 has spread worldwide; however, phylogeographic studies on the evolution of global phylogenetic and Asian clades of ST5 are lacking. This study included 368 ST5 genome sequences, including 111 newly generated sequences. Primary phylogenetic analysis suggested that there are five clades, and geographical clustering of ST5 methicillin-resistant S. aureus (MRSA) was linked to the acquisition of S. aureus pathogenicity islands (SaPIs; enterotoxin gene island) and integration of the prophage φSa3. The most recent common ancestor of global S. aureus ST5 dates back to the mid-1940s, coinciding with the clinical introduction of penicillin. Bayesian phylogeographic inference allowed to ancestrally trace the Asian ST5 MRSA clade to Japan, which may have spread to major cities in China and Korea in the 1990s. Based on a pan-genome-wide association study, the emergence of Asian ST5 clades was attributed to the gain of prophages, SaPIs, and plasmids, as well as the coevolution of resistance genes. Clade IV displayed greater genomic diversity than the Asian MRSA clades. Collectively, our study provides in-depth insights into the global evolution of S. aureus ST5 mainly in China and the United States and reveals that different S. aureus ST5 clades have arisen independently in different parts of the world, with limited geographic dispersal across continents.
Additional Links: PMID-38259089
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38259089,
year = {2024},
author = {Chen, F and Yin, Y and Chen, H and Wang, R and Wang, S and Wang, H},
title = {Global genetic diversity and Asian clades evolution: a phylogeographic study of Staphylococcus aureus sequence type 5.},
journal = {Antimicrobial agents and chemotherapy},
volume = {},
number = {},
pages = {e0117523},
doi = {10.1128/aac.01175-23},
pmid = {38259089},
issn = {1098-6596},
abstract = {Staphylococcus aureus sequence type (ST) 5 has spread worldwide; however, phylogeographic studies on the evolution of global phylogenetic and Asian clades of ST5 are lacking. This study included 368 ST5 genome sequences, including 111 newly generated sequences. Primary phylogenetic analysis suggested that there are five clades, and geographical clustering of ST5 methicillin-resistant S. aureus (MRSA) was linked to the acquisition of S. aureus pathogenicity islands (SaPIs; enterotoxin gene island) and integration of the prophage φSa3. The most recent common ancestor of global S. aureus ST5 dates back to the mid-1940s, coinciding with the clinical introduction of penicillin. Bayesian phylogeographic inference allowed to ancestrally trace the Asian ST5 MRSA clade to Japan, which may have spread to major cities in China and Korea in the 1990s. Based on a pan-genome-wide association study, the emergence of Asian ST5 clades was attributed to the gain of prophages, SaPIs, and plasmids, as well as the coevolution of resistance genes. Clade IV displayed greater genomic diversity than the Asian MRSA clades. Collectively, our study provides in-depth insights into the global evolution of S. aureus ST5 mainly in China and the United States and reveals that different S. aureus ST5 clades have arisen independently in different parts of the world, with limited geographic dispersal across continents.},
}
RevDate: 2024-01-23
Biotechnological Key Genes of the Rhodococcus erythropolis MGMM8 Genome: Genes for Bioremediation, Antibiotics, Plant Protection, and Growth Stimulation.
Microorganisms, 12(1): pii:microorganisms12010088.
Anthropogenic pollution, including residues from the green revolution initially aimed at addressing food security and healthcare, has paradoxically exacerbated environmental challenges. The transition towards comprehensive green biotechnology and bioremediation, achieved with lower financial investment, hinges on microbial biotechnology, with the Rhodococcus genus emerging as a promising contender. The significance of fully annotating genome sequences lies in comprehending strain constituents, devising experimental protocols, and strategically deploying these strains to address pertinent issues using pivotal genes. This study revolves around Rhodococcus erythropolis MGMM8, an associate of winter wheat plants in the rhizosphere. Through the annotation of its chromosomal genome and subsequent comparison with other strains, its potential applications were explored. Using the antiSMASH server, 19 gene clusters were predicted, encompassing genes responsible for antibiotics and siderophores. Antibiotic resistance evaluation via the Comprehensive Antibiotic Resistance Database (CARD) identified five genes (vanW, vanY, RbpA, iri, and folC) that were parallel to strain CCM2595. Leveraging the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) for biodegradation, heavy metal resistance, and remediation genes, the presence of chlorimuron-ethyl, formaldehyde, benzene-desulfurization degradation genes, and heavy metal-related genes (ACR3, arsC, corA, DsbA, modA, and recG) in MGMM8 was confirmed. Furthermore, quorum-quenching signal genes, critical for curbing biofilm formation and virulence elicited by quorum-sensing in pathogens, were also discerned within MGMM8's genome. In light of these predictions, the novel isolate MGMM8 warrants phenotypic assessment to gauge its potential in biocontrol and bioremediation. This evaluation extends to isolating active compounds for potential antimicrobial activities against pathogenic microorganisms. The comprehensive genome annotation process has facilitated the genetic characterization of MGMM8 and has solidified its potential as a biotechnological strain to address global anthropogenic predicaments.
Additional Links: PMID-38257915
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38257915,
year = {2023},
author = {Afordoanyi, DM and Akosah, YA and Shnakhova, L and Saparmyradov, K and Diabankana, RGC and Validov, S},
title = {Biotechnological Key Genes of the Rhodococcus erythropolis MGMM8 Genome: Genes for Bioremediation, Antibiotics, Plant Protection, and Growth Stimulation.},
journal = {Microorganisms},
volume = {12},
number = {1},
pages = {},
doi = {10.3390/microorganisms12010088},
pmid = {38257915},
issn = {2076-2607},
support = {RF-1930.61321X0001/15.IP.21.0020//Ministry of Education 362 and Science of the Russian Federation/ ; },
abstract = {Anthropogenic pollution, including residues from the green revolution initially aimed at addressing food security and healthcare, has paradoxically exacerbated environmental challenges. The transition towards comprehensive green biotechnology and bioremediation, achieved with lower financial investment, hinges on microbial biotechnology, with the Rhodococcus genus emerging as a promising contender. The significance of fully annotating genome sequences lies in comprehending strain constituents, devising experimental protocols, and strategically deploying these strains to address pertinent issues using pivotal genes. This study revolves around Rhodococcus erythropolis MGMM8, an associate of winter wheat plants in the rhizosphere. Through the annotation of its chromosomal genome and subsequent comparison with other strains, its potential applications were explored. Using the antiSMASH server, 19 gene clusters were predicted, encompassing genes responsible for antibiotics and siderophores. Antibiotic resistance evaluation via the Comprehensive Antibiotic Resistance Database (CARD) identified five genes (vanW, vanY, RbpA, iri, and folC) that were parallel to strain CCM2595. Leveraging the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) for biodegradation, heavy metal resistance, and remediation genes, the presence of chlorimuron-ethyl, formaldehyde, benzene-desulfurization degradation genes, and heavy metal-related genes (ACR3, arsC, corA, DsbA, modA, and recG) in MGMM8 was confirmed. Furthermore, quorum-quenching signal genes, critical for curbing biofilm formation and virulence elicited by quorum-sensing in pathogens, were also discerned within MGMM8's genome. In light of these predictions, the novel isolate MGMM8 warrants phenotypic assessment to gauge its potential in biocontrol and bioremediation. This evaluation extends to isolating active compounds for potential antimicrobial activities against pathogenic microorganisms. The comprehensive genome annotation process has facilitated the genetic characterization of MGMM8 and has solidified its potential as a biotechnological strain to address global anthropogenic predicaments.},
}
RevDate: 2024-01-23
Genomics of Re-Emergent Aeromonas salmonicida in Atlantic Salmon Outbreaks.
Microorganisms, 12(1): pii:microorganisms12010064.
Furunculosis, caused by Aeromonas salmonicida, poses a significant threat to both salmonid and non-salmonid fish in diverse aquatic environments. This study explores the genomic intricacies of re-emergent A. salmonicida outbreaks in Atlantic salmon (Salmo salar). Previous clinical cases have exhibited pathological characteristics, such as periorbital hemorrhages and gastrointestinal abnormalities. Genomic sequencing of three Chilean isolates (ASA04, ASA05, and CIBA_5017) and 25 previously described genomes determined the pan-genome, phylogenomics, insertion sequences, and restriction-modification systems. Unique gene families have contributed to an improved understanding of the psychrophilic and mesophilic clades, while phylogenomic analysis has been used to identify mesophilic and psychrophilic strains, thereby further differentiating between typical and atypical psychrophilic isolates. Diverse insertion sequences and restriction-modification patterns have highlighted genomic structural differences, and virulence factor predictions can emphasize exotoxin disparities, especially between psychrophilic and mesophilic strains. Thus, a novel plasmid was characterized which emphasized the role of plasmids in virulence and antibiotic resistance. The analysis of antibiotic resistance factors revealed resistance against various drug classes in Chilean strains. Overall, this study elucidates the genomic dynamics of re-emergent A. salmonicida and provides novel insights into their virulence, antibiotic resistance, and population structure.
Additional Links: PMID-38257891
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38257891,
year = {2023},
author = {Godoy, M and Montes de Oca, M and Suarez, R and Martinez, A and Pontigo, JP and Caro, D and Kusch, K and Coca, Y and Bohle, H and Bayliss, S and Kibenge, M and Kibenge, F},
title = {Genomics of Re-Emergent Aeromonas salmonicida in Atlantic Salmon Outbreaks.},
journal = {Microorganisms},
volume = {12},
number = {1},
pages = {},
doi = {10.3390/microorganisms12010064},
pmid = {38257891},
issn = {2076-2607},
abstract = {Furunculosis, caused by Aeromonas salmonicida, poses a significant threat to both salmonid and non-salmonid fish in diverse aquatic environments. This study explores the genomic intricacies of re-emergent A. salmonicida outbreaks in Atlantic salmon (Salmo salar). Previous clinical cases have exhibited pathological characteristics, such as periorbital hemorrhages and gastrointestinal abnormalities. Genomic sequencing of three Chilean isolates (ASA04, ASA05, and CIBA_5017) and 25 previously described genomes determined the pan-genome, phylogenomics, insertion sequences, and restriction-modification systems. Unique gene families have contributed to an improved understanding of the psychrophilic and mesophilic clades, while phylogenomic analysis has been used to identify mesophilic and psychrophilic strains, thereby further differentiating between typical and atypical psychrophilic isolates. Diverse insertion sequences and restriction-modification patterns have highlighted genomic structural differences, and virulence factor predictions can emphasize exotoxin disparities, especially between psychrophilic and mesophilic strains. Thus, a novel plasmid was characterized which emphasized the role of plasmids in virulence and antibiotic resistance. The analysis of antibiotic resistance factors revealed resistance against various drug classes in Chilean strains. Overall, this study elucidates the genomic dynamics of re-emergent A. salmonicida and provides novel insights into their virulence, antibiotic resistance, and population structure.},
}
RevDate: 2024-01-23
Fulgor: a fast and compact k-mer index for large-scale matching and color queries.
Algorithms for molecular biology : AMB, 19(1):3.
The problem of sequence identification or matching-determining the subset of reference sequences from a given collection that are likely to contain a short, queried nucleotide sequence-is relevant for many important tasks in Computational Biology, such as metagenomics and pangenome analysis. Due to the complex nature of such analyses and the large scale of the reference collections a resource-efficient solution to this problem is of utmost importance. This poses the threefold challenge of representing the reference collection with a data structure that is efficient to query, has light memory usage, and scales well to large collections. To solve this problem, we describe an efficient colored de Bruijn graph index, arising as the combination of a k-mer dictionary with a compressed inverted index. The proposed index takes full advantage of the fact that unitigs in the colored compacted de Bruijn graph are monochromatic (i.e., all k-mers in a unitig have the same set of references of origin, or color). Specifically, the unitigs are kept in the dictionary in color order, thereby allowing for the encoding of the map from k-mers to their colors in as little as 1 + o(1) bits per unitig. Hence, one color per unitig is stored in the index with almost no space/time overhead. By combining this property with simple but effective compression methods for integer lists, the index achieves very small space. We implement these methods in a tool called Fulgor, and conduct an extensive experimental analysis to demonstrate the improvement of our tool over previous solutions. For example, compared to Themisto-the strongest competitor in terms of index space vs. query time trade-off-Fulgor requires significantly less space (up to 43% less space for a collection of 150,000 Salmonella enterica genomes), is at least twice as fast for color queries, and is 2-6[Formula: see text] faster to construct.
Additional Links: PMID-38254124
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38254124,
year = {2024},
author = {Fan, J and Khan, J and Singh, NP and Pibiri, GE and Patro, R},
title = {Fulgor: a fast and compact k-mer index for large-scale matching and color queries.},
journal = {Algorithms for molecular biology : AMB},
volume = {19},
number = {1},
pages = {3},
pmid = {38254124},
issn = {1748-7188},
support = {R01HG009937/NH/NIH HHS/United States ; },
abstract = {The problem of sequence identification or matching-determining the subset of reference sequences from a given collection that are likely to contain a short, queried nucleotide sequence-is relevant for many important tasks in Computational Biology, such as metagenomics and pangenome analysis. Due to the complex nature of such analyses and the large scale of the reference collections a resource-efficient solution to this problem is of utmost importance. This poses the threefold challenge of representing the reference collection with a data structure that is efficient to query, has light memory usage, and scales well to large collections. To solve this problem, we describe an efficient colored de Bruijn graph index, arising as the combination of a k-mer dictionary with a compressed inverted index. The proposed index takes full advantage of the fact that unitigs in the colored compacted de Bruijn graph are monochromatic (i.e., all k-mers in a unitig have the same set of references of origin, or color). Specifically, the unitigs are kept in the dictionary in color order, thereby allowing for the encoding of the map from k-mers to their colors in as little as 1 + o(1) bits per unitig. Hence, one color per unitig is stored in the index with almost no space/time overhead. By combining this property with simple but effective compression methods for integer lists, the index achieves very small space. We implement these methods in a tool called Fulgor, and conduct an extensive experimental analysis to demonstrate the improvement of our tool over previous solutions. For example, compared to Themisto-the strongest competitor in terms of index space vs. query time trade-off-Fulgor requires significantly less space (up to 43% less space for a collection of 150,000 Salmonella enterica genomes), is at least twice as fast for color queries, and is 2-6[Formula: see text] faster to construct.},
}
RevDate: 2024-01-22
Pangenome graphs improve the analysis of structural variants in rare genetic diseases.
Nature communications, 15(1):657.
Rare DNA alterations that cause heritable diseases are only partially resolvable by clinical next-generation sequencing due to the difficulty of detecting structural variation (SV) in all genomic contexts. Long-read, high fidelity genome sequencing (HiFi-GS) detects SVs with increased sensitivity and enables assembling personal and graph genomes. We leverage standard reference genomes, public assemblies (n = 94) and a large collection of HiFi-GS data from a rare disease program (Genomic Answers for Kids, GA4K, n = 574 assemblies) to build a graph genome representing a unified SV callset in GA4K, identify common variation and prioritize SVs that are more likely to cause genetic disease (MAF < 0.01). Using graphs, we obtain a higher level of reproducibility than the standard reference approach. We observe over 200,000 SV alleles unique to GA4K, including nearly 1000 rare variants that impact coding sequence. With improved specificity for rare SVs, we isolate 30 candidate SVs in phenotypically prioritized genes, including known disease SVs. We isolate a novel diagnostic SV in KMT2E, demonstrating use of personal assemblies coupled with pangenome graphs for rare disease genomics. The community may interrogate our pangenome with additional assemblies to discover new SVs within the allele frequency spectrum relevant to genetic diseases.
Additional Links: PMID-38253606
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38253606,
year = {2024},
author = {Groza, C and Schwendinger-Schreck, C and Cheung, WA and Farrow, EG and Thiffault, I and Lake, J and Rizzo, WB and Evrony, G and Curran, T and Bourque, G and Pastinen, T},
title = {Pangenome graphs improve the analysis of structural variants in rare genetic diseases.},
journal = {Nature communications},
volume = {15},
number = {1},
pages = {657},
pmid = {38253606},
issn = {2041-1723},
abstract = {Rare DNA alterations that cause heritable diseases are only partially resolvable by clinical next-generation sequencing due to the difficulty of detecting structural variation (SV) in all genomic contexts. Long-read, high fidelity genome sequencing (HiFi-GS) detects SVs with increased sensitivity and enables assembling personal and graph genomes. We leverage standard reference genomes, public assemblies (n = 94) and a large collection of HiFi-GS data from a rare disease program (Genomic Answers for Kids, GA4K, n = 574 assemblies) to build a graph genome representing a unified SV callset in GA4K, identify common variation and prioritize SVs that are more likely to cause genetic disease (MAF < 0.01). Using graphs, we obtain a higher level of reproducibility than the standard reference approach. We observe over 200,000 SV alleles unique to GA4K, including nearly 1000 rare variants that impact coding sequence. With improved specificity for rare SVs, we isolate 30 candidate SVs in phenotypically prioritized genes, including known disease SVs. We isolate a novel diagnostic SV in KMT2E, demonstrating use of personal assemblies coupled with pangenome graphs for rare disease genomics. The community may interrogate our pangenome with additional assemblies to discover new SVs within the allele frequency spectrum relevant to genetic diseases.},
}
RevDate: 2024-01-22
Description of Mycolicibacterium arenosum sp. nov. Isolated from Coastal Sand on the Yellow Sea Coast.
Current microbiology, 81(3):73.
A Gram-staining-positive, aerobic, non-spore-forming bacterium was isolated from coastal sand samples from Incheon in the Republic of Korea and designated as strain CAU 1645[T]. The optimum conditions for growth were observed at 30 °C in growth media containing 1% (w/v) NaCl at pH 9.0. The predominant respiratory quinone was MK-9 and the major fatty acids were C16:0, C17:1 w7c, and summed feature 7. Similarly, the 16S rRNA gene sequence exhibited the highest similarity with Mycolicibacterium bacteremicum DSM 45578[T] and Mycolicibacterium neoaurum JCM 6365[T], both of which exhibited similarity rates of 97.2%. The genomic DNA G+C content was 68.2%. The whole genome of strain CAU 1645[T] was obtained and annotated with annotation using RAST server. The pan-genome analysis was determined using Prokka, Roary, and Phandango. In the pan-genome analysis, the strain CAU 1645[T] shared 40 core genes with closely related Mycolicibacterium species, including the AcpM gene, the meromycolate extension acyl carrier protein involved in forming impermeable cell walls in mycobacteria. Therefore, our findings demonstrated that the isolate represents a novel species of the genus Mycolicibacterium, for which we propose the name Mycolicibacterium arenosum sp. nov. The type strain is CAU 1645[T] (= KCTC 49724[T] = MCCC 1K07087[T]).
Additional Links: PMID-38253726
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38253726,
year = {2024},
author = {Jeong, J and Ahn, S and Truong, TC and Kim, JH and Weerawongwiwat, V and Lee, JS and Yoon, JH and Sukhoom, A and Kim, W},
title = {Description of Mycolicibacterium arenosum sp. nov. Isolated from Coastal Sand on the Yellow Sea Coast.},
journal = {Current microbiology},
volume = {81},
number = {3},
pages = {73},
pmid = {38253726},
issn = {1432-0991},
support = {NIBR202102205//National Institute of Biological Resources/ ; NRF-2021R1C1C2003223//National Research Foundation of Korea/ ; 2017//Chung-Ang University/ ; },
abstract = {A Gram-staining-positive, aerobic, non-spore-forming bacterium was isolated from coastal sand samples from Incheon in the Republic of Korea and designated as strain CAU 1645[T]. The optimum conditions for growth were observed at 30 °C in growth media containing 1% (w/v) NaCl at pH 9.0. The predominant respiratory quinone was MK-9 and the major fatty acids were C16:0, C17:1 w7c, and summed feature 7. Similarly, the 16S rRNA gene sequence exhibited the highest similarity with Mycolicibacterium bacteremicum DSM 45578[T] and Mycolicibacterium neoaurum JCM 6365[T], both of which exhibited similarity rates of 97.2%. The genomic DNA G+C content was 68.2%. The whole genome of strain CAU 1645[T] was obtained and annotated with annotation using RAST server. The pan-genome analysis was determined using Prokka, Roary, and Phandango. In the pan-genome analysis, the strain CAU 1645[T] shared 40 core genes with closely related Mycolicibacterium species, including the AcpM gene, the meromycolate extension acyl carrier protein involved in forming impermeable cell walls in mycobacteria. Therefore, our findings demonstrated that the isolate represents a novel species of the genus Mycolicibacterium, for which we propose the name Mycolicibacterium arenosum sp. nov. The type strain is CAU 1645[T] (= KCTC 49724[T] = MCCC 1K07087[T]).},
}
RevDate: 2024-01-22
Corrigendum: Pangenome analysis of the genus Herbiconiux and proposal of four new species associated with Chinese medicinal plants.
Frontiers in microbiology, 14:1295710.
[This corrects the article DOI: 10.3389/fmicb.2023.1119226.].
Additional Links: PMID-38249481
Full Text:
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38249481,
year = {2023},
author = {Deng, Y and Jiang, ZM and Han, XF and Su, J and Yu, LY and Liu, WH and Zhang, YQ},
title = {Corrigendum: Pangenome analysis of the genus Herbiconiux and proposal of four new species associated with Chinese medicinal plants.},
journal = {Frontiers in microbiology},
volume = {14},
number = {},
pages = {1295710},
doi = {10.3389/fmicb.2023.1295710},
pmid = {38249481},
issn = {1664-302X},
abstract = {[This corrects the article DOI: 10.3389/fmicb.2023.1119226.].},
}
RevDate: 2024-01-21
Development of a SNP-based strain-identified method for Streptococcus thermophilus CICC 6038 and Lactobacillus delbrueckii ssp. bulgaricus CICC 6047 using pan-genomics analysis.
Journal of dairy science pii:S0022-0302(24)00014-6 [Epub ahead of print].
The health benefits conferred by probiotics is specific to individual probiotic strains, highlighting the importance of identifying specific strains for research and production purposes. Streptococcus thermophilus CICC 6038 and Lactobacillus delbrueckii ssp. bulgaricus CICC 6047 are exceedingly valuable for commercial use with an excellent mixed-culture fermentation. To differentiate these 2 strains from other S. thermophilus and L. delbrueckii ssp. bulgaricus, a specific, sensitive, accurate, rapid, convenient, and cost-effective method is required. In this study, we conducted a pan-genome analysis of S. thermophilus and L. delbrueckii ssp. bulgaricus to identify species-specific core genes, along with strain-specific single-nucleotide polymorphisms (SNPs). These genes were used to develop suitable PCR primers, and the conformity of sequence length and unique SNPs was confirmed by sequencing for qualitative identification at the strain level. The results demonstrated that SNPs analysis of PCR products derived from these primers could distinguish CICC 6038 and CICC 6047 accurately and reproducibly from the other strains of S. thermophilus and L. delbrueckii ssp. bulgaricus, respectively. The strain-specific PCR method based on SNPs herein is universally applicable for probiotics identification. It offers valuable insights into identifying probiotics at the strain level that is fit-for-purpose in quality control and compliance assessment of commercial dairy products.
Additional Links: PMID-38246550
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38246550,
year = {2024},
author = {Song, Z and Ge, Y and Yu, X and Liu, R and Liu, C and Cheng, K and Guo, L and Yao, S},
title = {Development of a SNP-based strain-identified method for Streptococcus thermophilus CICC 6038 and Lactobacillus delbrueckii ssp. bulgaricus CICC 6047 using pan-genomics analysis.},
journal = {Journal of dairy science},
volume = {},
number = {},
pages = {},
doi = {10.3168/jds.2023-23655},
pmid = {38246550},
issn = {1525-3198},
abstract = {The health benefits conferred by probiotics is specific to individual probiotic strains, highlighting the importance of identifying specific strains for research and production purposes. Streptococcus thermophilus CICC 6038 and Lactobacillus delbrueckii ssp. bulgaricus CICC 6047 are exceedingly valuable for commercial use with an excellent mixed-culture fermentation. To differentiate these 2 strains from other S. thermophilus and L. delbrueckii ssp. bulgaricus, a specific, sensitive, accurate, rapid, convenient, and cost-effective method is required. In this study, we conducted a pan-genome analysis of S. thermophilus and L. delbrueckii ssp. bulgaricus to identify species-specific core genes, along with strain-specific single-nucleotide polymorphisms (SNPs). These genes were used to develop suitable PCR primers, and the conformity of sequence length and unique SNPs was confirmed by sequencing for qualitative identification at the strain level. The results demonstrated that SNPs analysis of PCR products derived from these primers could distinguish CICC 6038 and CICC 6047 accurately and reproducibly from the other strains of S. thermophilus and L. delbrueckii ssp. bulgaricus, respectively. The strain-specific PCR method based on SNPs herein is universally applicable for probiotics identification. It offers valuable insights into identifying probiotics at the strain level that is fit-for-purpose in quality control and compliance assessment of commercial dairy products.},
}
RevDate: 2024-01-18
High genetic diversity and different type VI secretion systems in Enterobacter species revealed by comparative genomics analysis.
BMC microbiology, 24(1):26.
The human-pathogenic Enterobacter species are widely distributed in diverse environmental conditions, however, the understanding of the virulence factors and genetic variations within the genus is very limited. In this study, we performed comparative genomics analysis of 49 strains originated from diverse niches and belonged to eight Enterobacter species, in order to further understand the mechanism of adaption to the environment in Enterobacter. The results showed that they had an open pan-genome and high genomic diversity which allowed adaptation to distinctive ecological niches. We found the number of secretion systems was the highest among various virulence factors in these Enterobacter strains. Three types of T6SS gene clusters including T6SS-A, T6SS-B and T6SS-C were detected in most Enterobacter strains. T6SS-A and T6SS-B shared 13 specific core genes, but they had different gene structures, suggesting they probably have different biological functions. Notably, T6SS-C was restricted to E. cancerogenus. We detected a T6SS gene cluster, highly similar to T6SS-C (91.2%), in the remote related Citrobacter rodenitum, suggesting that this unique gene cluster was probably acquired by horizontal gene transfer. The genomes of Enterobacter strains possess high genetic diversity, limited number of conserved core genes, and multiple copies of T6SS gene clusters with differentiated structures, suggesting that the origins of T6SS were not by duplication instead by independent acquisition. These findings provide valuable information for better understanding of the functional features of Enterobacter species and their evolutionary relationships.
Additional Links: PMID-38238664
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38238664,
year = {2024},
author = {Peng, M and Lin, W and Zhou, A and Jiang, Z and Zhou, F and Wang, Z},
title = {High genetic diversity and different type VI secretion systems in Enterobacter species revealed by comparative genomics analysis.},
journal = {BMC microbiology},
volume = {24},
number = {1},
pages = {26},
pmid = {38238664},
issn = {1471-2180},
support = {32200094//National Natural Science Foundation of China/ ; PT012201//Hubei Key Laboratory of Biological Resources Protection and Utilization (Hubei Minzu University)/ ; 2022CFB674//Natural Science Foundation of Hubei Province/ ; },
abstract = {The human-pathogenic Enterobacter species are widely distributed in diverse environmental conditions, however, the understanding of the virulence factors and genetic variations within the genus is very limited. In this study, we performed comparative genomics analysis of 49 strains originated from diverse niches and belonged to eight Enterobacter species, in order to further understand the mechanism of adaption to the environment in Enterobacter. The results showed that they had an open pan-genome and high genomic diversity which allowed adaptation to distinctive ecological niches. We found the number of secretion systems was the highest among various virulence factors in these Enterobacter strains. Three types of T6SS gene clusters including T6SS-A, T6SS-B and T6SS-C were detected in most Enterobacter strains. T6SS-A and T6SS-B shared 13 specific core genes, but they had different gene structures, suggesting they probably have different biological functions. Notably, T6SS-C was restricted to E. cancerogenus. We detected a T6SS gene cluster, highly similar to T6SS-C (91.2%), in the remote related Citrobacter rodenitum, suggesting that this unique gene cluster was probably acquired by horizontal gene transfer. The genomes of Enterobacter strains possess high genetic diversity, limited number of conserved core genes, and multiple copies of T6SS gene clusters with differentiated structures, suggesting that the origins of T6SS were not by duplication instead by independent acquisition. These findings provide valuable information for better understanding of the functional features of Enterobacter species and their evolutionary relationships.},
}
RevDate: 2024-01-17
Diversification of gene content in the Mycobacterium tuberculosis complex is determined by phylogenetic and ecological signatures.
Microbiology spectrum [Epub ahead of print].
In this study, we analyzed the gene content of different ecotypes of the Mycobacterium tuberculosis complex (MTBC), the pathogens of tuberculosis. We found that changes in their gene content are associated with their ecological features, such as host preference. Gene loss was identified as the primary driver of these changes, which can vary even among different strains of the same ecotype. Our study also revealed that the gene content relatedness of these bacteria does not always mirror their evolutionary relationships. In addition, some genes of virulence can be variably lost among strains of the same MTBC ecotype, likely helping them to evade the immune system. Overall, our study highlights the importance of understanding how gene loss can lead to new adaptations in these bacteria and how different selective pressures may influence their genetic makeup.
Additional Links: PMID-38230932
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38230932,
year = {2024},
author = {Silva-Pereira, TT and Soler-Camargo, NC and Guimarães, AMS},
title = {Diversification of gene content in the Mycobacterium tuberculosis complex is determined by phylogenetic and ecological signatures.},
journal = {Microbiology spectrum},
volume = {},
number = {},
pages = {e0228923},
doi = {10.1128/spectrum.02289-23},
pmid = {38230932},
issn = {2165-0497},
abstract = {In this study, we analyzed the gene content of different ecotypes of the Mycobacterium tuberculosis complex (MTBC), the pathogens of tuberculosis. We found that changes in their gene content are associated with their ecological features, such as host preference. Gene loss was identified as the primary driver of these changes, which can vary even among different strains of the same ecotype. Our study also revealed that the gene content relatedness of these bacteria does not always mirror their evolutionary relationships. In addition, some genes of virulence can be variably lost among strains of the same MTBC ecotype, likely helping them to evade the immune system. Overall, our study highlights the importance of understanding how gene loss can lead to new adaptations in these bacteria and how different selective pressures may influence their genetic makeup.},
}
RevDate: 2024-01-17
Ecological significance of Candidatus ARS69 and Gemmatimonadota in the Arctic glacier foreland ecosystems.
Applied microbiology and biotechnology, 108(1):128.
The Gemmatimonadota phylum has been widely detected in diverse natural environments, yet their specific ecological roles in many habitats remain poorly investigated. Similarly, the Candidatus ARS69 phylum has been identified only in a few habitats, and literature on their metabolic functions is relatively scarce. In the present study, we investigated the ecological significance of phyla Ca. ARS69 and Gemmatimonadota in the Arctic glacier foreland (GF) ecosystems through genome-resolved metagenomics. We have reconstructed the first high-quality metagenome-assembled genome (MAG) belonging to Ca. ARS69 and 12 other MAGs belonging to phylum Gemmatimonadota from the three different Arctic GF samples. We further elucidated these two groups phylogenetic lineage and their metabolic function through phylogenomic and pangenomic analysis. The analysis showed that all the reconstructed MAGs potentially belonged to novel species. The MAGs belonged to Ca. ARS69 consist about 8296 gene clusters, of which only about 8% of single-copy core genes (n = 980) were shared among them. The study also revealed the potential ecological role of Ca. ARS69 is associated with carbon fixation, denitrification, sulfite oxidation, and reduction biochemical processes in the GF ecosystems. Similarly, the study demonstrates the widespread distribution of different classes of Gemmatimonadota across wide ranges of ecosystems and their metabolic functions, including in the polar region. KEY POINTS: • Glacier foreland ecosystems act as a natural laboratory to study microbial community structure. • We have reconstructed 13 metagenome-assembled genomes from the soil samples. • All the reconstructed MAGs belonged to novel species with different metabolic processes. • Ca. ARS69 and Gemmatimonadota MAGs were found to participate in carbon fixation and denitrification processes.
Additional Links: PMID-38229335
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38229335,
year = {2024},
author = {Venkatachalam, S and Jabir, T and Vipindas, PV and Krishnan, KP},
title = {Ecological significance of Candidatus ARS69 and Gemmatimonadota in the Arctic glacier foreland ecosystems.},
journal = {Applied microbiology and biotechnology},
volume = {108},
number = {1},
pages = {128},
pmid = {38229335},
issn = {1432-0614},
abstract = {The Gemmatimonadota phylum has been widely detected in diverse natural environments, yet their specific ecological roles in many habitats remain poorly investigated. Similarly, the Candidatus ARS69 phylum has been identified only in a few habitats, and literature on their metabolic functions is relatively scarce. In the present study, we investigated the ecological significance of phyla Ca. ARS69 and Gemmatimonadota in the Arctic glacier foreland (GF) ecosystems through genome-resolved metagenomics. We have reconstructed the first high-quality metagenome-assembled genome (MAG) belonging to Ca. ARS69 and 12 other MAGs belonging to phylum Gemmatimonadota from the three different Arctic GF samples. We further elucidated these two groups phylogenetic lineage and their metabolic function through phylogenomic and pangenomic analysis. The analysis showed that all the reconstructed MAGs potentially belonged to novel species. The MAGs belonged to Ca. ARS69 consist about 8296 gene clusters, of which only about 8% of single-copy core genes (n = 980) were shared among them. The study also revealed the potential ecological role of Ca. ARS69 is associated with carbon fixation, denitrification, sulfite oxidation, and reduction biochemical processes in the GF ecosystems. Similarly, the study demonstrates the widespread distribution of different classes of Gemmatimonadota across wide ranges of ecosystems and their metabolic functions, including in the polar region. KEY POINTS: • Glacier foreland ecosystems act as a natural laboratory to study microbial community structure. • We have reconstructed 13 metagenome-assembled genomes from the soil samples. • All the reconstructed MAGs belonged to novel species with different metabolic processes. • Ca. ARS69 and Gemmatimonadota MAGs were found to participate in carbon fixation and denitrification processes.},
}
RevDate: 2024-01-15
Comparative pangenome analysis of Aspergillus flavus and Aspergillus oryzae reveals their phylogenetic, genomic, and metabolic homogeneity.
Food microbiology, 119:104435.
Aspergillus flavus and Aspergillus oryzae are closely related fungal species with contrasting roles in food safety and fermentation. To comprehensively investigate their phylogenetic, genomic, and metabolic characteristics, we conducted an extensive comparative pangenome analysis using complete, dereplicated genome sets for both species. Phylogenetic analyses, employing both the entirety of the identified single-copy orthologous genes and six housekeeping genes commonly used for fungal classification, did not reveal clear differentiation between A. flavus and A. oryzae genomes. Upon analyzing the aflatoxin biosynthesis gene clusters within the genomes, we observed that non-aflatoxin-producing strains were dispersed throughout the phylogenetic tree, encompassing both A. flavus and A. oryzae strains. This suggests that aflatoxin production is not a distinguishing trait between the two species. Furthermore, A. oryzae and A. flavus strains displayed remarkably similar genomic attributes, including genome sizes, gene contents, and G + C contents, as well as metabolic features and pathways. The profiles of CAZyme genes and secondary metabolite biosynthesis gene clusters within the genomes of both species further highlight their similarity. Collectively, these findings challenge the conventional differentiation of A. flavus and A. oryzae as distinct species and highlight their phylogenetic, genomic, and metabolic homogeneity, potentially indicating that they may indeed belong to the same species.
Additional Links: PMID-38225047
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38225047,
year = {2024},
author = {Han, DM and Baek, JH and Choi, DG and Jeon, MS and Eyun, SI and Jeon, CO},
title = {Comparative pangenome analysis of Aspergillus flavus and Aspergillus oryzae reveals their phylogenetic, genomic, and metabolic homogeneity.},
journal = {Food microbiology},
volume = {119},
number = {},
pages = {104435},
doi = {10.1016/j.fm.2023.104435},
pmid = {38225047},
issn = {1095-9998},
abstract = {Aspergillus flavus and Aspergillus oryzae are closely related fungal species with contrasting roles in food safety and fermentation. To comprehensively investigate their phylogenetic, genomic, and metabolic characteristics, we conducted an extensive comparative pangenome analysis using complete, dereplicated genome sets for both species. Phylogenetic analyses, employing both the entirety of the identified single-copy orthologous genes and six housekeeping genes commonly used for fungal classification, did not reveal clear differentiation between A. flavus and A. oryzae genomes. Upon analyzing the aflatoxin biosynthesis gene clusters within the genomes, we observed that non-aflatoxin-producing strains were dispersed throughout the phylogenetic tree, encompassing both A. flavus and A. oryzae strains. This suggests that aflatoxin production is not a distinguishing trait between the two species. Furthermore, A. oryzae and A. flavus strains displayed remarkably similar genomic attributes, including genome sizes, gene contents, and G + C contents, as well as metabolic features and pathways. The profiles of CAZyme genes and secondary metabolite biosynthesis gene clusters within the genomes of both species further highlight their similarity. Collectively, these findings challenge the conventional differentiation of A. flavus and A. oryzae as distinct species and highlight their phylogenetic, genomic, and metabolic homogeneity, potentially indicating that they may indeed belong to the same species.},
}
RevDate: 2024-01-15
Genome-based analyses to learn from and about Paenibacillus sonchi genomovar Riograndensis SBR5T.
Genetics and molecular biology, 46(3 Suppl 1):e20230115 pii:S1415-47572023000600113.
Paenibacillus sonchi genomovar Riograndensis SBR5T is a plant growth-promoting rhizobacterium (PGPR) isolated in the Brazilian state of Rio Grande do Sul from the rhizosphere of Triticum aestivum. It fixes nitrogen, produces siderophores as well as the phytohormone indole-3-acetic acid, solubilizes phosphate and displays antagonist activity against Listeria monocytogenes and Pectobacterium carotovorum. Comprehensive omics analysis and the development of genetic tools are key to characterizing and engineering such non-model microorganisms. Therefore, the complete genome of SBR5T was sequenced, and shown to encode 6,705 proteins, 87 tRNAs, and 27 rRNAs and it enabled a landscape transcriptome analysis that unveiled conserved transcriptional and translational patterns and characterized operon structures and riboswitches. The pangenome of P. sonchi species is open with a stable core pangenome. At the same time, the analysis of genes coding for nitrogenases revealed that the trait of nitrogen fixation is sparse within the Paenibacillaceae family and the presence of Fe-only nitrogenase in the P. sonchi group was exclusive to SBR5T. The development of genetic tools for SBR5T enabled genetic transformation, plasmid construction for constitutive and inducible gene expression, and gene repression using the CRISPRi system. Altogether, the work with P. sonchi can guide the study of non-model bacteria with economic potential.
Additional Links: PMID-38224489
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38224489,
year = {2024},
author = {Wendisch, VF and Brito, LF and Passaglia, LMP},
title = {Genome-based analyses to learn from and about Paenibacillus sonchi genomovar Riograndensis SBR5T.},
journal = {Genetics and molecular biology},
volume = {46},
number = {3 Suppl 1},
pages = {e20230115},
doi = {10.1590/1678-4685-GMB-2023-0115},
pmid = {38224489},
issn = {1415-4757},
abstract = {Paenibacillus sonchi genomovar Riograndensis SBR5T is a plant growth-promoting rhizobacterium (PGPR) isolated in the Brazilian state of Rio Grande do Sul from the rhizosphere of Triticum aestivum. It fixes nitrogen, produces siderophores as well as the phytohormone indole-3-acetic acid, solubilizes phosphate and displays antagonist activity against Listeria monocytogenes and Pectobacterium carotovorum. Comprehensive omics analysis and the development of genetic tools are key to characterizing and engineering such non-model microorganisms. Therefore, the complete genome of SBR5T was sequenced, and shown to encode 6,705 proteins, 87 tRNAs, and 27 rRNAs and it enabled a landscape transcriptome analysis that unveiled conserved transcriptional and translational patterns and characterized operon structures and riboswitches. The pangenome of P. sonchi species is open with a stable core pangenome. At the same time, the analysis of genes coding for nitrogenases revealed that the trait of nitrogen fixation is sparse within the Paenibacillaceae family and the presence of Fe-only nitrogenase in the P. sonchi group was exclusive to SBR5T. The development of genetic tools for SBR5T enabled genetic transformation, plasmid construction for constitutive and inducible gene expression, and gene repression using the CRISPRi system. Altogether, the work with P. sonchi can guide the study of non-model bacteria with economic potential.},
}
RevDate: 2024-01-13
Novel vaccine candidates of Bordetella pertussis identified by reverse vaccinology.
Biologicals : journal of the International Association of Biological Standardization, 85:101740 pii:S1045-1056(23)00079-9 [Epub ahead of print].
Whooping cough is a disease caused by Bordetella pertussis, whose morbidity has increased, motivating the improvement of current vaccines. Reverse vaccinology is a strategy that helps identify proteins with good characteristics fast and with fewer resources. In this work, we applied reverse vaccinology to study the B. pertussis proteome and pangenome with several in-silico tools. We analyzed the B. pertussis Tohama I proteome with NERVE software and compared 234 proteins with B. parapertussis, B. bronchiseptica, and B. holmessi. VaxiJen was used to calculate an antigenicity value; our threshold was 0.6, selecting 84 proteins. The candidates were depurated and grouped in eight family proteins to select representative candidates, according to bibliographic information and their immunological response predicted with ABCpred, Bcepred, IgPred, and C-ImmSim. Additionally, a pangenome study was conducted with 603 B. pertussis strains and PanRV software, identifying 3421 core proteins that were analyzed to select the best candidates. Finally, we selected 15 proteins from the proteome study and seven proteins from the pangenome analysis as good vaccine candidates.
Additional Links: PMID-38217963
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38217963,
year = {2024},
author = {Monterrubio-López, GP and Llamas-Monroy, JL and Martínez-Gómez, ÁA and Delgadillo-Gutiérrez, K},
title = {Novel vaccine candidates of Bordetella pertussis identified by reverse vaccinology.},
journal = {Biologicals : journal of the International Association of Biological Standardization},
volume = {85},
number = {},
pages = {101740},
doi = {10.1016/j.biologicals.2023.101740},
pmid = {38217963},
issn = {1095-8320},
abstract = {Whooping cough is a disease caused by Bordetella pertussis, whose morbidity has increased, motivating the improvement of current vaccines. Reverse vaccinology is a strategy that helps identify proteins with good characteristics fast and with fewer resources. In this work, we applied reverse vaccinology to study the B. pertussis proteome and pangenome with several in-silico tools. We analyzed the B. pertussis Tohama I proteome with NERVE software and compared 234 proteins with B. parapertussis, B. bronchiseptica, and B. holmessi. VaxiJen was used to calculate an antigenicity value; our threshold was 0.6, selecting 84 proteins. The candidates were depurated and grouped in eight family proteins to select representative candidates, according to bibliographic information and their immunological response predicted with ABCpred, Bcepred, IgPred, and C-ImmSim. Additionally, a pangenome study was conducted with 603 B. pertussis strains and PanRV software, identifying 3421 core proteins that were analyzed to select the best candidates. Finally, we selected 15 proteins from the proteome study and seven proteins from the pangenome analysis as good vaccine candidates.},
}
RevDate: 2024-01-12
Genome-wide association study reveals serovar-associated genetic loci in Riemerella anatipestifer.
BMC genomics, 25(1):57.
BACKGROUND: The disease caused by Riemerella anatipestifer (R. anatipestifer, RA) results in large economic losses to the global duck industry every year. Serovar-related genomic variation, such as the O-antigen and capsular polysaccharide (CPS) gene clusters, has been widely used for serotyping in many gram-negative bacteria. RA has been classified into at least 21 serovars based on slide agglutination, but the molecular basis of serotyping is unknown. In this study, we performed a pan-genome-wide association study (Pan-GWAS) to identify the genetic loci associated with RA serovars.
RESULTS: The results revealed a significant association between the putative CPS synthesis gene locus and the serological phenotype. Further characterization of the CPS gene clusters in 11 representative serovar strains indicated that they were highly diverse and serovar-specific. The CPS gene cluster contained the key genes wzx and wzy, which are involved in the Wzx/Wzy-dependent pathway of CPS synthesis. Similar CPS loci have been found in some other species within the family Weeksellaceae. We have also shown that deletion of the wzy gene in RA results in capsular defects and cross-agglutination.
CONCLUSIONS: This study indicates that the CPS synthesis gene cluster of R. anatipestifer is a serotype-specific genetic locus. Importantly, our finding provides a new perspective for the systematic analysis of the genetic basis of the R anatipestifer serovars and a potential target for establishing a complete molecular serotyping scheme.
Additional Links: PMID-38216873
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38216873,
year = {2024},
author = {Yang, Z and Yang, X and Wang, M and Jia, R and Chen, S and Liu, M and Zhao, X and Yang, Q and Wu, Y and Zhang, S and Huang, J and Ou, X and Mao, S and Gao, Q and Sun, D and Tian, B and Zhu, D and Cheng, A},
title = {Genome-wide association study reveals serovar-associated genetic loci in Riemerella anatipestifer.},
journal = {BMC genomics},
volume = {25},
number = {1},
pages = {57},
pmid = {38216873},
issn = {1471-2164},
abstract = {BACKGROUND: The disease caused by Riemerella anatipestifer (R. anatipestifer, RA) results in large economic losses to the global duck industry every year. Serovar-related genomic variation, such as the O-antigen and capsular polysaccharide (CPS) gene clusters, has been widely used for serotyping in many gram-negative bacteria. RA has been classified into at least 21 serovars based on slide agglutination, but the molecular basis of serotyping is unknown. In this study, we performed a pan-genome-wide association study (Pan-GWAS) to identify the genetic loci associated with RA serovars.
RESULTS: The results revealed a significant association between the putative CPS synthesis gene locus and the serological phenotype. Further characterization of the CPS gene clusters in 11 representative serovar strains indicated that they were highly diverse and serovar-specific. The CPS gene cluster contained the key genes wzx and wzy, which are involved in the Wzx/Wzy-dependent pathway of CPS synthesis. Similar CPS loci have been found in some other species within the family Weeksellaceae. We have also shown that deletion of the wzy gene in RA results in capsular defects and cross-agglutination.
CONCLUSIONS: This study indicates that the CPS synthesis gene cluster of R. anatipestifer is a serotype-specific genetic locus. Importantly, our finding provides a new perspective for the systematic analysis of the genetic basis of the R anatipestifer serovars and a potential target for establishing a complete molecular serotyping scheme.},
}
RevDate: 2024-01-12
Genomic resources for a historical collection of cultivated two-row European spring barley genotypes.
Scientific data, 11(1):66.
Barley genomic resources are increasing rapidly, with the publication of a barley pangenome as one of the latest developments. Two-row spring barley cultivars are intensely studied as they are the source of high-quality grain for malting and distilling. Here we provide data from a European two-row spring barley population containing 209 different genotypes registered for the UK market between 1830 to 2014. The dataset encompasses RNA-sequencing data from six different tissues across a range of barley developmental stages, phenotypic datasets from two consecutive years of field-grown trials in the United Kingdom, Germany and the USA; and whole genome shotgun sequencing from all cultivars, which was used to complement the RNA-sequencing data for variant calling. The outcomes are a filtered SNP marker file, a phenotypic database and a large gene expression dataset providing a comprehensive resource which allows for downstream analyses like genome wide association studies or expression associations.
Additional Links: PMID-38216606
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38216606,
year = {2024},
author = {Schreiber, M and Wonneberger, R and Haaning, AM and Coulter, M and Russell, J and Himmelbach, A and Fiebig, A and Muehlbauer, GJ and Stein, N and Waugh, R},
title = {Genomic resources for a historical collection of cultivated two-row European spring barley genotypes.},
journal = {Scientific data},
volume = {11},
number = {1},
pages = {66},
pmid = {38216606},
issn = {2052-4463},
support = {BB/S004610/1//RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)/ ; BB/S004610/1//RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)/ ; BB/S004610/1//RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)/ ; BB/S004610/1//RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)/ ; MU 3589/1-1//Deutsche Forschungsgemeinschaft (German Research Foundation)/ ; MU 3589/1-1//Deutsche Forschungsgemeinschaft (German Research Foundation)/ ; 1844331//National Science Foundation (NSF)/ ; 1844331//National Science Foundation (NSF)/ ; },
abstract = {Barley genomic resources are increasing rapidly, with the publication of a barley pangenome as one of the latest developments. Two-row spring barley cultivars are intensely studied as they are the source of high-quality grain for malting and distilling. Here we provide data from a European two-row spring barley population containing 209 different genotypes registered for the UK market between 1830 to 2014. The dataset encompasses RNA-sequencing data from six different tissues across a range of barley developmental stages, phenotypic datasets from two consecutive years of field-grown trials in the United Kingdom, Germany and the USA; and whole genome shotgun sequencing from all cultivars, which was used to complement the RNA-sequencing data for variant calling. The outcomes are a filtered SNP marker file, a phenotypic database and a large gene expression dataset providing a comprehensive resource which allows for downstream analyses like genome wide association studies or expression associations.},
}
RevDate: 2024-01-12
Cellulomonas alba sp. nov. and Cellulomonas edaphi sp. nov., isolated from wetland soils.
International journal of systematic and evolutionary microbiology, 74(1):.
Two novel strains were isolated from wetland soils in Goyang, Republic of Korea. The two Gram-stain-positive, facultatively anaerobic, rod-shaped bacterial-type strains were designated MW4[T] and MW9[T]. Phylogenomic analysis based on whole-genome sequences suggested that both strains belonged to the genus Cellulomonas. The cells of strain MW4[T] were non-motile and grew at 20-40 °C (optimum, 35 °C), at pH 6.0-10.0 (optimum, pH 8.0) and in the presence of 0-1.0% NaCl (optimum, 0 %). The cells of strain MW9[T] were non-motile and grew at 20-40 °C (optimum, 35 °C), at pH 5.0-9.0 (optimum, pH 8.0) and in the presence of 0-1.0% NaCl (optimum, 0 %). The average nucleotide identity (77.1-88.1 %) and digital DNA-DNA hybridization values (21.0-34.8 %) between the two novel strains and with their closely related strains fell within the range for the genus Cellulomonas. The novel strains MW4[T] and MW9[T] and reference strains possessed alkane synthesis gene clusters (oleA, oleB, oleC and oleD). Phylogenomic, phylogenetic, average nucleotide identity, digital DNA-DNA hybridization, physiological and biochemical data indicated that the novel strains were distinct from other members of the family Cellulomonadaceae. We propose the names Cellulomonas alba sp. nov. (type strain MW4[T]=KACC 23260[T]=TBRC 17645[T]) and Cellulomons edaphi sp. nov. (type strain MW9[T]=KACC 23261[T]=TBRC 17646[T]) for the two strains.
Additional Links: PMID-38214698
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38214698,
year = {2024},
author = {Park, S and Kim, I and Chhetri, G and Jung, Y and Woo, H and Seo, T},
title = {Cellulomonas alba sp. nov. and Cellulomonas edaphi sp. nov., isolated from wetland soils.},
journal = {International journal of systematic and evolutionary microbiology},
volume = {74},
number = {1},
pages = {},
doi = {10.1099/ijsem.0.006235},
pmid = {38214698},
issn = {1466-5034},
abstract = {Two novel strains were isolated from wetland soils in Goyang, Republic of Korea. The two Gram-stain-positive, facultatively anaerobic, rod-shaped bacterial-type strains were designated MW4[T] and MW9[T]. Phylogenomic analysis based on whole-genome sequences suggested that both strains belonged to the genus Cellulomonas. The cells of strain MW4[T] were non-motile and grew at 20-40 °C (optimum, 35 °C), at pH 6.0-10.0 (optimum, pH 8.0) and in the presence of 0-1.0% NaCl (optimum, 0 %). The cells of strain MW9[T] were non-motile and grew at 20-40 °C (optimum, 35 °C), at pH 5.0-9.0 (optimum, pH 8.0) and in the presence of 0-1.0% NaCl (optimum, 0 %). The average nucleotide identity (77.1-88.1 %) and digital DNA-DNA hybridization values (21.0-34.8 %) between the two novel strains and with their closely related strains fell within the range for the genus Cellulomonas. The novel strains MW4[T] and MW9[T] and reference strains possessed alkane synthesis gene clusters (oleA, oleB, oleC and oleD). Phylogenomic, phylogenetic, average nucleotide identity, digital DNA-DNA hybridization, physiological and biochemical data indicated that the novel strains were distinct from other members of the family Cellulomonadaceae. We propose the names Cellulomonas alba sp. nov. (type strain MW4[T]=KACC 23260[T]=TBRC 17645[T]) and Cellulomons edaphi sp. nov. (type strain MW9[T]=KACC 23261[T]=TBRC 17646[T]) for the two strains.},
}
RevDate: 2024-01-12
Oryza CLIMtools: A Genome-Environment Association Resource Reveals Adaptive Roles for Heterotrimeric G Proteins in the Regulation of Rice Agronomic Traits.
Plant communications pii:S2590-3462(24)00033-6 [Epub ahead of print].
Modern crop varieties display a degree of mismatch between their current distributions and the suitability of the local climate for their productivity. To this end, we present Oryza CLIMtools (https://gramene.org/CLIMtools/oryza_v1.0/), the first resource for pan-genome prediction of climate-associated genetic variants in a crop species. Oryza CLIMtools consists of interactive web-based databases that allow the user to: i) explore the local environments of traditional rice varieties (landraces) in South-Eastern Asia, and; ii) investigate the environment by genome associations for 658 Indica and 283 Japonica rice landrace accessions collected from georeferenced local environments and included in the 3K Rice Genomes Project. We exemplify the value of these resources, identifying an interplay between flowering time and temperature in the local environment that is facilitated by adaptive natural variation in OsHD2 and disrupted by a natural variant in OsSOC1. Prior QTL analysis has suggested the importance of heterotrimeric G proteins in the control of agronomic traits. Accordingly, we analyzed the climate associations of natural variants in the different heterotrimeric G protein subunits. We identified a coordinated role of G proteins in adaptation to the prevailing Potential Evapotranspiration gradient and their regulation of key agronomic traits including plant height and seed and panicle length. We conclude by highlighting the prospect of targeting heterotrimeric G proteins to produce crops that are climate resilient.
Additional Links: PMID-38213027
Publisher:
PubMed:
Citation:
show bibtex listing
hide bibtex listing
@article {pmid38213027,
year = {2024},
author = {Ferrero-Serrano, Á and Chakravorty, D and Kirven, KJ and Assmann, SM},
title = {Oryza CLIMtools: A Genome-Environment Association Resource Reveals Adaptive Roles for Heterotrimeric G Proteins in the Regulation of Rice Agronomic Traits.},
journal = {Plant communications},
volume = {},
number = {},
pages = {100813},
doi = {10.1016/j.xplc.2024.100813},
pmid = {38213027},
issn = {2590-3462},
abstract = {Modern crop varieties display a degree of mismatch between their current distributions and the suitability of the local climate for their productivity. To this end, we present Oryza CLIMtools (https://gramene.org/CLIMtools/oryza_v1.0/), the first resource for pan-genome prediction of climate-associated genetic variants in a crop species. Oryza CLIMtools consists of interactive web-based databases that allow the user to: i) explore the local environments of traditional rice varieties (landraces) in South-Eastern Asia, and; ii) investigate the environment by genome associations for 658 Indica and 283 Japonica rice landrace accessions collected from georeferenced local environments and included in the 3K Rice Genomes Project. We exemplify the value of these resources, identifying an interplay between flowering time and temperature in the local environment that is facilitated by adaptive natural variation in OsHD2 and disrupted by a natural variant in OsSOC1. Prior QTL analysis has suggested the importance of heterotrimeric G proteins in the control of agronomic traits. Accordingly, we analyzed the climate associations of natural variants in the different heterotrimeric G protein subunits. We identified a coordinated role of G proteins in adaptation to the prevailing Potential Evapotranspiration gradient and their regulation of key agronomic traits including plant height and seed and panicle length. We conclude by highlighting the prospect of targeting heterotrimeric G proteins to produce crops that are climate resilient.},
}
▼ ▼ LOAD NEXT 100 CITATIONS
RJR Experience and Expertise
Researcher
Robbins holds BS, MS, and PhD degrees in the life sciences. He served as a tenured faculty member in the Zoology and Biological Science departments at Michigan State University. He is currently exploring the intersection between genomics, microbial ecology, and biodiversity — an area that promises to transform our understanding of the biosphere.
Educator
Robbins has extensive experience in college-level education: At MSU he taught introductory biology, genetics, and population genetics. At JHU, he was an instructor for a special course on biological database design. At FHCRC, he team-taught a graduate-level course on the history of genetics. At Bellevue College he taught medical informatics.
Administrator
Robbins has been involved in science administration at both the federal and the institutional levels. At NSF he was a program officer for database activities in the life sciences, at DOE he was a program officer for information infrastructure in the human genome project. At the Fred Hutchinson Cancer Research Center, he served as a vice president for fifteen years.
Technologist
Robbins has been involved with information technology since writing his first Fortran program as a college student. At NSF he was the first program officer for database activities in the life sciences. At JHU he held an appointment in the CS department and served as director of the informatics core for the Genome Data Base. At the FHCRC he was VP for Information Technology.
Publisher
While still at Michigan State, Robbins started his first publishing venture, founding a small company that addressed the short-run publishing needs of instructors in very large undergraduate classes. For more than 20 years, Robbins has been operating The Electronic Scholarly Publishing Project, a web site dedicated to the digital publishing of critical works in science, especially classical genetics.
Speaker
Robbins is well-known for his speaking abilities and is often called upon to provide keynote or plenary addresses at international meetings. For example, in July, 2012, he gave a well-received keynote address at the Global Biodiversity Informatics Congress, sponsored by GBIF and held in Copenhagen. The slides from that talk can be seen HERE.
Facilitator
Robbins is a skilled meeting facilitator. He prefers a participatory approach, with part of the meeting involving dynamic breakout groups, created by the participants in real time: (1) individuals propose breakout groups; (2) everyone signs up for one (or more) groups; (3) the groups with the most interested parties then meet, with reports from each group presented and discussed in a subsequent plenary session.
Designer
Robbins has been engaged with photography and design since the 1960s, when he worked for a professional photography laboratory. He now prefers digital photography and tools for their precision and reproducibility. He designed his first web site more than 20 years ago and he personally designed and implemented this web site. He engages in graphic design as a hobby.
RJR Picks from Around the Web (updated 11 MAY 2018 )
Old Science
Weird Science
Treating Disease with Fecal Transplantation
Fossils of miniature humans (hobbits) discovered in Indonesia
Paleontology
Dinosaur tail, complete with feathers, found preserved in amber.
Astronomy
Mysterious fast radio burst (FRB) detected in the distant universe.
Big Data & Informatics
Big Data: Buzzword or Big Deal?
Hacking the genome: Identifying anonymized human subjects using publicly available data.