©The Author(s) 2021. Published by Baishideng Publishing Group Inc. All rights reserved.
Artif Intell Gastroenterol. Aug 28, 2021; 2(4): 105-110
Published online Aug 28, 2021. doi: 10.35712/aig.v2.i4.105
Application of artificial intelligence in microbiome study promotes precision medicine for gastric cancer
Zhi-Ming Li, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, Hubei Province, China
Zhi-Ming Li, Xuan Zhuang, Department of Urology, The First Affiliated Hospital of Xiamen University, Xiamen 361003, Fujian Province, China
Xuan Zhuang, Department of Clinical Medicine, Fujian Medical University, Fuzhou 350122, Fujian Province, China
Author contributions: Li ZM conceptualized the paper; Li ZM and Zhuang X wrote the paper; all authors read and approved the final manuscript.
Supported by Health Commission of Hubei Province Scientific Research Project, No. WJ2021Q023.
Conflict-of-interest statement: The authors declare that they have no competing interests to disclose.
: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/Licenses/by-nc/4.0/
Corresponding author: Zhi-Ming Li, PhD, Academic Fellow, Tongji Medical College, Huazhong University of Science and Technology, Baofeng Street, Qiaokou District, Wuhan 430030, Hubei Province, China. firstname.lastname@example.org
Received: March 16, 2021
Peer-review started: March 16, 2021
First decision: April 15, 2021
Revised: April 22, 2021
Accepted: July 9, 2021
Article in press: July 9, 2021
Published online: August 28, 2021
Gastric cancer (GC, also known as stomach cancer) is the second leading cause of cancer-related mortality globally, with over 70000 new cases diagnosed every year. The 5-year survival rate of GC is lower than 15%, even in the United States. According to Lauren's criteria, GC can be classified into two main types: Diffuse and intestinal. The diffuse type usually appears in younger patients and tends to be more aggressive, whereas the intestinal type is usually found in older patients and is caused by chronic infection with Helicobacter pylori (H. pylori). The microbiota in the stomach is extremely rich and complex. DNA sequencing and computational methods are making astounding advances in the identification of conserved ribosomal RNA (rRNA) genes for pathogenic microorganisms. More than 100 phylotypes have been uncovered in humans, and the majority of gastric microbiota falls within five phyla, including Bacteroidetes, Firmicutes, Proteobacteria, Actinobacteria, and Fusobacteria. H. pylori belongs to Proteobacteria. H. pylori infection triggers multistep progression from chronic gastritis, atrophic gastritis, and intestinal metaplasia to carcinoma finally. However, the issue of how the gastric microbiota interplays with H. pylori (namely, does the gastric microbiota lead to a more virulent H. pylori or, vice versa, does H. pylori facilitate the carcinogenesis of the microbiota?) is still not clear. This might have implications for clinical management.
Artificial intelligence (AI) is the simulation of human intelligence processes by computers and has been applied in various fields, such as image processing and natural language processing. AI is playing an increasingly important role in healthcare. It has been demonstrated that AI algorithms can support humans in simplifying the multidimensional, complex metagenomic data of gene profiling and elucidating the peculiar signatures of beneficial microbes in the gastrointestinal tract. As a core branch of AI, machine learning (ML) focuses on building mathematical models that help machines make predictions or decisions without being explicitly programmed. In the field of ML, deep learning (DL) has become the dominant approach for ongoing work with big data. DL, a subset of ML, is inspired by the information processing system discovered in the human brain. DL uses numerous layers of algorithms (artificial neural networks) to extract higher-level features from raw input. Briefly, ML is a core branch of AI, and DL is performed to implement ML. ML and DL have been successfully used to predict the risk of GC.
AI MAKES ACCURATE PREDICTIONS WITH BIG DATA AND THE GASTRIC MICROBIOME
Gastroenterology is a ﬁeld where AI can make a signiﬁcant difference. Traditional diagnostic methods have insufficient resolution ability to estimate the invasion depth of early GC in the clinic. Thus, over one-third of advanced GC cases with lesions around the cardia are not easily detected by image-based methods. However, AI-assisted image analysis using endoscopic detection can make more accurate assessments and provide more details than conventional analysis. There are still two main limitations in AI-assisted image analysis. First, there are relatively few data serving as learning and testing materials for building DL models. Second, the diagnostic accuracy is greatly affected when low-resolution images, which endoscopists usually encounter in clinical practice, are input. The above two points may cause certain defects in medical decisions based on image analysis. Remarkably, the combination of AI and the microbiome shows great potential in precision medicine for GC.
High-throughput sequencing is becoming a common technology for typing microbial isolates, especially in clinical samples. Many gene mutations, transcriptional differences, translational differences, epigenetic variations, and metabolic changes have been identified as being associated with the heterogeneity and stage of GC. High-throughput sequencing generates massive microbial data. A deep understanding of microbial data is helpful to explain the relationship between microbes and diseases. Virulence among H. pylori strains and host genetic polymorphisms contribute to GC susceptibility. AI algorithms effectively improve our understanding of the gastric microbiota due to two major advantages. First, AI methods can be applied to extract microbial genomic DNA from sequencing samples. Second, AI methods can simultaneously examine all genes in all organisms contained in a sample. Combined with other parameters, such as food habits, duration of infection, and physical activity, AI algorithms can provide better health advice to GC patients. A recent study has started to explore the ability of DL to treat diseases related to gut dysbiosis based on the individual’s microbiome pattern. In the future, researchers can develop AI algorithms to regulate the individual’s dietary intake and plan their meals when we fully understand the microbiome differences between people with and without disease (Figure 1).
Figure 1 Introducing artificial intelligence and microbiome study to precision medicine for gastric cancer
. The sequencing profiles of individual patient microbiomes are analyzed by artificial intelligence (AI), which helps patients to be classified into sub-groups. At the molecular level, AI reveals the molecular mechanisms of microbe-host interactions. At the individual level, AI allows gastric cancer patients to be treated with effective drugs, such as supplementing commensal bacteria, engineered bacteria, and microbiome-targeted drugs.
AI IDENTIFIES LOW ABUNDANCE MICROBES USING SEQUENCING DATA
Studying the microbiome composition of primary samples provides a chance to understand the role of pathogenic microorganisms in disease development. In the late 2000s, two large-scale international human microbiome projects (HMPs), Metagenomics of the Human Intestinal Tract and the HMP, were initiated to study microorganisms in the human body and to develop computational methods that analyze sequenced metagenomes. However, it seems challenging due to the low number of microbial DNA relative to the host DNA. Accurate identification of the microbiome requires the removal of all possible sequencing reads that originate from human DNA. Bacterial identification was commonly completed by characterization of uniform genomic coverage. For example, the sequence identity of 16S rRNA gene fragments greater than 97% can be classified into separate operational taxonomic units (OTUs), which means the phylogenetic boundaries of different bacterial species. Bacterial identification can also be completed based on coverage along a narrow region of their genomes. For example, analysis of amplicon sequence variants improves the sensitivity and specificity and decreases the problem of inflated microbiota datasets due to falsely identified OTUs originating from misclustered sequences. Recently, Lupolova et al found that ML algorithms made a good attribution of the host sources of S. enterica serovar Typhimurium isolates. The combination of 16S rRNA gene sequencing data and AI algorithms may reveal the essential role of low-abundance bacteria in the alteration of the gut microbiota composition.
It is challenging to quantify and characterize microbiome profiling in samples where the bacterial content is relatively low. The microbial community in the stomach is typically restricted by the lower luminal pH, which selects for acid-resistant bacterial populations and usually limits the colonization densities to < 1000 colony-forming units per gram (CFU/g). The current approach for detecting the bacteria of fecal or environmental samples cannot be directly used to analyze the microbiome from the upper gastrointestinal tract, such as the stomach. This is partly because the high amount of human DNA in the samples confounds microbial identification. Klein et al designed a DL algorithm that can be used to detect H. pylori on regular whole slide images of gastric biopsies, achieving a sensitivity of 100%. Detecting the low abundance bacteria without sample processing facilitates the establishment of a rapid diagnostic method. Recently, we designed magnetic nanoparticles with a broad range of capture potentials via electrostatic attractions. This system can rapidly and efficiently capture bacteria at a low concentration of 10 CFU/mL within 1 h. The capture efficiency was more than 90%. It can be used to evaluate the microbiome profile of gastric biopsies in future studies.
AI UNCOVERS HOST-MICROBIOME INTERACTIONS
A comparative study of GC and chronic gastritis using an approach targeting the 16S rRNA gene of mucosal biopsies showed that bacterial diversity was decreased in GC patients. Patients with GC had a large number of non-Helicobacter Proteobacteria. Colonization with bacteria other than H. pylori breaks the balance between the resident gastric microbiota and the host, which may increase the risk for H. pylori-related cancer. Another study evaluated the microbiota composition in normal, peritumoral, and tumoral tissues by 16S rRNA gene profiling and found that microbial diversity was significantly reduced in peritumoral and tumoral microhabitats. H. pylori, Prevotella copri, and Bacteroides uniformis were relatively less abundant in the tumoral microhabitat, whereas Prevotella melaninogenica, Streptococcus anginosus, and Propionibacterium acnes were more abundant. The authors proposed the hypothesis that chronic atrophic gastritis with atrophy (the acidity of the microenvironment of the stomach is reduced) was attributed to H. pylori substitution by a cancer-prone microbiota. Additionally, the same research team found a close relationship between the subtype of immune cells (regulatory T cells and plasmacytoid dendritic cells) and gastric microbiota dysbiosis within the tumor microenvironment. It is already known that H. pylori infection functions in the development of precancerous lesions, such as chronic gastritis. Nevertheless, the dramatic changes in the composition of the stomach microbiome play a more direct role in the later stages of cancer. Moreover, the microbiome affects the therapeutic response of GC patients, and the treatment also impacts microbial composition. Distal gastrectomy impacts postoperative gut microbiota composition, leading to higher abundances of Escherichia, Shigella, Veillonella, and Clostridium XVIII and a lower abundance of Bacteroides. Immune checkpoint inhibitors targeting programmed cell death 1 (PD-1)/programmed cell death ligand 1 were recently added to the therapeutic arsenal for GC. The microbiome composition interferes with the response to these inhibitors. A recent study reported that nonresponders to PD-1 blockade immunotherapy can be distinguished from responders according to the ratio of putatively favorable to unfavorable bacteria. Thus, the role of the microbiome in cancer-immune interactions is gaining much attention. When we learn more about host-microbiome interactions, nonresponders to checkpoint inhibitors are easier to select and treat by personalized immunotherapy.
Due to the practical limitations of analysis methods, there are still large gaps on how the microbiome mechanically affects host function at the system and community levels. Notably, the past few decades has seen significant work on AI in filling these existing gaps. AI algorithms can co-analyze heterogeneous datasets and capture changes at the microbial and host levels. These methods can be classified into four types: Interfering protein-protein interactions, interfering RNA-mediated interactions, interfering microbe-host metabolic networks, and integrating multiple interspecies and intraspecies networks and omic datasets. The powerful multiomics tools and rapidly developed AI algorithms can greatly enhance or perhaps revolutionize microbiome research. This collaboration provides hopeful expectations to improve our current understanding of GC mechanisms, as well as better detection and treatment.
We live in a world surrounded by data and microbes. The gastric microbiome occupies an important position in maintaining the individual’s health. A large quantity of complex sequencing data are generated by high-throughput technologies. However, inherent challenges still exist in data processing, including confounding variables from abundant organisms, the integration of different omics data, and the relationships between microbes and their hosts. Currently, big data are easier than ever to analyze due to the assistance of AI technologies. AI is evolving as an important tool for the proposal of new biological hypotheses and the discovery of biomarkers from the available data. In the future, the renewal of the stomach of dysbiosis patients may be achieved by synthetic biology and food engineering based on our understanding of the microbiome and the performance of AI.
Manuscript source: Invited manuscript
Specialty type: Gastroenterology and hepatology
Country/Territory of origin: China
Peer-review report’s scientific quality classification
Grade A (Excellent): 0
Grade B (Very good): B
Grade C (Good): C, C, C
Grade D (Fair): 0
Grade E (Poor): 0
P-Reviewer: Abreu de Melo MI, Kinami S, Moradi L, Sharma J S-Editor: Fan JR L-Editor: Wang TQ P-Editor: Li JH