An Evaluation Method of Human Gut Microbial Homeostasis by Testing Specific Fecal Microbiota

Zhongwen Wu , Xiaxia Pan , Yin Yuan , Pengcheng Lou , Lorina Gordejeva , Shuo Ni , Xiaofei Zhu , Bowen Liu , Lingyun Wu , Lanjuan Li , Bo Li

Engineering ›› 2023, Vol. 29 ›› Issue (10) : 110 -119.

PDF (2910KB)
Engineering ›› 2023, Vol. 29 ›› Issue (10) :110 -119. DOI: 10.1016/j.eng.2023.03.007
Research
Article
An Evaluation Method of Human Gut Microbial Homeostasis by Testing Specific Fecal Microbiota
Author information +
History +
PDF (2910KB)

Abstract

Research on microecology has been carried out with broad perspectives in recent decades, which has enabled a better understanding of the gut microbiota and its roles in human health and disease. It is of great significance to routinely acquire the status of the human gut microbiota; however, there is no method to evaluate the gut microbiome through small amounts of fecal microbes. In this study, we found ten predominant groups of gut bacteria that characterized the whole microbiome in the human gut from a large-sample Chinese cohort, constructed a real-time quantitative polymerase chain reaction (qPCR) method and developed a set of analytical approaches to detect these ten groups of predominant gut bacterial species with great maneuverability, efficiency, and quantitative features. Reference ranges for the ten predominant gut bacterial groups were established, and we found that the concentration and pairwise ratios of the ten predominant gut bacterial groups varied with age, indicating gut microbial dysbiosis. By comparing the detection results of liver cirrhosis (LC) patients with those of healthy control subjects, differences were then analyzed, and a classification model for the two groups was built by machine learning. Among the six established classification models, the model established by using the random forest algorithm achieved the highest area under the curve (AUC) value and sensitivity for predicting LC. This research enables easy, rapid, stable, and reliable testing and evaluation of the balance of the gut microbiota in the human body, which may contribute to clinical work.

Graphical abstract

Keywords

Gut microbiota / Machine learning / Microbial dysbiosis / Quantitative polymerase chain reaction / Chinese cohort

Cite this article

Download citation ▾
Zhongwen Wu, Xiaxia Pan, Yin Yuan, Pengcheng Lou, Lorina Gordejeva, Shuo Ni, Xiaofei Zhu, Bowen Liu, Lingyun Wu, Lanjuan Li, Bo Li. An Evaluation Method of Human Gut Microbial Homeostasis by Testing Specific Fecal Microbiota. Engineering, 2023, 29(10): 110-119 DOI:10.1016/j.eng.2023.03.007

登录浏览全文

4963

注册一个新账户 忘记密码

1. Introduction

The human gut is a complex and intricate mini-ecosystem that mediates interactions between the host and the environment. The human gut contains trillions of microorganisms, such as bacteria, fungi, viruses, and other life forms, most of which exist in the colon. These microorganisms and the intestinal environment together constitute the intestinal microecology, and its diversity is the result of the coevolution of the intestinal microbiota and host [1], [2], [3]. The composition of the intestinal microecology is easily affected by many factors, such as diet [4], [5], age [6], [7], sex [7], genetics [8], and medicine [1].

The composition of the intestinal microbiota plays a fundamental role in regulating human health and diseases [9]. In addition to participating in human digestive function, the intestinal microbiota can affect human development, growth, and physiology, including organ development and morphogenesis and metabolism [10], [11]. The intestinal microbiota also plays an indispensable role in the development and induction of the human immune system; it regulates the differentiation of immune cells and the production of immune mediators to maintain interaction between the host and the intestinal microflora [12], [13], [14]. The destruction of the normal intestinal microbiota will increase the risk of infection and excessive proliferation of harmful pathogens and the occurrence of inflammatory diseases [12]. Qin et al. [15] studied the changes in the gut microbiota of patients with liver cirrhosis (LC) and found that at the genus level, Bacteroides was the dominant phylotype in both groups, but its abundance was significantly decreased in the LC group [15]. Therefore, it is very important for a healthy host to maintain microecological homeostasis. Indeed, an unbalanced intestinal microecology leads to the occurrence of a variety of diseases, including liver diseases, gastrointestinal diseases, metabolic diseases, and cardiovascular diseases [16], [17], [18], [19].

Therefore, it is of great significance to routinely acquire the status of the human gut microbiota in a timely manner for the auxiliary evaluation of overall health and disease prediction. In the past, the coccus/bacillus (C/B) ratio was commonly used to reflect gut bacterial homeostasis [20], [21] based on traditional methods, such as bacterial culture or microscopic examination. More recently, with the maturity of polymerase chain reaction (PCR) technology and the microbial sequencing technologies developed from it, more bacterial species can be detected, and the amount of some key bacterial species, bacterial ratios, or other indices have been selected to serve as balance indicators for the intestinal microecology. For example, it was found that the concentration of Bifidobacterium/concentration of Enterobacteriaceae (B/E) ratio can be used to judge the extent of intestinal microecology dysbiosis in the progression of liver diseases [22], [23]. Low et al. [24] found that an increase in the abundance of Klebsiella/abundance of Bifidobacterium (K/B) ratio in early infants is a potential indicator of an increased risk of allergic disease. In addition, Ley et al. [25] first proposed that a higher abundance of Firmicutes/abundance of Bacteroidetes (F/B) ratio in the gut was likely to result in obesity. Microbial sequencing technologies have now become the mainstream method for studying the gut microbiota because the advantage of being high throughput screening. Microbial studies were mostly based on 16S ribosomal RNA (rRNA) gene sequencing [26], [27], [28] and metagenomic sequencing [29], [30]. However, these two kinds of approaches are expensive and inefficient and are generally very cumbersome. As a result, microbial sequencing is not appropriate for routine detection of the gut microbiota in a large population. For rapid testing and simplicity as well as the absolute quantification function [31], [32], a good alternative approach for carrying out routine gut bacterial detection is quantitative PCR (qPCR).

Based on previous studies [15], [32], this study attempted to establish an effective and routine gut bacterial detection procedure; ten predominant bacterial groups in the human intestinal tract were detected via qPCR, including probiotics (Lactobacillus and Bifidobacterium), opportunistic pathogens (Enterobacteriaceae, Enterococcus, Bacteroides, and Atopobium), and other health-promoting symbiotic bacteria (Faecalibacterium prausnitzii (F. prausnitzii), Clostridium butyricum (C. butyricum), Clostridium leptum (C. leptum), and Eubacterium rectale (E. rectale)). These ten representative bacteria were selected from a large sample cohort of healthy people, which were considered potential indices for the evaluation of the whole human gut microbiome. Furthermore, we sought to obtain reference ranges of a healthy population and the changing patterns of the ten bacterial groups and their pairwise ratios with aging to pave the way for subsequent studies on large-sample disease-specific populations. Moreover, to investigate the probable difference in the qPCR detection results between healthy people and people with specific diseases, we evaluated patients with LC in comparison to healthy control (HC) subjects. To further test the capability of qPCR detection to distinguish people with diseases from the general population, we utilized a machine learning algorithm to mine the information of the detection results of the LC population and the HC population and built several classification models to finally select an optimal one.

2. Material and methods

2.1. Volunteer recruitment and sample collection

A total of 510 healthy subjects and 248 patients with LC were recruited; the inclusion and exclusion criteria are listed in Section S1 in Appendix A. Feces of the first defecation in the morning were collected in a clean plastic bag tucked into a disposable plastic bowl, and after defecation, the plastic bag was then fastened tightly to avoid urine pollution. To avoid interference factors, such as food residues, the softer part of the fresh feces was selected and loaded into cryopreservation tubes within half an hour. A DNA stabilizer (Invitek, Germany) was added, and the cryopreservation tubes were numbered before storage at -80 °C for preservation. On the morning of sample collection, a blood sample was collected from the patients and was subjected to routine blood tests, blood biochemistry tests, C-reactive protein (CRP) tests, and tests for other indicators. The basic information of all volunteers (Table 1), including age, sex, and body mass index (BMI), was registered. Volunteers with unqualified fecal samples and incomplete basic information were excluded; 500 healthy people and 244 LC patients were included. All of the work was performed according to guidelines approved by the Research Ethics Committee of the First Affiliated Hospital, College of Medicine, Zhejiang University (No. 2019-1026 and 2022-874).

2.2. qPCR assessment for major gut bacterial species

Microbial DNA was extracted from the feces of the 744 volunteers with a MegaBio soil/fecal genomic DNA purification kit (Bioer, Inc., China). The specific steps are described in Section S2 in Appendix A. The concentration of total DNA of fecal microorganisms (Ct) in each sample DNA eluate was detected using Nanodrop One (Thermo Fisher Scientific, USA). Ten predominant bacterial populations in the intestine were detected by real-time qPCR. Primers were synthesized by GenScript (China), and the primer information is listed in Table S1 in Appendix A. A ViiA™ PCR system (Applied Biosystems, Inc., USA) was used to conduct qPCR on 744 fecal bacterial DNA samples with a reaction volume of 20 μL, including 10 μL of SYBR Green PCR Master Mix (Zhongnuo Gene, Inc., China), 8 μL of primer pairs (0.2-0.6 μmol·L−1), and 2 μL of template DNA or 2 μL of distilled water (negative control). The reaction conditions are listed in Table S2 in Appendix A. Each reaction was performed in triplicate, and the cycle threshold (ΔCT) between repeats was required to be less than 0.5. Plasmid DNA standards containing the corresponding amplification fragment of each primer group were diluted in multiple gradient ratios and amplified with the bacterial DNA templates in the same PCR plate. The copies of the target bacteria in the DNA template were determined by comparison with the standard curve obtained from amplification of the corresponding bacterial DNA standards. The final concentration of the target bacteria was obtained by dividing the concentration of the target bacteria in the DNA template (N) by the total DNA concentration of fecal microorganisms in each sample DNA eluate and the volume of the template (V). The unit of the concentration is copies per nanogram total DNA of fecal microorganisms, hereinafter referred to as copies·ng−1. The formula is listed in Section S4 in Appendix A.

2.3. Statistical analysis

All statistical analyses and plotting/graphical drawings in this study were performed using R script (version 4.1.2), Rstudio software (version 2022.07.1+554), and Origin 2021 (version 9.95). ggplot2 pack (version 3.3.5) was used for plotting/graphical drawings. The heatmap for correlation analysis was drawn by the corrplot package (version 0.92). The heatmap for comparing the quantity of the ten bacterial species was drawn by the pheatmap package (version 1.0.12). Data preprocessing was completed by the dplyr package (version 1.0.8). The processing of all outliers of the data adopts the box-plot method, and values less than Q1-1.5* interquartile range (IQR) or greater than Q3+1.5* IQR were determined to be outliers. These outliers were removed from the corresponding data groups. The reference range (bilateral) is calculated by taking a 95% confidence interval (95% CI) after removing outliers from the data of the healthy people. The normal distribution method was adopted for normally distributed data, and the upper and lower limits were X ± (1.96 × SD) (SD: standard deviation). The percentile method was adopted for nonnormally distributed data, and the upper and lower limits were P2.5 and P97.5, respectively. The Shapiro-Wilk test and Kolmogorov-Smirnov test were used to test the normal distribution of ten bacterial species. The Wilcoxon rank sum test was used for the analysis of differences between the two groups. The Kruskal-Wallis test was used to analyze the differences in nonnormal data among multiple groups. Spearman rank correlation analysis was used to analyze the correlation between the liver function indices and the ten associated bacterial species.

2.4. Machine learning

Machine learning algorithms were used to build models for distinguishing between cirrhotic and noncirrhotic samples. A total of 744 clinical samples (244 cirrhosis samples and 500 HC samples) from the same sample collection area were used as the dataset for constructing the model. The random sampling method was utilized to divide the data into training data and test data according to the ratio of 75-25. There were 408 training data points (183 cirrhosis data points and 225 noncirrhosis data points) and 136 test data points (61 cirrhosis data points and 75 noncirrhosis data points).

Through pretest screening, six machine learning methods were finally used to classify and model the data, including RF [33], GBM [34], AdaBoost [35], XGBoost [36], SVM_poly, and SVM_Gauss [37], with the first four methods belonging to ensemble learning [38].

The content of ten predominant bacterial species in the clinical samples and patient sex were selected as characteristics of the training model [7]. A tenfold cross-validation RF model was used to explore the importance of these features, and the mean decrease the Gini index was employed as a metric to determine the importance of these characteristics and their contribution to the model.

Repeated tenfold cross-validation (ten repeats) was applied to build and verify the model, and hyperparameter tuning was used to tune the six models. Through hyperparameter tuning, the optimal set of hyperparameter values of each of the six models was obtained, and the optimal model for each model was established according to the set of these values. The hyperparameter optimization of SVM_poly, and other models are shown in Appendix A. Finally, the area under the curve (AUC) was used to evaluate the quality of the trained model, and the model with the highest average AUC value was selected as the optimal model. When overfitting occurred, the suboptimal model was chosen as the final result. The corresponding AUC value, sensitivity, and specificity of each model were obtained from the test data to validate the final model generated by the six machine learning algorithms.

The machine learning analysis process was implemented under the caret machine learning framework of R (version 6.0-93). RF was implemented using RF (version 4.7-1.1). Gradient upgrade was implemented using the gbm package (version 2.1.8.1). AdaBoost was implemented using the adabag package (version 4.2). XGBboost was implemented using the XGBoost package (version 1.6.0.1). Two support vector machine methods were implemented using the kernlab package (version 0.9-31).

2.5. Ethics

This study was approved by the Research Ethics Committee of the First Affiliated Hospital, College of Medicine, Zhejiang University (No. 2019-1026 and 2022-874). The patients/participants provided written informed consent to participate in this study. The research protocol complies with the ethical guidelines of the 1975 Declaration of Helsinki.

3. Results

3.1. The predominant gut microbiota in healthy humans

A total of 500 healthy humans (275 males and 225 females) were included in this study. Healthy subjects were recruited based on strict exclusion criteria. After quality control, ten predominant gut bacterial species were detected by qPCR, and all data were statistically analyzed and processed to find the reference range for healthy people (Table 2). Furthermore, given that the structure and abundance of the gut microbiota are altered in people of different ages, we also analyzed and found the reference range for people of different ages (Table S3 in Appendix A). Healthy people of different ages were divided into five groups at intervals of 20 years: 0-20, 20-40, 40-60, 60-80, and 80-100 years. Except for E. rectale, the species showed significant differences among the five age groups. Among those species with significant differences, the abundance of Atopobium had the most significant change (P = 5.7 × 10−8), followed by Enterococcus (P = 1.9 × 10−7), and only E. rectale did not show significant differences with aging (P = 0.081).

3.2. Pairwise ratios are potential indicators of gut microbial homeostasis

Studies have reported that the abundance of Firmicutes decreases, and that of Bacteroidetes increases in almost all disease situations [39]. Therefore, we hypothesized that pairwise ratios are potential indicators for evaluating the balance of the gut microbiota. We compared the concentrations of the above ten bacterial species as logarithmic values (Fig. 1(a)). We used the pairwise ratio, and plotted the trends for a total of 45 ratios. Interestingly, we did find that the B/E ratio showed a typical trend of first decreasing and then increasing with age, displaying a U-shaped curve (P = 0.021). However, more studies are still needed. Moreover, the Enterococcus/Enterobacteriaceae (Ec/E) ratio, an indicator that significantly increases in critical patients [32], showed an increasing trend with age (P = 0.00025). The ratio of the two bacterial species with the largest difference between groups was C. leptum/Bacteroides (P = 2.3 × 10−7), and its increasing trend with age was also an obvious U-shaped curve that decreased first and then increased (Figs. 1(b) and (c)).

3.3. An imbalanced gut microbiota in LC patients

To further verify whether the above ten predominant bacterial species may be used as indicators for microbial homeostasis, we collected and measured 244 fecal samples from LC patients. By comparing the results of healthy people with those of LC patients, it was found that there were significant differences between seven bacterial species (Fig. 2(a)); in contrast, F. prausnitzii, C. leptum, and Atopobium showed no differences between the two populations. Among the seven bacterial species, Enterococcus, E. rectale, and Bacteroides exhibited the largest differences (P = 2.50 × 10−12, 3.47 × 10−10, and 6.57 × 10−10, respectively). This was followed by C. butyricum (P = 1.76 × 10−5), Lactobacillus (P = 1.13 × 10−3), Enterobacteriaceae (P = 5.59 × 10−3), and Bifidobacterium (P = 9.71 × 10−3), which is consistent with the results visualized in the heatmap (Fig. 2(b)). In addition, concentrations of Bifidobacterium, E. rectale, Enterobacteriaceae, and C. butyricum were relatively stable in both healthy individuals and cirrhosis patients, with few outliers. The number of outliers of the four bacterial species in cirrhosis patients and healthy people was (1/244, 1/500), (0/244, 2/500), (4/244, 1/500), and (2/244, 3/500), respectively.

Furthermore, compared to healthy individuals, the ratios of Ec/E. rectale (P = 3.08 × 10−24), C. leptum/Bacteroides (P = 2.14 × 10−18), and C. butyricum/E. rectale (P = 2.54 × 10−18) were significantly different in cirrhosis patients, indicating that the gut microbiota was imbalanced in cirrhosis patients (Fig. 2(c)). These results are consistent with our previous work [15]. However, the B/E ratio remained almost unchanged in this work.

3.4. The gut microbiota is associated with the severity of LC

Correlation analysis of the gut microbiota and liver function indicators revealed that the serum levels of alternate (ALT), albumin (ALB), direct bilirubin (DBIL), triglyceride (TG), and total bile acid (TBA) positively correlated with the abundance of Bacteroides but negatively correlated with that of the other nine bacterial species. The serum level of alkaline phosphatase (ALP) negatively correlated with the abundance of Bacteroides but positively correlated with that of the other nine bacterial species. Negative correlations between the abundance of Enterobacteriaceae and the B/E ratio were also found (Fig. 3).

3.5. Multiple machine learning models to distinguish and predict healthy people and patients with LC

Furthermore, we constructed multiple machine learning models to analyze the above results of the gut microbiota and to distinguish HC subjects from LC patients (Fig. 4(a)). By tuning the hyperparameters of the six models, the optimal combination of hyperparameters was obtained. The relationship between the hyperparameters of the SVM_poly model based on the polynomial kernel and the model performance is shown in Fig. 4(b). From the figure, the optimal combination of the hyperparameters is polynomial degree = 2, scale = 0.100, and C = 0.75. Relationship diagrams between the estimates of performance and the tuning parameters of the other five models are shown in Figs. S1-S3 in Appendix A.

The training results of the six models showed the XGBoost model to have the best training results, with an average AUC value reaching 0.9376 (95% CI, 0.9158-0.9595). This was followed by SVM_Gauss and SVM_poly, with average AUCs reaching 0.9050 (95% CI, 0.8754-0.9346) and 0.9040 (95% CI, 0.8749-0.9331), respectively. The worst training result was observed for the RF model, but the average AUC also reached 0.8746 (95% CI, 0.8414-0.9078) (Fig. 4(c)).

Then, the machine learning model was also used to analyze 50 real clinical samples. The test results (Table 3) show that the six models achieved good prediction results, among which RF achieved the best AUC value, reaching 0.8776 (95% CI, 0.8159-0.9393). This was followed by XGBoost and Adaboost, with AUC values reaching 0.8726 (95% CI, 0.8097-0.9355) and 0.8630 (95% CI, 0.7969-0.9290), respectively. The three models with the highest sensitivity were RF, XGBoost, and Adaboost, with values of 0.8361, 0.7869, and 0.7869, respectively. The three models with the highest specificity were XGBoost, Adaboost, and GBM, with values of 0.8667, 0.8533, and 0.8400, respectively (Fig. 4(c) and Table S4 in Appendix A).

4. Discussion

Gut microbiota dysbiosis is an abnormal change in the number, proportion, and species of the normal microbiota in the gut that affects human health, leading to a series of abnormal physiological and pathological phenomena [40], [41]. Homeostasis between the gut microbiota and the host immune system is compromised when the former is imbalanced [42]. The gut microbiota changes significantly during human aging [43]. Zhang et al. [7] found consistent changes in gut microbiota during aging in humans, as characterized by increased α diversity. The abundance of multiple members of the oral microbiota, Enterobacteriaceae and Clostridia, which are short-chain fatty acid (SCFA) producers, increased with age. They also found that several Bifidobacterium species (B. breve, B. bifidum, B. longum, and B. adolescentis) negatively correlated with age. Biagi et al. [44] revealed that the cumulative abundance of symbiotic bacterial taxa (mostly belonging to the dominant Ruminococcaceae, Lachnospiraceae, and Bacteroidaceae families) decreased with age. Nevertheless, health-related Akkermansia, Bifidobacterium, and Christensenellaceae were enriched in elderly individuals, especially semisupercentenarians (105-109 years old). Another study also found a decrease in the abundance of core genera in the gut microbiota of healthy aging individuals, particularly Bacteroides, which is associated with longer life expectancy [45]. While different studies have identified gut microbiota changes during human aging, there is no consensus on the changing patterns of the gut microbiome during aging. Therefore, in this study, we aimed to establish the reference range for all ages as well as for different age groups of healthy people based on the results of a large cohort and to determine underlying characteristics during healthy aging.

Based on our previous work, we detected ten predominant gut bacterial species in each healthy individual by qPCR. The ten predominant gut bacterial species selected for this study were found to play an important role in maintaining intestinal homeostasis. For instance, Lactobacillus and Bifidobacterium are important for regulating immunity and maintaining gut barrier function [46], [47], [48], and F. prausnitzii, C. butyricum, C. leptum, and E. rectale can produce SCFAs, which are important in inhibiting the overgrowth of opportunistic pathogens, maintaining the integrity of the intestinal epithelial barrier, and enhancing immunity [49], [50], [51]. Our results revealed that the concentrations of the ten predominant gut bacterial species changed differently with age. The healthy population was divided into five groups at intervals of 20 years (0-20, 20-40, 40-60, 60-80, and 80-100). Nine of the ten predominant bacterial species showed some variation in all five age groups. Atopobium showed the most significant difference (P = 5.7 × 10−8); this was followed by Enterococcus (P = 1.9 × 10−7). In addition, except for Bacteroides, the concentration of which increased and then decreased with age, the other bacteria showed a general trend of decreasing and then increasing. Most large-sample studies have been based on 16S rRNA or metagenomic sequencing, but one of the main limitations of sequencing is that taxa can only be assigned according to the sequence of a single region in the bacterial genome, and only relative abundance results are obtained, which makes it difficult to generate a stable and reliable reference value [31], [52]. In contrast, qPCR can be used to quantitatively detect bacterial concentration by calibration with known concentrations of standard substances, which has the characteristics of good repeatability, rapid results, and simple operation. Timely detection of intestinal microbial dysbiosis is helpful for clinicians to achieve early diagnosis and treatment and to improve prognosis [32], [53].

Our previous study analyzed the changes in the gut microbiota of patients with LC. We found that at the genus level, Bacteroides was the dominant phylotype in both groups, but its abundance was significantly decreased in the LC group [15]. Of the remaining genera, Veillonella, Streptococcus, Clostridium, and Prevotella were enriched in the LC group, while Eubacterium and Alistipes were dominant in the HC subjects. Of the species that decreased the most in abundance in the LC group, twelve were Bacteroidetes and seven were Firmicutes, specifically from the order Clostridiales. However, the cost of metagene sequencing is high, and the amount of data is large, which requires a large amount of computing resources to perform analysis. Therefore, in this study, we aimed to characterize the changes in the gut microbiota of cirrhosis patients using a smaller number of bacteria. Moreover, we also constructed multiple machine learning models to further analyze the microecological results. Interestingly, we found that the prediction results of the classification models built by four algorithms under the framework of ensemble learning were better than those of SVM and some other machine learning algorithms. Among them, the classification model of RF achieved the highest AUC value and sensitivity, and the classification model of XGBoost achieved the second highest AUC value and the highest specificity, which were relatively better than those of the other four models. Comparing the two classification models, the RF model had a higher AUC value and sensitivity and better identified patients with cirrhosis or suspected cirrhosis, but there were more false-positives. The XGBoost model showed higher specificity and was better able to identify healthy people. However, there were more false-negatives, which may result in unrecognized cirrhosis patients failing to visit the hospital in a timely manner, delaying diagnosis and treatment. The RF model with the highest sensitivity was the best choice. Although the RF model may cause some healthy people to be misdiagnosed, the benefits of finding more patients with true cirrhosis in a timely manner outweigh the risks. However, geography and location have strong influences on the human gut microbiome, and geographical differences limit the application of the reference range of the healthy gut microbiome and disease models. To verify the prevalence of microbiota differences between healthy or disease states, standardized experimental protocols, regional study designs, and extensive sampling are needed, and geography as a feature needs to be added to the model. Different predictive models need to be trained for different geographical regions [54], [55]. Considering the influence of region on the human microbiota, samples from different regions will be collected in the future to re-establish the classification model. We will adopt two strategies: ① add region as an additional feature to the model and ② establish classification models according to different regions relatively independently and select a better classification model according to the results.

The combination of qPCR and machine learning makes is an easy, rapid, stable, and reliable method to test and analyze the gut microbiome of the human body. With the inclusion of new samples, our healthy population cohort will be subsequently expanded, the reference range for the healthy population results will be updated as the sample size expands, and the capability of relevant machine learning models to predict illnesses will be improved through continuous learning and prediction, which may contribute to clinical work.

5. Conclusions

Ten kinds of predominant gut bacterial species that characterize the whole microbiome in the human gut were found from a large-sample Chinese cohort. We established the reference ranges of these ten predominant gut bacterial groups by detecting their concentrations by qPCR and discovered the changing patterns of the ten bacterial groups with aging and disease. In addition, we utilized machine learning algorithms to deeply extract differential information from the detection results and built and selected a reliable classification model for predicting LC. This study revealed that it is highly necessary to describe and predict the changes in gut microbiota in a healthy Chinese population with a small amount of information. Based on this healthy range, it can be widely used to predict and describe intestinal microecological dysbiosis in various diseases. However, more new theoretical models and clinical practice are still needed in future work.

References

[1]

R.K. Weersma, A. Zhernakova, J. Fu. Interaction between drugs and the gut microbiome. Gut, 69 (8) ( 2020), pp. 1510-1519. DOI: 10.1136/gutjnl-2019-320204

[2]

R.E. Ley, D.A. Peterson, J.I. Gordon. Ecological and evolutionary forces shaping microbial diversity in the human intestine. Cell, 124 (4) ( 2006), pp. 837-848

[3]

H. Brody. The gut microbiome. Nature, 577 (7792) ( 2020), p. S5. DOI: 10.1038/d41586-020-00194-2

[4]

T.S. Ghosh, S. Rampelli, I.B. Jeffery, A. Santoro, M. Neto, M. Capri, et al.. Mediterranean diet intervention alters the gut microbiome in older people reducing frailty and improving health status: the NU-AGE 1-year dietary intervention across five European countries. Gut, 69 (7) ( 2020), pp. 1218-1228. DOI: 10.1136/gutjnl-2019-319654

[5]

M.J. Claesson, I.B. Jeffery, S. Conde, S.E. Power, E.M. O'Connor, S. Cusack, et al.. Gut microbiota composition correlates with diet and health in the elderly. Nature, 488 (7410) ( 2012), pp. 178-184. DOI: 10.1038/nature11319

[6]

M.J. Claesson, S. Cusack, O. O'Sullivan, R. Greene-Diniz, H. de Weerd, E. Flannery, et al.. Composition, variability, and temporal stability of the intestinal microbiota of the elderly. Proc Natl Acad Sci USA, 108 (Suppl 1) ( 2011), pp. 4586-4591. DOI: 10.1073/pnas.1000097107

[7]

X. Zhang, H. Zhong, Y. Li, Z. Shi, H. Ren, Z. Zhang, et al.. Sex- and age-related trajectories of the adult human gut microbiota shared across populations of different ethnicities. Nature Aging, 1 ( 2021), pp. 87-100

[8]

A. Burberry, M.F. Wells, F. Limone, A. Couto, K.S. Smith, J. Keaney, et al.. C9orf 72 suppresses systemic and neural inflammation induced by gut bacteria. Nature, 582 (7810) ( 2020), pp. 89-94. DOI: 10.1038/s41586-020-2288-7

[9]

A. Heintz-Buschart, P. Wilmes. Human gut microbiome: function matters. Trends Microbiol, 26 (7) ( 2018), pp. 563-574

[10]

W.H. Karasov, C. Martinez del Rio, E. Caviedes-Vidal. Ecological physiology of diet and digestive systems. Annu Rev Physiol, 73 ( 2011), pp. 69-93. DOI: 10.1146/annurev-physiol-012110-142152

[11]

F. Sommer, F. Backhed. The gut microbiota—masters of host development and physiology. Nat Rev Microbiol, 11 (4) ( 2013), pp. 227-238. DOI: 10.1038/nrmicro2974

[12]

N. Kamada, S.U. Seo, G.Y. Chen, G. Nunez. Role of the gut microbiota in immunity and inflammatory disease. Nat Rev Immunol, 13 (5) ( 2013), pp. 321-335. DOI: 10.1038/nri3430

[13]

P.M. Smith, W.S. Garrett. The gut microbiota and mucosal T cells. Front Microbiol, 2 ( 2011), p. 111

[14]

S. Kawamoto, T.H. Tran, M. Maruya, K. Suzuki, Y. Doi, Y. Tsutsui, et al.. The inhibitory receptor PD-1 regulates IgA selection and bacterial composition in the gut. Science, 336 (6080) ( 2012), pp. 485-489. DOI: 10.1126/science.1217718

[15]

N. Qin, F. Yang, A. Li, E. Prifti, Y. Chen, L. Shao, et al.. Alterations of the human gut microbiome in liver cirrhosis. Nature, 513 (7516) ( 2014), pp. 59-64. DOI: 10.1038/nature13568

[16]

F. Frost, T. Kacprowski, M. Ruhlemann, M. Pietzner, C. Bang, A. Franke, et al.. Long-term instability of the intestinal microbiome is associated with metabolic liver disease, low microbiota diversity, diabetes mellitus and impaired exocrine pancreatic function. Gut, 70 (3) ( 2021), pp. 522-530. DOI: 10.1136/gutjnl-2020-322753

[17]

S.V. Lynch, O. Pedersen. The human intestinal microbiome in health and disease. N Engl J Med, 375 (24) ( 2016), pp. 2369-2379

[18]

B. Schnabl, D.A. Brenner. Interactions between the intestinal microbiome and liver diseases. Gastroenterology, 146 (6) ( 2014), pp. 1513-1524

[19]

J.S. Bajaj, D.M. Heuman, P.B. Hylemon, A.J. Sanyal, M.B. White, P. Monteith, et al.. Altered profile of human gut microbiome is associated with cirrhosis and its complications. J Hepatol, 60 (5) ( 2014), pp. 940-947

[20]

X. Yu, X. Zhang, H. Jin, Z. Wu, C. Yan, Z. Liu, et al.. Zhengganxifeng decoction affects gut microbiota and reduces blood pressure via renin-angiotensin system. Biol Pharm Bull, 42 (9) ( 2019), pp. 1482-1490. DOI: 10.1248/bpb.b19-00057

[21]

Y. Zhang, Y. Wang, B. Ke, J. Du. TMAO: how gut microbiota contributes to heart failure. Transl Res, 228 ( 2021), pp. 109-125. DOI: 10.3390/electronics10020109

[22]

H. Lu, Z. Wu, W. Xu, J. Yang, Y. Chen, L. Li. Intestinal microbiota was assessed in cirrhotic patients with hepatitis B virus infection. Intestinal microbiota of HBV cirrhotic patients. Microb Ecol, 61 (3) ( 2011), pp. 693-703. DOI: 10.1007/s00248-010-9801-8

[23]

Li L. Infectious microecology: theory and applications. Hangzhou: Zhejiang University Press; 2014.

[24]

J.S.Y. Low, S.E. Soh, Y.K. Lee, K.Y.C. Kwek, J.D. Holbrook, E.M. van der Beek, et al.. Ratio of Klebsiella/Bifidobacterium in early life correlates with later development of paediatric allergy. Benef Microbes, 8 (5) ( 2017), pp. 681-695. DOI: 10.3920/BM2017.0020

[25]

R.E. Ley, P.J. Turnbaugh, S. Klein, J.I. Gordon. Microbial ecology: human gut microbes associated with obesity. Nature, 444 (7122) ( 2006), pp. 1022-1023. DOI: 10.1038/4441022a

[26]

P.J. Turnbaugh, M. Hamady, T. Yatsunenko, B.L. Cantarel, A. Duncan, R.E. Ley, et al.. A core gut microbiome in obese and lean twins. Nature, 457 (7228) ( 2009), pp. 480-484. DOI: 10.1038/nature07540

[27]

J.J. Faith, J.L. Guruge, M. Charbonneau, S. Subramanian, H. Seedorf, A.L. Goodman, et al.. The long-term stability of the human gut microbiota. Science, 341 (6141) ( 2013), p. 1237439

[28]

J.M. Bender, F. Li, H. Adisetiyo, D. Lee, S. Zabih, L. Hung, et al.. Quantification of variation and the impact of biomass in targeted 16S rRNA gene sequencing studies. Microbiome, 6 (1) ( 2018), p. 155

[29]

J. Qin, Y. Li, Z. Cai, S. Li, J. Zhu, F. Zhang, et al.. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature, 490 (7418) ( 2012), pp. 55-60. DOI: 10.1038/nature11450

[30]

P. Lepage, M.C. Leclerc, M. Joossens, S. Mondot, H.M. Blottiere, J. Raes, et al.. A metagenomic insight into our gut's microbiome. Gut, 62 (1) ( 2013), pp. 146-158. DOI: 10.1136/gutjnl-2011-301805

[31]

R. Poretsky, R.L. Rodriguez, C. Luo, D. Tsementzi, K.T. Konstantinidis. Strengths and limitations of 16S rRNA gene amplicon sequencing in revealing temporal microbial community dynamics. PLoS One, 9 (4) ( 2014), p. e93827. DOI: 10.1371/journal.pone.0093827

[32]

L. Tang, S. Gu, Y. Gong, B. Li, H. Lu, Q. Li, et al.. Clinical significance of the correlation between changes in the major intestinal bacteria species and COVID-19 severity. Engineering, 6 (10) ( 2020), pp. 1178-1184

[33]

L. Breiman. Random forests. Mach Learn, 45 (1) ( 2001), pp. 5-32

[34]

J.H. Friedman. Greedy function approximation: a gradient boosting machine. Ann Stat, 29 (5) ( 2001), pp. 1189-1232

[35]

R.E.S. Yoav Freund. A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci, 55 (1) ( 1997), pp. 119-139

[36]

T. Chen, C. Guestrin. XGBoost: a scalable tree boosting system. Assoc Comp Machinery ( 2016), pp. 785-794. DOI: 10.1145/2939672.2939785

[37]

C. Cortes, V. Vapnik. Support vector networks. Mach Learn, 20 (3) ( 1995), pp. 273-297

[38]

T.G. Dietterich. Ensemble methods in machine learning. Springer, Berlin Heidelberg ( 2000), pp. 1-15. DOI: 10.1007/3-540-45014-9_1

[39]

Q. Su, Q. Liu, R.I. Lau, J. Zhang, Z. Xu, Y.K. Yeoh, et al.. Faecal microbiome-based machine learning for multi-class disease diagnosis. Nat Commun, 13 (1) ( 2022), p. 6818

[40]

N.R. Shin, T.W. Whon, J.W. Bae. Proteobacteria: microbial signature of dysbiosis in gut microbiota. Trends Biotechnol, 33 (9) ( 2015), pp. 496-503

[41]

M.F. Sun, Y.Q. Shen. Dysbiosis of gut microbiota and microbial metabolites in Parkinson's disease. Ageing Res Rev ( 2018), pp. 4553-4561. DOI: 10.1109/tsp.2018.2855658

[42]

L.V. Hooper, D.R. Littman, A.J. Macpherson. Interactions between the microbiota and the immune system. Science, 336 (6086) ( 2012), pp. 1268-1273. DOI: 10.1126/science.1223490

[43]

V.D. Badal, E.D. Vaccariello, E.R. Murray, K.E. Yu, R. Knight, D.V. Jeste, et al.. The gut microbiome, aging, and longevity: a systematic review. Nutrients, 12 (12) ( 2020)

[44]

E. Biagi, C. Franceschi, S. Rampelli, M. Severgnini, R. Ostan, S. Turroni, et al.. Gut microbiota and extreme longevity. Curr Biol, 26 (11) ( 2016), pp. 1480-1485

[45]

T. Wilmanski, C. Diener, N. Rappaport, S. Patwardhan, J. Wiedrick, J. Lapidus, et al.. Gut microbiome pattern reflects healthy ageing and predicts survival in humans. Nat Metab, 3 (2) ( 2021), pp. 274-286. DOI: 10.1038/s42255-021-00348-0

[46]

Q. Yu, L. Yuan, J. Deng, Q. Yang. Lactobacillus protects the integrity of intestinal epithelial barrier damaged by pathogenic bacteria. Front Cell Infect Microbiol, 5 ( 2015), p. 26

[47]

X. Peng, A. Ed-Dra, Y. Song, M. Elbediwi, R.B. Nambiar, X. Zhou, et al.. Lacticaseibacillus rhamnosus alleviates intestinal inflammation and promotes microbiota-mediated protection against Salmonella fatal infections. Front Immunol, 13 ( 2022), p. 973224

[48]

N. Salazar, M. Gueimonde, R.G. de Los, P. Ruas-Madiedo. Exopolysaccharides produced by lactic acid bacteria and bifidobacteria as fermentable substrates by the intestinal microbiota. Crit Rev Food Sci Nutr, 56 (9) ( 2016), pp. 1440-1453. DOI: 10.1080/10408398.2013.770728

[49]

D. Parada Venegas, M.K. De la Fuente, G. Landskron, M.J. Gonzalez, R. Quera, G. Dijkstra, et al.. Short chain fatty acids (SCFAs)-mediated gut epithelial and immune regulation and its relevance for inflammatory bowel diseases. Front Immunol, 10 ( 2019), p. 277

[50]

D.R. Donohoe, N. Garge, X. Zhang, W. Sun, T.M. O'Connell, M.K. Bunger, et al.. The microbiome and butyrate regulate energy metabolism and autophagy in the mammalian colon. Cell Metab, 13 (5) ( 2011), pp. 517-526

[51]

J. Park, M. Kim, S.G. Kang, A.H. Jannasch, B. Cooper, J. Patterson, et al.. Short-chain fatty acids induce both effector and regulatory T cells by suppression of histone deacetylases and regulation of the mTOR-S6K pathway. Mucosal Immunol, 8 (1) ( 2015), pp. 80-93. DOI: 10.1038/mi.2014.44

[52]

I. Laudadio, V. Fulci, F. Palone, L. Stronati, S. Cucchiara, C. Carissimi. Quantitative assessment of shotgun metagenomics and 16S rDNA amplicon sequencing in the study of human gut microbiome. OMICS, 22 (4) ( 2018), pp. 248-254. DOI: 10.1089/omi.2018.0013

[53]

S. Bartosch, A. Fite, G.T. Macfarlane, M.E. McMurdo. Characterization of bacterial communities in feces from healthy elderly volunteers and hospitalized elderly patients by using real-time PCR and effects of antibiotic treatment on the fecal microbiota. Appl Environ Microbiol, 70 (6) ( 2004), pp. 3575-3581

[54]

Y. He, W. Wu, H.M. Zheng, P. Li, D. McDonald, H.F. Sheng, et al.. Regional variation limits applications of healthy gut microbiome reference ranges and disease models. Nat Med, 24 (10) ( 2018), pp. 1532-1535. DOI: 10.1038/s41591-018-0164-x

[55]

T. Yatsunenko, F.E. Rey, M.J. Manary, I. Trehan, M.G. Dominguez-Bello, M. Contreras, et al.. Human gut microbiome viewed across age and geography. Nature, 486 (7402) ( 2012), pp. 222-227. DOI: 10.1038/nature11053

Funding

the National Key Research and Development Program of China(2018YFC2000500)

the Fundamental Research Funds for the Central Universities(2022ZFJH003)

the Independent Task of State Key Laboratory for Diagnosis and Treatment of Infectious Diseases(2022zz22)

the National Natural Science Foundation of China(81703430)

the National Natural Science Foundation of China(32170058)

the National Natural Science Foundation of China(82200994)

the Chinese Academy of Medical Sciences Innovation Fund for Medical Sciences(2019-I2M-5-045)

the Research Project of Jinan Microecological Biomedicine Shandong Laboratory(JNL-2022051B)

PDF (2910KB)

Supplementary files

Supplementary Material

5103

Accesses

0

Citation

Detail

Sections
Recommended

/