This article is an open-access article which was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Author contributions: All authors equally contributed to this paper with conception and design of the study, literature review and analysis, drafting and critical revision and editing, and final approval of the final version.
Supported bythe Programma Ricerca Budget prestazioni Eurac 2017 of the Province of Bolzano, Italy.
Conflict-of-interest statement: No potential conflicts of interest.
Open-Access: This is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Corresponding author: Frank Maixner, PhD, Senior Researcher, Institute for Mummy Studies, EURAC Research, Viale Druso 1, Bolzano 39100, Italy. firstname.lastname@example.org
Telephone: +39-471-055567 Fax: +39-471-055579
Received: June 28, 2019 Peer-review started: June 28, 2019 First decision: July 20, 2019 Revised: September 13, 2019 Accepted: November 1, 2019 Article in press: November 1, 2019 Published online: November 14, 2019
The bacterium Helicobacter pylori (H. pylori) infects the stomachs of approximately 50% of all humans. With its universal occurrence, high infectivity and virulence properties it is considered as one of the most severe global burdens of modern humankind. It has accompanied humans for many thousands of years, and due to its high genetic variability and vertical transmission, its population genetics reflects the history of human migrations. However, especially complex demographic events such as the colonisation of Europe cannot be resolved with population genetic analysis of modern H. pylori strains alone. This is best exemplified with the reconstruction of the 5300-year-old H. pylori genome of the Iceman, a European Copper Age mummy. Our analysis provided precious insights into the ancestry and evolution of the pathogen and underlined the high complexity of ancient European population history. In this review we will provide an overview on the molecular analysis of H. pylori in mummified human remains that were done so far and we will outline methodological advancements in the field of ancient DNA research that support the reconstruction and authentication of ancient H. pylori genome sequences.
Core tip: The molecular analysis of ancient human remains holds the potential to reconstruct Helicobacter pylori genomes dating to various time periods from all over the world. This will provide precious insights into the virulence evolution and population history of this important stomach pathogen.
Citation: Maixner F, Thorell K, Granehäll L, Linz B, Moodley Y, Rattei T, Engstrand L, Zink A. Helicobacter pylori in ancient human remains. World J Gastroenterol 2019; 25(42): 6289-6298
One of the oldest and most common members of the human stomach microbial community is the pathogen Helicobacter pylori (H. pylori). This spiral shaped bacterium colonizes the stomachs of approximately half the world’s population and is the primary cause of peptic ulcers and gastric cancer. Modern H. pylori strains cluster into multiple, distinct populations based on their geographic origin[2,3]. During H. pylori evolution, founder effects, introgression and geographic separation significantly shaped modern populations and despite very high within-population genetic diversity, genetic signals for more ancient admixture events still persist. Interestingly, the ancestry proportions in H. pylori correlate well with those in associated human populations, indicating that the pathogen accompanied its host on migrations over the last 100000 years. The distinct phylogeographic pattern observed for H. pylori renders this bacterium a powerful marker for tracing complex demographic events in human prehistory, especially those that occurred in hybrid zones where isolates from several ancestral populations admixed. One of these complex demographic histories are still visible in the modern European H. pylori population (hpEurope), which was shown to have arisen through large-scale admixture after secondary contact between two ancestral populations: Ancestral Europe (AE) 1 and AE2. Today, hpEurope is indigenous to Europe, North Africa and western Asia, but one or other of its ancestral components are found at highest frequency in areas that geographically fringe this range. Thus, strains with a high AE1 component have been isolated in Finland, Kazakhstan and northern India belonging to the modern hpAsia2 population, whereas highly AE2 strains are predominantly found in North-East Africa and the Sahel, and belong to the modern hpNEAfrica population. However, since several waves of migrations are known to have shaped the population structure of European people[8-10], it remains controversial which colonizing human group brought with it AE1 and which AE2. Given that modern day humans living in and around the Fertile Crescent are all infected with hpEurope, it would seem even more likely that the AE1-AE2 admixture occurred somewhere in the western Asia, and later colonized Europe in the stomachs of Neolithic farmers.
The analysis of ancient H. pylori genome sequences adds new precious mosaic pieces to the understanding of these complex demographic events. Our recent study of H. pylori in the Iceman, a 5300-year-old Copper age individual, provided important details on a possible disease manifestation in the mummy and the origin of the stomach pathogen in Europe. In this review we will give an overview on the H. pylori diagnostics that have been performed so far on mummified human remains and we will outline methodological and analytical advancements in the field of ancient DNA research that support the reconstruction and authentication of ancient H. pylori genome sequences.
DETECTION OF H. PYLORI IN ANCIENT HUMAN REMAINS
Currently, there exist only few studies that provided evidence for the presence of H. pylori biomolecules (DNA, proteins) in ancient human remains (Table 1). In contrast to other pathogens such as Yersina pestis or Mycobacterium tuberculosis[11,12], H. pylori cells or biomolecules seem not to become distributed during infection via the blood stream into different skeletal parts (e.g., tooth, vertebra). Therefore, all studies so far diagnosed ancient H. pylori biomolecules at the actual site of infection and its surroundings, in mummified stomach tissues and intestinal contents.
Table 1 Mummified human remains diagnosed Helicobacter pylori positive.
AD: Anno domini; BC: Before present; AB: Antibody; UreB: Urease subunit beta; VacA: Vacuolating cytotoxin A; CagA: Cytotoxin-associated gene A; H. pylori: Helicobacter pylori.
The first evidence for the existence of H. pylori biomolecules in ancient human remains came from a study that analysed 1700-year-old stool samples from South America. By using a commercial ELISA kit (Helicobacter pylori stool assay, Meridian Diagnostics) Correa and colleagues unambiguously detected H. pylori antigens in two out of 16 stool specimens. The two H. pylori positive specimens came from a 25-year-old mummified male that belonged to the Cabuza culture in Chile. Another study of four pre-Columbian mummies from Mexico using a PCR-hybridisation assay targeting the 16S rRNA and ureB gene confirmed the presence of H. pylori DNA in gastric tissues of a 50 to 60-year-old male dated to approx. 1350 AD.
The next two studies not only diagnosed H. pylori DNA in ancient specimen but also characterized two major virulence factors of the stomach pathogen by targeting the cytotoxin-associated gene A (CagA) and vacuolating cytotoxin A (VacA) using a PCR-based assay. Molecular analysis on stomach tissue of the Kwäday Dan Tsi’nchi individual, a 200 to 300-years-old frozen male mummy discovered in a glacier in British Colombia, Canada, revealed the presence of an ancient H. pylori strain carrying a hybrid vacA m2a/m1d middle (m) region allele and a vacA s2 signal (s) region allele. This unusual allele combination, having a hybrid middle allele clustering to Asian allele variants and a s2 signal region more common to Western strains, suggests that the ancient strain is a possible hybrid between Asian and European strains. This observation would support the assumption that this first nation individual is a descendent of the first humans who migrated into the New World by crossing over the Bering Strait from Asia and that he already had early European contact. A second study screened stomach specimens from eight excellently well preserved 17th century Joseon Korean mummies for the presence of H. pylori. The applied PCR-assay identified the vacA allele type of two mummies, the Cheongdo mummy (vacA s1/m2) and the Dangjin mummy (vacA s1). While the vacA s1 allele is one of the most frequently detected subtypes in the modern South Korean populations, the m2 subtype is only found in 12% of H. pylori patients in South Korea[17,18].
Notably, both studies on the Kwäday Dan Tsi’nchi individual and on the Korean mummies report the absence of the cagA gene, that encodes one of the major effector proteins in the H. pylori cag pathogenicity island. It is known that cagA is present in only approximately 70% of modern H. pylori strains worldwide. Interestingly, this rate varies geographically from approximately 90% in East Asian countries to only about 40% in Western Countries. Whether this observation in the ancient H. pylori strains is true or only an artefact due to the highly degraded state of the ancient DNA remains to be determined.
The most comprehensive study of H. pylori in ancient human remains was the reconstruction and analysis of the 5300-year-old H. pylori genome of the Iceman. The Tyrolean Iceman, commonly referred to as “Ötzi”, is one of the oldest human mummies discovered (Figure 1). His body was preserved for more than five millennia in an Italian Alpine glacier before he was discovered by two German mountaineers at an altitude of 3,210 m above sea level in September 1991[21,22]. The discovery of the Iceman is extremely valuable for scientists, not only because of his historical age and the range of objects he was carrying when he died (clothing, hunting equipment such as an axe, dagger, a bow and quiver of arrows), but also the way he was preserved over time. The Iceman is a so-called “ice mummy”, i.e., humidity was retained in his cells while he was naturally mummified by freeze-drying. The body tissues and intestines are therefore still well preserved, and this feature makes them suitable for various modern scientific investigations[24-29]. During a recent radiological re-examination, the Iceman´s stomach was identified and shown to be completely filled. Endoscopy-guided biopsy samples of the Iceman’s stomach wall and content were taken with subsequent microscopic and molecular analysis to identify the nature of the Iceman´s last meal and to determine the presence of H. pylori. First histological analysis of the Iceman´s stomach mucosa revealed only remnants of connective tissue without any further structural details. No human cellular structures or attached bacterial cells were present in the ancient tissue. Further PCR based diagnostics targeting a fragment of the vacA gene revealed traces of H. pylori DNA only in the stomach content material and not in the stomach mucosa tissue. Therefore, we hypothesize that the bacterial cells and biomolecules most presumably detached post-mortem from the mucosa surface and accumulated in the stomach content.
Figure 1 Molecular workflow applied to the Iceman intestinal content samples.
Both “untargeted” shotgun metagenomic next generation sequencing and “targeted” capture-enrichment have been used to identify endogenous Helicobacter pylori DNA and to reconstruct the ancient pathogen genome.
Since we could not exclude the possibility that this first PCR-based positive result may have also come from an external contamination with modern H. pylori cells we decided to extend our molecular approach to a metagenomics diagnostic assay. As outlined by Spyrou and colleagues in their excellent review on ancient pathogen genomics includes this assay two major steps, first a shotgun-based “untargeted” metagenomic screening for the presence of authentic pathogen DNA and second a “targeted” capture-sequencing based enrichment approach aiming to reconstruct ancient pathogen genomes (Figure 1). Importantly, this approach allows not only an appropriate authentication of the detected H. pylori DNA but also provides the unique opportunity to perform comparative sequence analysis on a whole genome level (as outlined in the next two chapters). First metagenomic analysis of 12 biopsy samples from the gastrointestinal tract of the Iceman yielded endogenous ancient H. pylori DNA in all gastrointestinal tract contents. Importantly, the H. pylori reads did not only cover most of the reference genome sequence, but also followed the expected physiological distribution pattern, with abundance steeply decreasing as samples move away from the stomach towards the lower large intestinal tract. Thus, even after 5300 years, the distribution of H. pylori in the Iceman´s gastrointestinal tract is homologous to the distribution found in modern H. pylori-positive humans. In modern patients with peptic ulcers, H. pylori can account for > 93% of the bacterial reads of the stomach microbiome. Due to leakage from the stomach into the intestines the depleted H. pylori antigens and DNA can still be used for diagnostics in stool samples. After having confirmed the presence of authentic endogenous H. pylori DNA we subjected Iceman stomach content biopsies to a capture-sequencing based enrichment approach. To account for the high genomic variability between H. pylori strains, we implemented the genomes of nine different H. pylori multi locus sequence types (MLSTs), representing the pathogen´s known global diversity, in the RNA based in solution target enrichment assay. Using this capture design, we successfully enriched H. pylori sequences up to 216-fold which finally allowed us to reconstruct 92.2% of the 1.6-Mb H. pylori reference genome with an 18.9-fold average coverage.
AUTHENTIFICATION OF ANCIENT H. PYLORI GENOME SEQUENCES
The advent of high-throughput sequencing (HTS) has greatly improved the field of ancient DNA research. In contrast to PCR based methods, where single DNA regions are targeted, HTS has the possibility to sequence whole genomes and metagenomes. By using HTS it is also possible to target shorter fragments of DNA, which is one of the typical features of ancient DNA, as well as authenticate ancient endogenous sequences and distinguish them from modern or environmental contamination.
There are several features of ancient DNA which can be used to distinguish between endogenous and contaminant DNA. Firstly, in contrast to modern DNA, ancient DNA has been degraded and typically has a very short sequence length, as can be seen in the H. pylori sequences from the Iceman gut content (Figure 2A). Secondly, the degradation of DNA also result in specific damage patterns at the ends of the sequence reads. In comparison to the reference genome, ancient DNA have an accumulation of C to T substitutions toward the 5’ end and G to A substitutions toward the 3’ end. These characteristic C to T substitutions are present in the H. pylori sequences from the Iceman gut, as well as the human sequences from the Iceman himself (Figure 2B), indicating their ancient origin.
Figure 2 Ancient DNA characteristics used for authenticating endogenous sequences.
Visualised with Helicobacter pylori (H. pylori) sequences enriched from the gut of the Iceman and aligned to the H. pylori 26695 reference genome: A: Read length distribution. B: C to T substitutions towards the 5’ end of sequence reads (Enriched H. pylori reads, metagenomic H. pylori reads, human Iceman genomic reads). C: Mean genome coverage and distribution of the aligned sequences. H. pylori: Helicobacter pylori.
Further validation of ancient sequences can be done by studying the distribution patterns of species assigned sequences, i.e., how well distributed the sequences are across the reference genome. The even distribution of the Iceman H. pylori sequences across the H. pylori 26696 reference genome (Figure 2C), suggests that most parts of the genome are still present. Clustering of sequences in a few regions of the genome (stacking effect) would instead have suggested an incorrect assignment due to unspecific homology only. In summary, the above-mentioned characteristics allow the proper authentication of ancient endogenous pathogen DNA which opens the possibility for further comparative sequence analysis on a whole genome level.
COMPARATIVE SEQUENCE ANALYSIS USING ANCIENT H. PYLORI DNA
First, the Iceman H. pylori virulence factors cagA and vacA were subjected to comparative sequence analysis with modern H. pylori allele types. Depending on the presence or absence of cagA and vacA in the genome and based on different allele types, H. pylori strains can be associated with varying host tropism and virulence. Based on the current understanding of the allele types and the role of the H. pylori virulence factors CagA and VacA, the Iceman H. pylori can be classified as cagA-positive vacA s1a/i1/m1 type strain associated with a significantly increased inflammation within the gastric mucosa[18,40]. Further paleo-proteomic analysis revealed the presence of an intestinal inflammation marker protein indicating active inflammatory host response in the Iceman stomach strongly associated with H. pylori infection.
Next, we assigned the ancient strain to a modern H. pylori population. For this, we extracted the seven (MLST) loci from the ancient genome and compared the gene fragments, that are used for population and subpopulation differentiation among H. pylori[2,6], with a MLST database of 1603 H. pylori strains using the Bayesian population assignment software STRUCTURE. The comparison to a global set of modern H. pylori strains assigned the 5300-year-old bacterium to the modern population hpAsia2 which is today commonly found in Central and South Asia (Figure 3A). This result was further confirmed by a more focused analysis of the strains belonging to the populations hpAsia2, hpEurope and hpNEAfrica (Figure 3B). The assignment to hpAsia2 was rather surprising since stomachs of modern Europeans are pre-dominantly colonized by recombinant hpEurope strains originating from the two ancestral populations AE1 and AE2. By applying the STRUCTURE linkage model, that was designed to determine levels of ancient admixture, we could show that 95% of the nucleotides making up the Iceman’s H. pylori strain were derived from AE1 (Figure 3C). This could suggest that this ancestral component was more widespread in Europe 5000 years ago compared to the present day, where extensive sampling throughout Europe and western Asia, only recovered four strains, each with an unusually high AE1 component (from the Netherlands, Estonia, Finland, and Kazakhstan, blue arrows Figure 3B and C).
Figure 3 Population assignment of the ancient Helicobacter pylori strain using multilocus sequence typing.
A: Comparison of the Iceman strain to a collection of strains representing the worldwide Helicobacter pylori (H. pylori) populations. The proportion of the Iceman strain is displayed on the right. B: Comparison of the Iceman strain to the H. pylori populations hpNEAfrica, hpEurope, hpAsia2. C: STRUCTURE linkage model analysis showing the proportion of Ancestral Europe (AE) 1 (from Central Asia) and AE2 (from northeast Africa).
Further high-resolution analysis of ancestral motifs using fineSTRUCTURE showed that the ancient H. pylori genome shares high levels of ancestry with Indian hpAsia2 strains, but even higher co-ancestry with most European hpEurope strains. Furthermore, the fineSTRUCTURE analysis supports the MLST results by showing a low ancestry of the Iceman strain with the hpNEAfrica strain, a modern re-presentative of AE2. Comparative whole genome phylogeny based on 1121 core genes and 375 publicly available H. pylori genomes from Europe, Asia and Africa assigns the Iceman to Northern European genomes such as genomes from the United Kingdom and Sweden (Figure 4). Both these modern Northern European genomes and the Iceman share a high AE1 ancestry which let them group close to modern hpAsia2 genomes. Overall, we hypothesize that the Iceman’s strain belonged to a prehistoric European branch of hpAsia2 that is different from the modern hpAsia2 population from northern India.
Figure 4 Helicobacter pylori core genome tree based on 1121 core genes with 375 publicly available Helicobacter pylori genomes from Europe, Asia and Africa including the Iceman (in bold).
The core genome analysis was performed using the Roary pan-genome analysis tool followed by phylogenomic analysis using the PhyML tool v 3.1.
Previously it was assumed that the admixture of the two hpEurope ancestors AE1 and AE2 has happened in the Middle East or Western Asia between 10000 and 52000 years ago and hpEurope was introduced into Europe with the first Early Neolithic farmers. Our study, however, has shown that the Iceman’s H. pylori is a nearly pure representative of the bacterial population of Asian origin (AE1) that existed in Europe before hybridization. This suggests that the African ancestral H. pylori component (AE2) arrived in Europe within the past few thousand years, which is much more recent than previously hypothesized.
The power of modern molecular biology methods for studying ancient remains, coupled with the increasing repertoire of advanced data analysis approaches in computational biology, has enabled the reconstruction of ancient genomes of human pathogen including Helicobacter pylori. In contrast to the first PCR-based studies on ancient human remains open metagenomic studies with whole ancient pathogen genomes a completely new range of possible comparative sequence analysis that provide information to a yet unprecedented depth. Our metagenomic diagnostic approach and genome reconstruction revealed that the 5300-year-old H. pylori strain of the Iceman was a potentially virulent strain that is today strongly associated with gastric disease. Furthermore, comparative sequence analysis with modern H. pylori strains provided interesting insights into the ancestry and evolution of the pathogen and underlined the high complexity of ancient European population history. Encouraged by these first very promising results, we are currently collecting and analysing further ancient gut contents and coprolites. Overall, further ancient H. pylori genome sequences from all over the world will significantly contribute to our understanding of the basic biology and evolution of this important stomach pathogen and will add subsequently an additional chapter to the human demographic history from a bacterial point of view.
Manuscript source: Invited Manuscript
Specialty type: Gastroenterology and hepatology
Country of origin: Italy
Peer-review report classification
Grade A (Excellent): A, A
Grade B (Very good): B
Grade C (Good): C
Grade D (Fair): 0
Grade E (Poor): 0
P-Reviewer: Lee CL, Slomiany BL, Tarnawski AS, Zhu YL S-Editor: Tang JZ L-Editor: A E-Editor: Zhang YL
Haak W, Lazaridis I, Patterson N, Rohland N, Mallick S, Llamas B, Brandt G, Nordenfelt S, Harney E, Stewardson K, Fu Q, Mittnik A, Bánffy E, Economou C, Francken M, Friederich S, Pena RG, Hallgren F, Khartanovich V, Khokhlov A, Kunst M, Kuznetsov P, Meller H, Mochalov O, Moiseyev V, Nicklisch N, Pichler SL, Risch R, Rojo Guerra MA, Roth C, Szécsényi-Nagy A, Wahl J, Meyer M, Krause J, Brown D, Anthony D, Cooper A, Alt KW, Reich D. Massive migration from the steppe was a source for Indo-European languages in Europe.Nature. 2015;522:207-211.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 888][Cited by in F6Publishing: 424][Article Influence: 148.0][Reference Citation Analysis (0)]
Bos KI, Harkins KM, Herbig A, Coscolla M, Weber N, Comas I, Forrest SA, Bryant JM, Harris SR, Schuenemann VJ, Campbell TJ, Majander K, Wilbur AK, Guichon RA, Wolfe Steadman DL, Cook DC, Niemann S, Behr MA, Zumarraga M, Bastida R, Huson D, Nieselt K, Young D, Parkhill J, Buikstra JE, Gagneux S, Stone AC, Krause J. Pre-Columbian mycobacterial genomes reveal seals as a source of New World human tuberculosis.Nature. 2014;514:494-497.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 330][Cited by in F6Publishing: 195][Article Influence: 47.1][Reference Citation Analysis (0)]
Maixner F, Krause-Kyora B, Turaev D, Herbig A, Hoopmann MR, Hallows JL, Kusebauch U, Vigl EE, Malfertheiner P, Megraud F, O'Sullivan N, Cipollini G, Coia V, Samadelli M, Engstrand L, Linz B, Moritz RL, Grimm R, Krause J, Nebel A, Moodley Y, Rattei T, Zink A. The 5300-year-old Helicobacter pylori genome of the Iceman.Science. 2016;351:162-165.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 143][Cited by in F6Publishing: 88][Article Influence: 28.6][Reference Citation Analysis (0)]
Keller A, Graefen A, Ball M, Matzas M, Boisguerin V, Maixner F, Leidinger P, Backes C, Khairat R, Forster M, Stade B, Franke A, Mayer J, Spangler J, McLaughlin S, Shah M, Lee C, Harkins TT, Sartori A, Moreno-Estrada A, Henn B, Sikora M, Semino O, Chiaroni J, Rootsi S, Myres NM, Cabrera VM, Underhill PA, Bustamante CD, Vigl EE, Samadelli M, Cipollini G, Haas J, Katus H, O'Connor BD, Carlson MR, Meder B, Blin N, Meese E, Pusch CM, Zink A. New insights into the Tyrolean Iceman's origin and phenotype as inferred by whole-genome sequencing.Nat Commun. 2012;3:698.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 288][Cited by in F6Publishing: 179][Article Influence: 32.0][Reference Citation Analysis (0)]
Maixner F, Turaev D, Cazenave-Gassiot A, Janko M, Krause-Kyora B, Hoopmann MR, Kusebauch U, Sartain M, Guerriero G, O'Sullivan N, Teasdale M, Cipollini G, Paladin A, Mattiangeli V, Samadelli M, Tecchiati U, Putzer A, Palazoglu M, Meissen J, Lösch S, Rausch P, Baines JF, Kim BJ, An HJ, Gostner P, Egarter-Vigl E, Malfertheiner P, Keller A, Stark RW, Wenk M, Bishop D, Bradley DG, Fiehn O, Engstrand L, Moritz RL, Doble P, Franke A, Nebel A, Oeggl K, Rattei T, Grimm R, Zink A. The Iceman's Last Meal Consisted of Fat, Wild Meat, and Cereals.Curr Biol. 2018;28:2348-2355.e9.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 23][Cited by in F6Publishing: 7][Article Influence: 7.7][Reference Citation Analysis (0)]
Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data.Genetics. 2000;155:945-959.
[PubMed] [DOI][Cited in This Article: ]
Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.Genetics. 2003;164:1567-1587.
[PubMed] [DOI][Cited in This Article: ]