Search Article Keyword:  
PubMed Submission Abstarct PDF Cited  Click Count: 3815 DownLoad Count: 1442 

ISSN 1007-9327 CN 14-1219/R  World J Gastroenterol  2003 Aug;9(8):1646-1656

Progress in searching for susceptibility gene for inflammatory bowel disease by positional cloning

Chang-Qing Zheng, Gang-Zheng Hu, Zhao-Shu Zeng, Lian-Jie Lin, Gin-Ge Gu


Chang-Qing Zheng, Gang-Zheng Hu, Lian-Jie Lin, Gin-Ge Gu, Department of Gastroenterology, the Second Affiliated Clinical College of China Medical University, Shenyang 110001, Liaoning Province, China
Zhao-Shu Zeng, Department of Serology, College of Forensic Medicine, China Medical University, Shenyang 110001, Liaoning Province, China
Correspondence to: Professor Chang-Qing Zheng, Department of Internal Medicine, the Second Affiliated Clinical College of China Medical University, Shenyang 110001, Liaoning Province, China. zhengchangqing88@163.com
Telephone: +86-24-83956682    Fax: +86-24-83956682
Received: 2003-03-03    Accepted: 2003-04-11

Abstract
Inflammatory bowel disease (IBD) includes two clinical subtypes: Crohn disease (CD) and ulcerative colitis (UC). The general prevalence is about 1.0-2.0 % in Western countries. It is predominantly regarded as a multifactorial disorder involving environmental factors and polygenic defects. The view was confirmed by a lot of evidences from clinical attributions and animal models, especially from epidemiological investigations. So the etiological study of IBD has been focused on searching for susceptibility genes by positional cloning, which consists of two steps: linkage analysis and association analysis. Linkage analysis has been an important method of searching for susceptibility genes to polygenic diseases as well as single-gene disorders. IBD, as a polygenic disease, has been widely investigated by linkage analysis for susceptibility gene since 1996. The paper reviewed 38 articles, which covered almost all original researches in relation to IBD and linkage analysis. So far, several loci, such as 16q, 12q, 6p and 3p, have been identified by the studies. The most striking is 16q12 (IBD1), which linked only with CD not UC in the majority of studies. Association analysis, as one essential step for positional cloning, is usually carried out by genotyping candidate genes selected by means of linkage analysis or other methods, for figuring out the frequencies of alleles and comparing the frequencies between IBD group and healthy control group to identify the specific allele. It has been established that IBD is implicated in immune disorder. So the studies were centered on the genes of NOD2/CARD15, HLA-II, cytokine, cytokine receptor and adhesion molecule. This paper reviewed 14 original articles on association between NOD2 and IBD that have been published since 2001. All results, with the exception of one report from a Japanese group, provide evidences that the three kinds of variants of NOD2 are susceptibility factors for IBD. This article also comprehensively analyzed 18 original researches of HLA gene polymorphism in IBD. We found extensive discrepancy among the conclusions and a novel hypothesis was put forward to explain the discordance. Most studies published recently on association between IBD and cytokine gene polymorphism were reviewed.

Zheng CQ, Hu GZ, Zeng ZS, Lin LJ, Gu GG. Progress in searching for susceptibility gene for inflammatory bowel disease by positional cloning. World J Gastroenterol  2003; 9(8): 1646-1656
http://www.wjgnet.com/1007-9327/9/1646.asp

INTRODUCTION
Inflammatory bowel disease (IBD) is composed of two clinical subtypes: Crohn disease (CD) and ulcerative colitis (UC). Its general prevalence is 0.1-0.2 % in Western countries[1]. There has been no epidemiological investigation of large scale for the prevalence or incidence of IBD in China so far, despite the facts that UC is common in China and CD has been more frequently diagnosed by clinical physicians in recent years[2]. The pathogenesis of IBD has not been clearly identified. Today, the most generally accepted pathogenesis of IBD is that IBD is resulted from abnormal immune response to enteric bacteria in individuals with susceptibility due to genetically polygenic defects. Therefore, investigators have searched human genome for the loci of susceptibility genes by linkage analysis and have achieved great success. On the other hand, association analysis, as one of the essential steps for positional cloning, was carried out by many investigators to identify the specific allele. It has established that IBD is implicated in immune disorder. Biochemical substances involved in immunoregulation are very rich and the corresponding genes are widely distributed in genome. Genome-wide linkage analysis has suggested multiple candidate regions in several chromosomes for IBD, therefore, considerable numbers of candidate genes should be selected for association analysis. In recent years, these studies were centered on the genes of NOD2/CARD15, HLA-II, cytokine, cytokine receptor and adhesion molecule. These studies were summarized in this review.

IBD IS A MULTIFACTORIAL DISEASE
There have been a number of hypotheses about the pathogenesis of IBD, but neither environmental factors, such as habit of diet and behavior, infection of microorganisms and contact of chemical or physical pathogenic agents, nor single-gene genetic disorder alone can fully explain its complex phenotypes. Thereby, it is thought to be a multifactorial disease. The view was supported by a larger amount of evidences from clinical attributions and animal models, especially epidemiologic investigations and linkage analyses.

Persuasive evidences of genetic contribution to IBD [3-12]
A. The first-degree relatives of affected individuals show about 20-50-fold increased risk of developing the disease compared with the general population for CD, and 10-20-fold increased risks for UC. Moreover, the affected siblings frequently present at similar ages and concordance rates reach up to 80 % for disease site, behavior and presence of extraintestinal manifestation. B. Twin studies have shown that the concordance rate of CD is about 20-44 % for monozygotic twins, and 3.8-6.5 % for dizygotic twins; the concordance rate of UC is about 6-16 % and 3 % respectively[6,7]. C. There are significant ethnic differences in disease frequency. For instance, the prevalence in Ashkenazi Jews is much higher than that in other races, even though they share similar living environment in the same community[8,9]. D. All genome-scanning linkage analyses detected some linkage loci, certain of which were subsequently confirmed by replication studies only involving certain chromosomes; NOD2 was consistently identified as the susceptibility gene for CD in recent years. E. Simulation studies on animal models have showed that transgenic mice or gene-knockout mice are subject to colitis similar to human IBD, and that spontaneous colitis or hapten-induced colitis manifests fairly different in different strains of mice[10-12].

Evidence of environmental contribution
A. The concordance rate of IBD for monozygotic twins is much less than 100 %. The identical genotype with different phenotypes means that environmental factors take part in the pathogenesis of the disease[6,7]. B. Intestinal bacteria are suggested as the main environmental contributions demonstrated by many evidences: antibiotic therapy can usually induce temporary remission for most IBD cases[13], diversion of faeces stream can make distal improvement in patients with CD[14], some studies suggested that certain strains of intestinal bacteria were associated with IBD[15,16], colonization with normal enteric bacterial flora was required for the occurrence of disease in animals with CD irrespective of the underlying defect[10-12]. C. smoking is likely to be associated with the progress of IBD[17,18]. D. Migrant epidemiological studies demonstrated that population of identical ethnic background, when lived in different communities, showed discordant incidence[8,9,19,20].

IBD is not a disorder of simple mendelian inheritance [3,21-25]
Genetic disease of classic Mendelian model, which consists of Mendelian dominant and recessive genetic disorders, is a phenotype of single-gene defect and called single-gene disorder. IBD has previously been interpreted as genetic disease of Mendelian recessive model. But segregation analyses offered counter-evidence that IBD followed the principle of Mendelian inheritance. Parents of most IBD probands were healthy, frequency of siblings or children of the patients was much less than 50 %, the decline in frequency of affected second-degree relatives compared with first-degree relatives was greater than that predicted by autosomal dominant inheritance, in which the frequency was expected to decrease by 1/2 with each step. Incidence of IBD in children of affected spouses was sharply less than 100 % and a similar proportion of affected siblings and children of affected probands was inconsistent with autosomal recessive inheritance. Linkage analysis has detected several linkage loci that are distributed on a number of chromosomes.

LINKAGE ANALYSIS
It is very difficult to find the biochemical substances, which express qualitative difference between patients and healthy population by means of classical functional cloning. So linkage analysis, as the first step of positional cloning, may serve as a unique and practicable substitution for the time being. Figuring out genetic distance between marked loci and susceptibility gene by means of pedigree investigation and genotyping, then defining the approximate position of susceptibility gene in genomic map are the essential courses of linkage analysis. The dramatic progress of human genome project, which has located nearly 10 000 marker loci in genomic map, has greatly boosted positional cloning for complex genetic diseases. Epidemiological studies have identified striking genetic contributions to the etiology of IBD, but so far, studies with traditional biochemical methods have not yet identified the products with quantitative defects. Many investigators have turned to linkage analysis and have achieved great success. The important data from 38 original researches, which covered almost all articles in relation to IBD and linkage analysis that have been published since 1996, are listed in Table 1[26-63], and some aspects were reviewed as follows.
      The common course of linkage analysis for IBD is: collecting families with affected sibling pairs (ASP) or affected relative pairs (ARP) ≥2 by strict ascertainment, genotyping of genome-wide or certain chromosomes according to microsatellite polymorphisms, figuring out multi-point maximal non-parametric LOD score (MLOD) and two-point LOD score by means of statistical software, inferring genetic distances of susceptibility genes to marker loci and locations in physical genome map, offering candidate genes for association analysis. The majority of investigations found certain suggestive linkage loci with various LOD score, but when defined according to different LOD thresholds, the locations or number of linkage loci were variable. In view of the traits of statistical software and quantity of subjects in most studies, we only displayed the results with MLOD ≥2.0 or 3.0, represented by  and +. The chromosomes, on which the linkage loci strongly supported (MLOD≥3.0) by at least one of 8 linkage analyses of genome-wide scanning located, include chromosomes 1, 3, 5, 6, 7, 12, 14, 16, 18 and 19, as well as chromosomes 4, 10, 17 and x with suggestive evidence (2.0≤MLOD<3.0). Although there was striking discrepancy among the genome-wide scans in respect of linkage loci, almost all studies detected more than 3 linkage loci. This shows that the pathogenesis of IBD is involved in multiple genes and manifests obvious genetic heterogeneity. Several loci were supported by relative more studies, such as 16q, 12q, 6p, and 3p. Because Hugot et al [26] and Satsangi et al [27] detected strong linkage evidences for chromosomes 16, 12, 6, 3 and 7 in 1996, subsequent studies mainly focused on these chromosomes. It can be seen from Table 1 that more evidences were offered for these loci, with the exception of 16q, simply because these loci were investigated by more studies. Some loci supported by certain genome-wide scans, such as 14q, 5q, 19p, likely to harbor susceptibility genes, were less investigated.
      Stratification studies demonstrated significant variances as to the degree and loci of linkage between families with severe IBD and those with only slight IBD, male patients and female patients, Jewish people and non-Jewish people, as well as between UC and CD. Some investigators examined families with CD patients only; others examined families with UC patients only, but most studies detected both families and those with mixed patients and compared the differences of linkage loci between the two groups. As shown in Table 1, there were some differences between UC and CD. The most striking is 16q12 (IBD1), which linked only with CD not UC in the majority of studies. This shows that CD and UC have some common susceptibility genes, as well as certain individual susceptibility genes. Three studies[32,47,48] found that certain loci linked only with the families with early onset of CD. All subjects examined by the studies listed in Table 1 included Caucasian or Jewish patients from Europe, Australia and northern America, but no Mongolian patients. Three studies[28,29,42] demonstrated significant differences between Jewish patients and non-Jewish patients. In respect of nationality of patients, it seems there are no remarkable differences among American, English, German, Australian, Canadian, Italian, Dutch and Belgian. But Paavola et al [40] examined chromosomes 1, 3, 7, 12, 14 and 16 in Finnish patients and did not find linkage loci. Fisher et al[33] found that some loci on chromosomes 6p, 1, 14 and 18 linked only with IBD of male sufferer. These results confirm the extensive genetic heterogeneity of IBD.
      Linkage analysis is intended as an essential tactic to offer candidate genes for association analysis. We should focus our attention on the linkage loci containing some candidate genes, products of which have been suggested as pathogenic factors by other methods, as well as confirmed by subsequent replication studies. The loci meeting these conditions were briefly reviewed here.

Table 1  Data of linkage analysis

Ref Author Year Subject Scope Linkage loci for IBD Linkage loci for CD Linkage loci for UC
R26  Hugot JP 1996 Caucasian CD Autosome 16q(IBD1)+
R27  Satsangi J 1996 Northern european IBD Autosome 7+, 12+, 3± 7, 12, 3± 7+
R28  Cho JH 1998 American IBDa Genome (3q+,1p)(non-Jewish),(3q,4q)e 16± - -
R29  Ma Y 1999 American CDa Genome  14q,17q±e,5q±e
R30  Hampe J 1999 European IBDb Genome 1, 6, X± 10, 12, 16± 4, X±
R31  Duerr RH 2000 American CDa Genome 14q+
R32  Rioux JD 2000 Canadian IBD Genome 19p+, 5q+, 3p, 6p± 5q+f , 19p+ 19p±
R33  Fisher SA 2002 European IBD Genome (6p+,1+,14+,18+)(male) 6p+(male) 6p+(male)
R34  Brant SR 1998 American CDa 3,7,12,16 16q(IBD1)±
R35  Rioux JD 1998 Toronto IBD 3,7,12,16 - - - - - -
R36  Curran ME 1998 European IBDb 12,16 - - 12q± - -
R37  Annese V 1999 Italian IBD 3,6,7,12,16 16q± 16q± 16q±
R38  Vermeire S 2000 Belgian CD 3,7,12,16 - -
R39  Dechairo B 2001 European IBD 3,6,7, 6p+ - - - -
R40  Paavola P 2001 Finnish IBD 1,3,7,12,14,16 - - - - - -
R41  Gavanaugh J 2001 IBDc 12,16 16q+ 16q+ - -
R42  Ohmen JD 1996 American IBDa 16 - - 16q+e - -
R43  Parkes M 1996 English IBD 16 - - 16q± - -
R44  CavanaughJA 1998 Australian CD 16 16q+(IBD1)
R45  Mirza MM 1998 Northern european UC 16 16q(IBD1)
R46  Porabosco P 2000 Italian IBD 16 16q+ 16q+ 16q+
R47  Brant SR 2000 American CD 16 16q+f , 16q±
R48  Akollar PN 2001 Jewish CD 16 16q+f
R49  Van Heel DA 2002 European CD 16 - -
R50  Zouali H 2001 European CD 16 16q+
R51  Hampe J 2002 European IBD 16 16p± 16q+, 16p± - -
R52  Satsangi J 1996 European IBD 6(MHC-II)  - - - - 6p(MHC-II)+
R53  Silverber MS 1999 Canadian CD 6 6p(MHC-II) ±
R54  Hampe J 1999 Northern european IBDb 6 6p+ 6p+ 6p+
R55  Yang H 1999 American CD 6 6p(MHC)+
R56  Duerr RH 1998 Northern american IBD 12 12q± - - - -
R57  Yang H 1999 American IBD 12 - - 12q± - -
R58  Lesage S 2000 Northern european CDd 12 - -
R59  Parkes M 2000 American IBD 12 12q+ - - 12q +
R60  Hampe J 2001 Northern european IBDb 3 - - - - - -
R61  Duerr RH 2002 American IBD 3 3p+ - - - -
R62  Rioux JD 2001 American CD 5q 5q+
R63  Vermeire S 2001 Belgian CD X Xq±

Note: CD, CD-only family; UC, UC-only family; IBD=UC+CD+mixed family; +, convincing linkage (LOD3.0);± , suggestive linkage (3.0>LOD2.0); - -, no suggestive linkage (LOD<2.0); a, including Jewish; b, family from English, German and Dutch; c, family from Northern American, European and Australian; d, family from French and Belgian; e, linkage only for Jewish; f, linkage only for IBD with early onset.

Chromosome 16  As shown in Table 1, 14 out of 25 related studies found linkage loci for CD with MLOD more than 2.0 on the chromosome, additionally, some loci with suggestive score (MLOD between 1.0 and 2.0) were detected. Only 3 studies found linkage loci with UC, furthermore, 2 of them also detected linkage with CD, and the other one merely examined UC families. It can be inferred from these studies that chromosome 16 contains susceptibility gene for CD rather than UC. Chromosome 16 is comparatively short, with 98 Mb of physical length, 130.8 cm of genetic distance, and has been spaced by about 200 microsatellite markers[64]. The linkage loci suggested by the studies in Table 1 were distributed in most part of chromosome 16 (for instance, D16S409-419[26], D16S748-764[28], D16S411[42,43], D16S3136[50]), but only pericentromeric region on 16q was the most consistent linkage region. The important candidate genes in the region are NOD2, CD11integrins, CD19, Sialophorin, IL-4 receptor gene etc. NOD2 gene has been established as susceptibility gene to CD. It remains unanswered if there are other susceptibility genes in the chromosome. Hampe et al[51] examined additional regions with high-density experiment using 39 microsatellite markers and found three-peak logarithm of odds (LOD) scores of 2.7, 3.2, and 3.1 on proximal 16p, proximal 16q, and central 16q, respectively. Taking account of the differences of suggestive markers, it is probable that there are other susceptibility genes for CD in the chromosome.

Table 2  Data of association analysis for NOD2

Ref year author

Subject

Main conclusion

R65, 2001

Europe CD, UC

A. Find P241S, R432R, R675W, G881R, IVS8-133delAinSCT, 980fs etc.31 variants

Hugot, JP

 

B. R675W, G881R, 980fs with CD, not with UC

 

 

C. CD-GRR 3.0 at SHEM, 38.0 at HOM, 44.0 at CHEM

R66, 2001

American CD

A. 3020insC with CD

Ogura Y

 

B. CD-GRR 1.5 at SHEM, 17.6 at HOM

R67, 2001

German, English CD

A. 3020insC with CD, not with UC

Hampe J

UC

B. CD-GRR 2.6 at SHEM, 42.1 at HOM

R68, 2002

Europe CD, UC

A. Find 67 sequence variants, 9 of which gene frequency >5 %

Lesage S

 

B. R702W, G908R, 3020insC with CD, not with UC

 

 

C. Support gene-dosage effect

R69, 2002

Europe CD, UC

A. R702W, G908R, 3020insC with CD, especially ileum CD, not with UC

Cuthbert AP

 

B. P628S linkage disequilibrium with the other three mutations

 

 

C. CD-GRR 3.0 at SHEM, >22.0 at HOM or at CHEM

 

 

D. Mutation frequency: familial CD > sporadic CD

R70, 2002

Dutch CD

A. 3020insC with CD, G2722C not with CD

Murillo L

 

B. No association with clinical phenotype

R71, 2002

German, Norwegian

A. R675W, G881R, 980fs with CD

Hampe J

CD

B. Especially with ileum CD

R72, 2002

Canadian CD

A. R702W, G908R, 1007fs with CD, especially ileum CD

Vermeire S

 

B. No difference between familial CD and sporadic CD

R73, 2002

German UC, CD

A. 3020insC with CD, not with UC

Radlmayr M

 

B. Association with fistula, fibrostenosis, ileocecum resection

R74, 2002

 

Japanese CD, UC A. R675W, G881R, 3020insC not with CD or UC

Inoue N

 

 

R75, 2002

Europe CD

A. R675W, G881R, 3020insC with CD

Vermeire S

 

B. Not with effect of Infliximb

R76, 2002

American CD

A. R702W, G908R, 1007fs with CD

Abreu MT

 

B. With fibrostenosis

R77, 2002

English CD

A. R702W, G908R, 1007fs with CD, especially ileum CD

Ahmad T

 

B. 3020insC with early onset of CD

 

 

C. CD-GRR 2.4 at SHEM, 9.8 at HOM, 29.3 at CHEM

R49, 2002

Europe CD

A. R702W, 1007fs with CD, linkage disequilibrium with P628S

Van heel DA

 

B. Support gene-dosage effect

  Chromosome 12  Six out of 19 studies found suggestive linkage loci in the chromosome, 4 of them for CD, and one for UC. Though the studies with suggestive MLOD are rare, several studies found linkage loci with slightly suggestive significance (MLOD between 1.0 and 2.0) with IBD, especially with UC. This may result from the fact that the sample sizes in most studies were not large enough; furthermore, they were predominantly consisted of CD families. Parkes et al[59] examined 581 affected relative pairs, of which 252 were from CD-only families, 138