Background

The Guinea pig (Cavia porcellus) is one of the most extensively used animal models to study infectious diseases. However, despite its tremendous contribution towards understanding the establishment, progression and control of a number of diseases in general and tuberculosis in particular, the lack of fully annotated guinea pig genome sequence as well as appropriate molecular reagents has severely hampered detailed genetic and immunological analysis in this animal model.

Results

By employing the cross-species hybridization technique, we have developed an oligonucleotide microarray with 44,000 features assembled from different mammalian species, which to the best of our knowledge is the first attempt to employ microarray to study the global gene expression profile in guinea pigs. To validate and demonstrate the merit of this microarray, we have studied, as an example, the expression profile of guinea pig lungs during the advanced phase of M. tuberculosis infection. A significant upregulation of 1344 genes and a marked down regulation of 1856 genes in the lungs identified a disease signature of pulmonary tuberculosis infection.

Conclusion

We report the development of first comprehensive microarray for studying the global gene expression profile in guinea pigs and validation of its usefulness with tuberculosis as a case study. An important gap in the area of infectious diseases has been addressed and a valuable molecular tool is provided to optimally harness the potential of guinea pig model to develop better vaccines and therapies against human diseases.

Pioneering research of Robert Koch in the guinea pigs (Cavia porcellus) laid the foundation of microbiology and popularized this animal model for the study of various infectious diseases[1]. Since then, guinea pig has proven to be an indispensable tool in studying a range of infectious diseases, such as tuberculosis (TB) (Mycobacterium tuberculosis), anthrax (Bacillus anthracis), diphtheria (Corynebacterium diphtheriae), legionellosis (Legionella pneumophila), sexually transmitted diseases such as chlamydia infection (Chlamydia trachomatis), syphilis (Treponema pallidum) etc.[1]. These studies showed that guinea pigs bear substantial similarity to humans with respect to thymic and bone marrow physiology, innate immune responses, complement system, lung physiology, corticosteroid response, requirement of exogenous source of vitamin C and delayed-type hypersensitivity reaction to infections[1, 2]. Due to these features, guinea pigs emerged as one of the best models for testing bio-defense agents and vaccines and for toxicological studies[1]. However, despite the tremendous contribution of this animal to medical research, the paucity of immunological reagents and molecular tools and the lack of fully annotated guinea pig genome sequence has severely hampered a holistic analysis of host responses in this model. Unlike the mouse model, gene deletion technology (for example, gene knockout and knock-in) and trans-gene expression is also unavailable in case of guinea pigs. Thus, considering the biological relevance of guinea pig model in a large number of infectious diseases, the importance of developing a comprehensive guinea pig microarray cannot be over emphasized. Recently, Tree and colleagues developed an oligonucleotide microarray comprising of 86 genes to study the host response to BCG vaccination in guinea pigs, however, despite its usefulness in immune response studies, it has limited applications due to a small number of genes in the array[3, 4]. In this study, we have overcome this limitation by extending the cross-species hybridization technique to a number of mammalian species such as Human, Mouse, Rat, Macaque, Horse, Cat, Sheep, Pig, Chinchilla, Chimpanzee, Gray tailed opussum and Cattle in order to expand the number of features and have developed a 44 K guinea pig oligonucleotide microarray (GPOM). Further, to validate the array, the transcriptome of guinea pig lungs was analysed post - M. tuberculosis infection. Global gene expression profiling in this study not only facilitated the analysis of immunologically relevant genes but also helped in describing the transcriptional signature of pulmonary granulomas in guinea pigs represented by several key genes and commensurate pathways that are modulated in response to M. tuberculosis infection.

Microarray design and annotation

For gene expression profiling of species that lack genome sequence and/or representative microarray platforms, cross-species hybridization based microarray has conventionally been used. Since, fully annotated guinea pig genome sequence is not available, we employed cross-species hybridization technology to develop a 44 K microarray platform to study gene expression profile in guinea pigs. As described in the Additional file1 and Supporting Table S1, initially a 244 K microarray was designed to contain 60 mer oligonucleotide probes from multiple mammalian species (human, mouse, rat, guinea pig, rhesus monkey, dog, horse, cat, sheep, pig, chimpanzee, chinchilla, gray-tailed opossum and cattle) based on all the probe sequences available from Agilent Catalogue arrays and NCBI mRNA sequences. Especially, the array included 1132 probes based on annotated gene sequences of guinea pig and 92,815 probes corresponding to guinea pig ESTs. The 244 K array was then hybridized with Cy3 labeled cRNA produced from pooled RNA obtained from various guinea pig tissues (lung, liver, spleen, brain, muscle, kidney and bone marrow) and Cy5 labeled genomic DNA isolated from guinea pig spleen tissue. Following hybridization, the array was scanned and features were extracted. The filtration criteria during the probe selection, while developing microarray by cross-species hybridization technology on Agilent platform, are based on comparison of specific signal intensity viz. the background signal intensity. Probes exhibiting significantly higher signal intensity (p < 0.05), at least 2 fold higher as compared to the background are selected for array development. Based on this criterion, a total of 20,023 out of 62,560 probes representing different mammalian genes were selected from the 244 K array. Similarly, a total of 9,823 out of 92,815 probes were selected for ESTs. However, irrespective of the intensities, all the 1,132 probes for guinea pig were included. Further, an additional of 12,825 best probes out of 19,975 newly added guinea pig EST’s from NCBI database were added to the 44 K array (Table1). Thus, the final design of the guinea pig 44 K microarray comprised of a total number of 45,220 features including 29,846 valid features from different mammalian species (Figure1), 1,132 probes for guinea pig transcripts and 12,825 probes for guinea pig ESTs, 1,264 Agilent positive controls and 153 Agilent negative controls. Agilent positive and negative controls are standard set of probes employed by the Agilent microarray platform for mammalian microarray studies. The negative control probes are intended to have no hybridization and these are used by feature extraction software for background determination. The positive controls are used to have predictable signals, which are used for monitoring the microarray linearity, sensitivity and accuracy. Based on the above-mentioned method for probe selection, many genes are represented by multiple but unique probes derived from different mammalian species. Use of multiple unique probes per transcript in general increases the confidence of microarray result. The averaging of the signals from multiple probes provides improved statistical confidence, reducing the impact of inconsistent probe behavior and improving the signal to noise ratio compared to the platforms that offer fewer probes per gene. For biological interpretation, homolog Gene ontology annotation was also obtained for all the probes by blast-based homology to reference sequence database of human, mouse and rat for which validated methods for biological pathway analysis are available.

Table 1

Probe distribution in 44 K GPOM: The table depicts the number of features derived from various mammalian species that have been used for designing the 44 K GPOM

Organism

Sequence data source

Number of Probes in 44 K Array

Human (Homo sapiens)

Agilent Catalogue Arrays

8964

Mouse (Mus musculus)

Agilent Catalogue Arrays

4889

Rat (Rattus norvegicus)

Agilent Catalogue Arrays

2863

Rhesus Monkey (Macaca mulatta)

Agilent Catalogue Arrays

3667

Dog (Canis familiaris)

Agilent Catalogue Arrays

6000

Horse (Equus caballus)

NCBI, mRNA sequences

186

Cat (Felis catus)

NCBI, mRNA sequences

64

Sheep (Ovis aries)

NCBI, mRNA sequences

164

Pig (Sus scrofa)

NCBI, mRNA sequences

1744

Guinea pig (Cavea porcellus)

NCBI, mRNA sequences

13957**

Chinchilla lanigera

NCBI, mRNA sequences

16

Chimpanzee (Pan troglodyte)

NCBI, mRNA sequences

201

Gray tailed opussum (Monodelphis domestica)

NCBI, mRNA sequences

25

Cattle (Bos taurus)

NCBI, mRNA sequences

1063

Total Number of probes

43803

** 1132 + 12825 EST.

Figure 1

Distribution of probes in 44 K GPOM. The figure depicts the % distribution of oligonucleotide probes present in the 44 K guinea pig oligonucleotide microarray. The 60mer oligonucleotide probes were designed based on several mammalian species including human (Homo sapiens), mouse (Mus musculus), rat (Rattus norvegicus), rhesus monkey (Macaca mulatta), dog (Canis familiaris), horse (Equus caballus), cat (Felis catus), sheep (Ovis aries), pig (Sus scrofa), chimpanzee (Pan troglodyte), chinchilla (Chinchilla lanigera), gray-tailed opossum (Monodelphis domestica), cattle (Bos taurus) and guinea pig (Cavia porcellus). The % representation of a particular species is calculated with respect to the total number of probes in the array. The figure does not show some of the mammalian species separately for which, the % representation is < 1% and are collectively labelled as – others. Number wise distribution of probes from all the species is given in Table1.

We next compared the pulmonary gene expression profile of infected and uninfected guinea pigs by employing the GPOM developed in this study. As depicted in Additional file2, clustered heat maps were obtained for all the genes on the 44 K GPOM in case of infected guinea pigs compared to uninfected control. Since, a small perturbation in the gene expression may considerably influence the biological response, a small difference in the fold change in gene expression are also relevant. Hence, genes exhibiting ≥1.5 fold difference in gene expression with a statistical significance of p < 0.05 were considered as differentially regulated. The rationale behind the selection of the cutoff for considerable fold difference while comparing the gene expression is based on standard practice in microarray data analysis[5–10].

Based on this, several unique genes were identified that exhibited a significant regulation in response to infection. While, 1344 unique genes exhibited a marked up regulation, 1856 genes were significantly down regulated in the lungs of infected guinea pigs as compared to the lungs of uninfected animals as depicted by a heat map in Figure3. The genes exhibiting significant regulation are listed in Additional file3.

Figure 3

Pulmonary gene expression signature of guinea pigs at 10 weeks postM. tuberculosisinfection. Transcriptional profile of lungs of guinea pig was analysed by microarray. The figure depicts the clustered heat maps obtained thereof for the genes expressed in a differential manner between the experimental and control groups. By using unsupervised hierarchical clustering algorithm, the most similar expression profiles are joined together to form a group. These are further joined in a tree structure, until all data forms a single group. Clustering is based on Average- distance between two clusters, which is the average of the pair-wise distance between entities in the two clusters. For the measurement of similarity between conditions, Pearson coefficient correlation clustering algorithm is used. The color scheme for the hierarchical clustering is yellow: no change in expression, magenta: higher expression in infected lungs relative to normal lungs and green: lower expression in infected samples relative to normal uninfected lungs. 1: Uninfected lung; 2: Infected Lung 1; 3: Infected Lung 2; 4: Infected Lung 3.

Differentially regulated genes were further classified in to different categories based on their direct or indirect involvement in various biological processes or pathways. Based on this categorization, a significant alteration was observed in the expression of several important genes related to metabolic pathways, cell signaling, immune response and other miscellaneous functions (Additional file4). Some of the key pathways are listed in Tables2,3 and4.

Validation of microarray results by real time RT-PCR

For the validation of 44 K GPOM, 5 genes were selected based on the following criteria: (i) differential regulation, (ii) immunological relevance and (iii) availability of cDNA sequence in the NCBI database. Expression of these genes was analyzed on the same RNA samples, which were used for the microarray study by semi- quantitative real time RT-PCR by employing SYBR green PCR Master Mix (Applied Biosystems) with 18S as the internal control. The primer sequences of C3AR1, CAMP, CCL5, IFNγ, C4BPA and 18S rRNA genes were designed using the guinea pig gene specific cDNA sequences available in the public database (NCBI) based on the recommended guidelines for designing real time PCR primers (Primer express software, Applied Biosystems]. Sequences of the primers are described in Supporting Table S2 in Additional file1. The primers were experimentally validated for two standard quality control criteria, (i) single amplicon specificity by analysis of dissociation curves and (ii) consistently high amplification efficiency by analysis of calibration curve. For analysis of real time PCR data ΔΔCt method was employed[1]. Consistent with our microarray results, wherein several probes corresponding to C3AR1, CAMP, CCL5 and IFNγ exhibited a down regulation along with up regulation of C4BPA, real time PCR results also matched the microarray expression profile (Supporting Table S2 in Additional file1).

Guinea pig model has made tremendous contribution towards the understanding of several infectious and non-infectious human diseases[1, 2]. Moreover, in case of TB, guinea pig has emerged as the most preferred and biologically relevant model to investigate TB pathogenesis and therapy[11]. The success of M. tuberculosis as an intracellular pathogen is primarily attributable to its ability to reside in human lungs inside hypoxic granulomas in a dormant stage for years or even decades. However, the conventional mouse model for TB does not form hypoxic granulomas, which serve as the primary host defense mechanism for the containment of infection and is the central feature of TB pathogenesis in humans. Thus, due to its ability to produce hypoxic granulomas guinea pig as a surrogate model fulfills an important niche in the field of TB. However, the lack of a fully annotated genome sequence remains a major impediment towards the realization of full potential of guinea pig as an animal model. Moreover, till date not even a single study has investigated the genome-wide transcriptional response associated with M. tuberculosis infection in guinea pigs. Thus, development of a microarray platform to understand the host responses in guinea pigs is a critical step towards harnessing the true potential of this model.

In the present study, by employing cross-species hybridization technique, we have developed an oligonucleotide microarray with 44,000 features assembled from different mammalian species, which to the best of our knowledge is the first attempt to employ microarray to study global gene expression profile in guinea pigs. In order to demonstrate the utility of this microarray and to gain insight into the host responses, we have carried out the expression profile of guinea pig lungs at 10 weeks post-infection with M. tuberculosis. At this stage of infection, guinea pigs exhibit advanced stage of TB disease with multiple coalescing granulomas along with caseation and liquefaction necrosis in the lungs[2]. Hence, the gene expression profile observed in this study represents a transcriptional signature of advanced progressive TB disease in guinea pigs.

In our study, the pulmonary transcriptional profiling of M. tuberculosis infected guinea pigs revealed a significant regulation of 3200 unique targets. While, 1344 unique genes exhibited a marked up regulation, 1856 genes were significantly down regulated. Differentially regulated genes were further classified into different categories based on their direct or indirect involvement in various biological processes or pathways. A massive re-alignment of metabolic pathways, mostly associated with catabolism, emerged as one of the interesting themes from this analysis. Although, altered metabolic functions of the host have earlier been reported in human subjects and in laboratory animals in response to febrile infections involving wasting of body tissues[12], most studies have been restricted to biochemical and in silico analysis and have not looked beyond immune mechanisms in order to probe the underlying cause of pathology and active disease in case of TB[13, 14]. Only recently, Mi-Jeong and colleagues reported a correlation between caseation of human TB granulomas with elevated host lipid metabolism in these infected tissues by microarray[15]. Extensive necrosis observed in the pulmonary granulomas in our study as well as a marked up regulation of several of these lipid homeostasis related genes, such as, ABHD2, ABHD8, ACSL1, ACSL5, CYP27A1, CYP2B18A, CYP26B1, CYP2F1, CYP2A13, CYP1A2, CYP11A1, CYP2D40, CYP2F1, FDPS, HADHA and LPL correspond well with the observations associated with human caseous granulomas[15]. On comparing the entire list of up and down regulated genes from our guinea pig study with that obtained from human TB granuloma study [GEO Accession no. GSE20050][15], we observed that 38% of the up regulated genes of guinea pig [512 out of 1344 genes] exhibited an overlap with the genes up regulated in humans (Figure4). Further, on comparing the microarray data available in the public database for TB infection in case of humans [GEO Accession no. GSE20050][15], mouse [GEO Accession no. GSE15335][16] and non-human primates [GEO Accession no. GPL10183][17], while, the non-human primates and humans exhibited a 19% overlap between up regulated genes, the overlap between mouse and humans was 18% (Figure4 and Additional file5). The guinea pig model is known for its close similarity to humans in terms of pathological response to M. tuberculosis infection[2]. Our observations indicate that guinea pigs also exhibit higher resemblance to humans in terms of transcriptional response to M. tuberculosis infection, which further validates it as an excellent animal model to study TB. Hence, findings of this study would have a direct implication towards the development of novel therapeutic interventions. Besides, it would also permit the development and validation of biomarkers for effective vaccines and drugs in guinea pig model.

Figure 4

Comparison of transcriptional response of guinea pig, human, non-human primate and mouse toM. tuberculosisinfection. The Venn diagrams depict the degree of overlap between up regulated genes of (A) Guinea pig, human and non-human primates and (B) Guinea pig, human and mouse. The analysis included comparison of the list of differentials obtained from our study with that obtained from various microarray data available in the public database for TB infection in case of; human [GEO Accession no. GSE20050], mouse [GEO Accession no. GSE15335] and non-human primate [GEO Accession no. GPL10183]. Down regulated genes did not show any considerable overlap across the species, hence not depicted in the figure.

Induction of catabolic processes with consequential ATP accumulation has recently been shown to provide an interface between metabolism and host defense to infection[18–20]. ATP molecules generated in response to injury to airway epithelial cells have been reported to be a critical determinant of cell migration and repair following the injury and have been shown to be associated with the activation of down-stream signaling cascades and induction of IL-1β through the interaction of ATP with purinergic receptors[21]. A few in vitro studies have also indicated the role of ATP mediated macrophage apoptosis in killing of M. tuberculosis[19]. A concurrent up regulation in the expression of oxidative phosphorylation related genes (expected to result in increased ATP levels), purinergic receptors and IL-1β in this study to the best of our knowledge, provides the first in vivo evidence for the involvement of these pathways in TB. Further, the lungs of the infected guinea pigs also exhibited a marked perturbation in the expression of several key genes associated with chemokine signaling (CCL27, CCL5, CXCL9, CXCR3, CCL21 and CCL11), cell adhesion molecules (CAMs) (HLA, ALCAM, MPZL1, CADM3, CADM1, CD34, CD8A, CD99, CDH3, CLDN4, CLDN6, NCAM1, ITGB2, ITGB8 and ITGA9) and cytokine and cytokine receptors (IL1β, IL1RAP, IL2RG, IL8, IL9, IL23A, IL23R, TGFB1, TGFB3, IFNGR2, TNFα, TNFSF10, CSF1R, BMP4, BMP8A, BMPR1A, BMPR2, LTA and ACVR2A), which are known to contribute to leukocyte trans-endothelial migration, inflammation and granulomatous pathology.

Perturbation in the cellular signaling pathways is another typical theme that emerged from our study. The most prominent observation relates to the repression of numerous genes related to MAPK, Wnt and calcium signaling pathways. These observations are consistent with previous studies, which have suggested that modulation of MAPK signaling pathway along with the reduction in the levels of intracellular calcium are some of the important means by which, M. tuberculosis represses phagosome - lysosome maturation and pro-inflammatory responses at the site of infection[22, 23]. MAPK signaling is known to be crucial for the anti-bacterial response of the host and it also represents a strategic target for bacterial subversion tactics[24]. Thus, dampening of the MAPK signaling has emerged as a key to achieve alteration in the antibacterial phenotype of macrophages. Recently, Wnt signaling pathway has been implicated in the generation of long-lived multi-potent memory T cells and in the modulation of inflammatory response of macrophages to M. tuberculosis infection[25, 26], thus repression of Wnt signaling pathway observed in this study suggests a possible mechanism by which, M. tuberculosis inhibits effective T cell memory response.

Another key observation from this study relates to the modulation of several key genes involved in the re-modeling of extracellular matrix such as, COL6A2, COL14A1, COL12A1, MMP24, TIMP1, SERPING1, SERPINB1, ADAMTS1, ADAMTS7, MMP1, PITRM1, SERPINA3N, SERPINB6, SERPINE2, SERPINH1 and CNDP2. These observations are in agreement with the previous studies, which have implicated these genes in dual events associated with tissue remodeling as well as tissue damaging[27–30]. The presence of extensive necrosis along with thick bands of collagen observed in the guinea pig pulmonary granulomas indicates that the balance is heavily tipped in the favor of tissue damaging events. Increased expression of the complement receptor CR2 and numerous key genes involved in phagocytosis and antigen presentation as observed in this study further substantiates that the pathogen exploits the normally effective defense system to its advantage by subverting or co-opting these pathways as has been also reported earlier[17, 22].

This study reports for the first time the development of a 44 K oligonucleotide microarray for guinea pigs and provides an important tool to capture the genome wide transcriptional changes in this model. The transcriptional profiling of M. tuberculosis infected guinea pig lungs not only revealed modulation of key immunologically relevant genes but also demonstrated involvement of novel metabolic and signaling pathways in TB pathogenesis. Moreover, in silico analysis revealed a higher resemblance of guinea pigs to humans in terms of transcriptional response to M. tuberculosis infection when compared to mouse and non-human primates. Development of the 44 K GPOM is thus, a critical step towards characterization of the guinea pig model, which will greatly aid in improving our understanding of host responses to a number of infectious diseases. We believe that optimal use of guinea pig model and further research on its biology would generate tremendous opportunity to understand host-pathogen interaction and thus, help in the development of new therapeutic intervention strategies.

Ethics statement

Guinea pig experiments were reviewed and approved by the Institutional Animal Ethics Committee of University of Delhi South Campus, New Delhi, India (Permit number: 159/1999/CPCSEA). All procedures with the infected animals/tissues were performed in a Biosafety level three (BSL-3) containment facility at the University of Delhi South Campus, according to the approved protocols. All animals were routinely cared for according to the guidelines of CPCSEA (Committee for the Purpose of Control and Supervision on Experiments on Animals), India with a 12 hr light/dark cycle (0600–1800) maintained at 25 degrees Celsius with a relative humidity of 50%.

Experimental animals, infection and study design

Pathogen free 6–8 weeks old (200-300 g) female outbred guinea pigs (Dunkin Hartley strain) used for the microarray studies were procured from Disease Free Small Animal House Facility, CCS Haryana Agricultural University, Hisar, India. The infected and uninfected animals were housed separately in individually ventilated cages (2 animals/cage) and were provided with ad libitum food and water in the BSLIII facility at University of Delhi South Campus, India. Guinea pigs were infected by using the method as described previously[31]. Briefly, M. tuberculosis (H37Rv strain, ATCC no. 25618 procured from AIIMS, New Delhi, India) was grown to mid-log phase in Middle Brook 7 H9 media (0.05% Tween80 and 0.5% Glycerol) and stocks were prepared as described[32]. The CFU of stocks was enumerated by plating 10 fold serial dilutions on 7 H11 agar (1XADC and 0.5% glycerol). By using pre-calibrated infection parameters, guinea pigs were infected in the inhalation exposure system, (Glas-col Inc.), which resulted in ~500 bacilli in the lungs of guinea pigs at day 1 post-infection. For enumeration of day 1 CFU, the whole lung homogenates were plated onto 7 H11 agar plates and colonies were counted after 3 weeks of incubation at 37°C.

Necropsy procedure and histopathological evaluation

Guinea pigs were euthanized by carbon dioxide asphyxiation. After aseptically dissecting the animals, for histopathological evaluation, three lung lobes (right caudal, middle and cranial) were removed and fixed in 10% neutral buffered formalin. Left caudal lung lobe was aseptically removed for the measurement of bacillary load. A portion of left cranial lung lobe was stored in RNA later® (Ambion) at −20°C for isolation of RNA to be used for microarray and real time RT-PCR studies.

For histopathological examination, as described previously[31] sections of 5 μm thickness from formalin fixed and paraffin embedded tissues were cut on to glass slides and stained with haematoxylin and eosin. The percent granuloma in lung, type and extent of necrosis, organization of granuloma along with the type of infiltrating cells were assessed. In order to determine the extent of collagen deposition and fibrosis, the lung sections were also stained with Van Gieson stain.

Bacterial enumeration

Specific portions of lungs were weighed and homogenized separately in 5 ml saline in a Polytron homogenizer. Appropriate dilutions of the homogenates were inoculated on to MB7H11 agar plates in duplicates and incubated at 37°C in a CO2 incubator for three to four weeks. The number of colonies were counted and expressed as log10 CFU/g of tissue. The detection limit in case of both lungs CFU was 1.0 log10 CFU/g.

Labeling of RNA samples and quality control for 44 K microarray hybridization

To evaluate the gene expression profile associated with pulmonary tuberculosis, total RNA was isolated from lung tissues of M. tuberculosis infected and naive control guinea pigs (n = 3) by using Qiagen’s RNeasy mini kit as per the manufacturer’s recommendations followed by assessment of quality (specific activity) and quantity (yield) by using Bioanalyzer (Agilent Technologies). RNA samples then were labelled with Cy3 by using Agilent Quick-Amp labeling Kit as per the manufacturer’s recommendations. Briefly, 500 ng each of the control and test RNA samples were incubated with reverse transcription mix at 40°C and converted to double stranded cDNA primed by oligo dT with a T7 polymerase promoter. Synthesized double stranded cDNA was then used as template for cRNA generation. cRNA was generated by in vitro transcription along with the incorporation of Cy3 CTP during this step (Agilent Technologies). By using Qiagen’s RNeasy mini kit, fluorescently labeled cRNA was purified followed by the assessment of quality and quantity by using Bioanalyzer (Agilent Technologies).

Hybridization and scanning

Linear amplified Cy3 labeled cRNA were hybridized to guinea pig 44 K microarray. Briefly, 1.65 μg of Cy3 labeled cRNAs were fragmented and hybridized to the array. Fragmentation of labeled cRNA and hybridization were carried out by using the Agilent Gene Expression Hybridization kit. Hybridization was carried out in Agilent’s Surehyb Chambers at 65°C for 16 hours. Following hybridization, the slides were washed by using Agilent Gene Expression wash buffers and scanned at 3μm resolution by using the Agilent Microarray Scanner G2505C under conditions to limit saturation to < 80% and were saved as TIFF images. The features then were extracted with the Feature Extraction Software (Agilent technologies, v10.7). Details of design, development and annotation of microarray are provided in Supporting materials and methods in Additional file1.

Microarray data analysis

The data was analyzed by using GeneSpring GX v11.5 software (Agilent Technologies). Normalization of the data was carried out in GeneSpring GX by using the 75th percentile shift. It subtracts this value from the expression value of each entity and normalizes to specific control samples and fold change was calculated for the infected group relative to the baseline control (cRNA derived from uninfected animals). The fold intensity data then were filtered for significantly regulated (up and down regulated) genes in the treatment group in comparison to the control group based on the following stringent criteria: Expression fold values are provided in terms of log base 2. For up regulated genes, the cutoff used is fold change > 0.6 along with geometric mean fold change > 1 and Flags "detected" in infected samples. For filtering the down regulated genes, the cutoff used is fold change < − 0.6 along with geometric mean fold change < −1 and Flags "detected" in normal uninfected control sample that means a change in expression by ≥ 1.5 fold (up or down). Based on these criteria, only those genes, which showed a significant regulation in triplicates, were further subjected to hierarchical clustering based on Pearson coefficient correlation algorithm to identify significant gene expression patterns. Genes were classified on the basis of functional categories and pathways by using GeneSpringGX software and Genotypic Biointerpreter Biological Analysis software (Genotypic Technology Pvt. Ltd.). The microarray data reported here have been submitted at NCBI’s Gene Expression Omnibus [GEO Accession number: GSE32447].

Real time RT-PCR

For the validation of 44 K GPOM, expression of a few selected genes was analysed by real time RT-PCR by employing SYBR green PCR Master Mix (Applied Biosystems) on the same RNA samples, which were used for the microarray study. Sequences of the primers employed for real time RT-PCR of C3AR1, CAMP, CCL5, IFNγ, C4BPA and 18S rRNA genes and comparison of gene expression pattern with respect to microarray are described in Supporting Table S2 in Additional file1. For the analysis of real time PCR data ΔΔCt method was employed. First, ΔCt value was calculated for each sample as the difference between the Ct values for the gene of interest and the housekeeping gene (18S) in each sample. Then, ΔΔCt value was calculated as the difference between the ΔCt values of an experimental sample and the control sample. Fold change in gene expression were calculated as 2 -ΔΔCt.

Statistical analysis

Mean differences for percent fold induction in mRNA expression levels were analyzed by student’s t test. Differences were considered statistically significant, when p < 0.05. Based on Pearson coefficient correlation algorithm, the samples were clustered by using hierarchical clustering to identify similar conditions.

Acknowledgements

We acknowledge Genotypic Technology Pvt. Ltd Bangalore for the microarray processing and data analysis reported in this publication. Priyanka Chauhan is acknowledged for critical reading of the manuscript. This work was supported by a research grant from the Department of Biotechnology, Ministry of Science and Technology, Government of India, New Delhi. However, the funding agency had no role in the study design, data collection and analysis, decision to publish or preparation of the manuscript.

Electronic supplementary material

12864_2012_4269_MOESM2_ESM.tiffAdditional file 2: Pulmonary gene expression signature of guinea pigs at 10 weeks postM. tuberculosisinfection. The figure depicts the clustered heat maps for all the genes on the 44 K GPOM in case of infected guinea pigs compared to uninfected control. By using unsupervised hierarchical clustering algorithm, the most similar expression profiles are joined together to form a group. These are further joined in a tree structure, until all data forms a single group. Clustering is based on averaged distance between two clusters, which is the average of the pair-wise distance between entities in the two clusters. For measurement of similarity between conditions, Pearson coefficient correlation clustering algorithm is used. The color scheme for the hierarchical clustering is - yellow: no change in expression, magenta: higher expression in infected lungs relative to normal lungs and green: lower expression in infected samples relative to normal uninfected lungs. 1: Uninfected control; 2: Infected Lung 1; 3: Infected Lung 2; 4: Infected Lung 3. (TIFF 80 KB)

12864_2012_4269_MOESM4_ESM.xlsAdditional file 4: Pathway analysis for the up and down regulated genes. The excel file describes the pathway analysis for the up and down regulated genes. The pathway analysis was performed by using Human Biointerpreter tool (http://genotypic.co.in/biointerpreter.html) based on KEGG database. (XLS 98 KB)

12864_2012_4269_MOESM5_ESM.xlsAdditional file 5: Comparison of pulmonary transcriptional response of guinea pig, human, non-human primate and mouse toM. tuberculosisinfection. The excel file depicts the comparison of up and down regulated genes from this guinea pig microarray study with those obtained from human, non-human primate and mouse microarray data available in the public database. The Venn diagrams depict the number of genes overlapping among different species. (XLS 226 KB)

Below are the links to the authors’ original submitted files for images.

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.