Background

To identify transcription factors (TFs) and single nucleotide polymorphisms (SNPs) of Lrh1 (also named Nr5a2) and its homologous genes in Lrh1-knockout pancreas of mice.

Methods

The RNA-Seq data GSE34030 were downloaded from Gene Expression Omnibus (GEO) database, including 2 Lrh1 pancreas knockout samples and 2 wild type samples. All reads were processed through TopHat and Cufflinks package to calculate gene-expression level. Then, the differentially expressed genes (DEGs) were identified via non-parametric algorithm (NOISeq) methods in R package, of which the homology genes of Lrh1 were identified via BLASTN analysis. Furthermore, the TFs of Lrh1 and its homologous genes were selected based on TRANSFAC database. Additionally, the SNPs were analyzed via SAM tool to record the locations of mutant sites.

Results

Total 15683 DEGs were identified, of which 23 was Lrh1 homology genes (3 up-regulated and 20 down-regulated). Fetoprotein TF (FTF) was the only TF of Lrh1 identified and the promoter-binding factor of FTF was CYP7A. The SNP annotations of Lrh1 homologous genes showed that 92% of the mutation sites were occurred in intron and upstream. Three SNPs of Lrh1 were located in intron, while 1819 SNPs of Phkb were located in intron and 1343 SNPs were located in the upstream region.

Conclusion

FTF combined with CYP7A might play an important role in Lrh1 regulated pancreas-specific transcriptional network. Furthermore, the SNPs analysis of Lrh1 and its homology genes provided the candidate mutant sites that might affect the Lrh1-related production and secretion of pancreatic fluid.

The pancreas is an endocrine gland, producing insulin, glucagon, somatostatin, and pancreatic polypeptide, and also an exocrine gland, accounting for more than 98% of pancreatic gland and secreting pancreatic juice containing digestive enzymes [1]. These digestive enzymes help to further break down the carbohydrates, proteins and lipids in the chime and thus support the absorption and digestion of nutrition in small intestine [2]. In the past decades, many research have focused on target genes and transcription factors (TFs) involved in the exocrine pancreas-specific transcriptional networks which are required for the production and secretion of pancreatic fluid that helps out the digestive system. Currently, many exocrine pancreas-specific genes and transcription factors have been identified, which may promote the understanding of the effect of exocrine pancreas on digestive system.

Liver receptor homolog-1 (Lrh1; also called Nr5a2) is a nuclear receptor of ligand-activated transcription factors in liver by binding as a monomer to DNA sequence elements with the consensus sequence 5′-Py-CAAGGPyCPu-3′ [3]. It has been suggested that Lrh1 is progressively expressed in both the endocrine and exocrine pancreas [4]. Baquié M et al. [5] have found that Lrh1 is expressed in human islets and protects β-cells against stress-induced apoptosis that may be mediated via the increased glucocorticoid production that blunts the pro-inflammatory response of islets. Meanwhile, Fayard E et al. [6] have demonstrated that both Lrh1 and CEL (encoding carboxyl ester lipase) are co-expressed and confined to the exocrine pancreas. The identification of CEL as an Lrh1-target gene indicates that Lrh1 plays an important role in enterohepatic cholesterol homeostasis associated with the absorption of cholesteryl esters and the assembly of lipoproteins by the intestine [7]. Besides, Lrh1 is a downstream target in the PDX-1 (lead to pancreas agenesis) regulatory cascade that is activated only during early stages of pancreas development and that governs pancreatic development, differentiation and function [8].

Recently, the rapid advent of next-generation sequencing has made this technology broadly available for researchers in various molecular and cellular biological fields. Holmstrom SR et al. [9] have determined the cistrome and transcriptome for the nuclear receptor LRH-1 in exocrine pancreas and revealed that Lrh1 directly induces expression of genes encoding digestive enzymes and secretory and mitochondrial proteins based on Chromatin immunoprecipitation (ChIP)-seq and RNA-seq analyses. Besides, Lrh1 cooperates with the pancreas transcription factor 1-L complex (PTF1-L) in regulation of exocrine pancreas-specific gene expression. However, many potential target genes and TFs of Lrh1 based on RNA-seq analysis have not been revealed.

In the present study, we downloaded the raw RNA-seq data of Holmstrom SR et al. deposited in The National Center for Biotechnology Information (NCBI) database, which were analyzed using multiple bioinformatics tools in the purpose of finding specific TFs of Lrh1 and its homology genes. Additionally, we also annotated the SNPs of Lrh1 and its homology genes to predict their mutant sites. Our study might improve the understanding of the regulation network of Lrh1-related production and secretion of pancreatic fluid.

Data pre-processing, gene expression and homology gene of Lrh1

The raw data were downloaded from SRA (Sequence Read Archive) of NCBI and then converted to fastq reads using fastq-dump program of NCBI SRA Toolkit (−q 64) (http://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view=std). Then, these reads were processed through TopHat [10] and Cufflinks [11] package to calculate gene-expression level. All parameters were set up according to the default settings of TopHat and Cufflinks. The DEGs were identified via non-parametric algorithm (NOISeq) methods in R package [12]. The thresholds value was False Discovery Rate (FDR) < 0.001. BLASTN analysis [13, 14] of the selected DEGs was used to identify the homology genes of Lrh1. Homology genes here refer to the paralogous genes which share a high degree of sequence similarity (maximum expectation value was set to e−5) with Lrh1 in mice.

Function annotation of Lrh1homologous genes

For functional analysis of Lrh1 homologous genes, DAVID (Database for Annotation, Visualization and Integrated Discovery) [15] was performed for Gene Ontology (GO) [16] function and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis.

Screening of SNPs

The fastq reads were mapped to marker sequences using bowtie [18]. And the aligned reads were called using the SAM tool [19]. In order to minimize the risk of false-positive SNP Callings, the threshold value was that ID was “*” with quality > 50, or ID was not “*” with quality > 20. These SNPs were annotated via SnpEff [20] to categorize the effects of variants in genome sequences. The identified SNPs were searched in the dbSNP database to identify diseased SNPs or de novo discovered SNPs.

Potential TFs of Lrh1homologous genes

Fetoprotein transcription factor (FTF) (ID: T04754) of Lrh1 was the only TF identified based on TRANSFAC database. Meanwhile, the promoter-binding factor of Lrh1 was CYP7A (Cholesterol 7α-hydroxylase).

SNPs of Lrh1homologous genes

The annotation of SNPs of Lrh1 homologous genes showed that the majority of SNPs were located in intron and upstream, accounting for nearly 92% of all SNPs (Tables 2 and 3). Three SNPs of Lrh1 were distributed in intron. Meanwhile, total 1819 SNPs of Phkb were located in the intron and 1343 SNPs were located in the upstream region of Phkb.

In the present study, combined with RNA-seq data of Lrh1-knockout pancreas samples, FTF was the only TF of Lrh1 identified based on TRANSFAC database and may regulate cholesterol catabolism into bile acids by activation of the promoter-binding factor CYP7A. Many literatures have elucidated the function of Lrh1/Nr5a2/FTF/CYP7A via experimental studies [21–25].

FTF is highly expressed in the liver and intestine and is implicated in the regulation of cholesterol, bile acid and steroid hormone homeostasis [26]. Nearly 50% of the body cholesterol is catabolized to bile acids via bile acid biosynthetic pathway, of which cholic acid (hydroxylated at position 12) and chenodeoxycholic acid are the major primary bile acids and play an important role in cholesterol homeostasis [19]. Chenodeoxycholic acid can repress FTF expression and is a more potent suppressor of HMG-CoA reductase and cholesterol 7α-hydroxylase/CYP7A1 (7α-hydroxylase) than cholic acid [27]. It has been proposed that Lrh1, also known as CYP7A promoter-binding factor, LRH1, or FTF, is required for the transcription of the 7α-hydroxylase gene [19, 28]. The small heterodimer partner 1 (SHP) of the nuclear bile acid receptor, FXR (farnesoid X receptor) can dimerize with FTF and diminish its activity on the 7α-hydroxylase promoter [29].

Although Lrh1 has been demonstrated the function in feedback regulation of CYP7A1 expression as part of the FXR-SHP-LRH-1 cascade, in which bile acids can inhibit their own synthesis, the mechanisms have not been well understood. Out C et al. [25] have suggested that CYP7A1 expression is increased rather than decreased under chow-fed conditions in Lrh1-knockdown mice that is coincided with a significant reduction in expression of intestinal Fgf15, a suppressor of CYP7A1. Besides, Noshiro M et al. [30] have suggested that the circadian rhythm of CYP7A is regulated by multiple transcription factors, including DBP, REV-ERBα/β, LXRα, HNF4α DEC2, E4BP4, and PPARα. Hepatocyte nuclear factor 4α (HNF4α) and FTF are two major TFs driving CYP7A1 promoter activity in lipid homeostasis. Bochkis IM et al. [31] have shown that prospero-related homeobox (Prox1) directly interacts with both HNF4α and FTF and potently co-represses CYP7A1 transcription.

In the present study, we annotated the SNPs of Lrh1 and its homologous genes, showing that the majority was located in intron and upstream. Quiles Romagosa MÁ [32] has reported that a functional SNP located in Lrh1 promoter is related to Body Mass Index (BMI) and these SNPs might play important roles in the obese phenotype. However, previous researches mostly focused on SNPs associated with pancreatic cancer cell growth and proliferation. For example, a previous genome-wide association study has identified five SNPs on 1q32.1 associated with pancreatic cancer that mapped to Lrh1 gene and its up-stream regulatory region [33].

In conclusion, FTF combined with CYP7A might play an important role in Lrh1 regulated pancreas-specific transcriptional network. Furthermore, the SNPs analysis of Lrh1 and its homology genes provided the candidate mutant sites that might affect the Lrh1-related production and secretion of pancreatic fluid. These common susceptibility loci for Lrh1 and its homologous genes needed follow-up studies.

Highlights

1.

Total 15683 DEGs were identified, of which 23 was Lrh1 homology genes (3 up-regulated and 20 down-regulated).

2.

Fetoprotein TF was the only TF of Lrh1 identified based on TRANSFAC database and the promoter-binding factor of fetoprotein TF was CYP7A.

3.

The SNP annotations of Lrh1 homologous genes showed that 92% of mutation sites were occurred in intron and upstream. Three SNPs of Lrh1 were located in intron, while 1819 SNPs of Phkb were located in intron and 1343 SNPs were located in upstream region.

Competing interest

The authors declare that they have no competing interests.

Authors’ contributions

MT, XM and CL participated in the design of this study, and they both performed the statistical analysis. RJ, GYH and YZ carried out the study, together with LQ, collected important background information, and drafted the manuscript. HL, XPW and ZS conceived of this study, and participated in the design and helped to draft the manuscript. All authors read and approved the final manuscript.

Leung PS: Physiology of the pancreas. The Renin-Angiotensin System: Current Research Progress in The Pancreas: The RAS in the Pancreas, Volume 690. 2010, Netherlands: Springer, 13-27.View ArticleGoogle Scholar

Pre-publication history

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.