Intron retention in Cathelicidin-4 in river buffalo
Bulletin of the National Research Centre volume 43, Article number: 116 (2019)
The function of cathelicidins-4 (CATH4) is not limited to microbial killing, but extends to other aspects of immunity and tissue repair. The presence of different CATH4 variants including intron retention affects the immunity system. Intron retention, in buffalo, is not fully studied. In this study, we investigated CATH4 mRNA in river buffalo and their variants, which can be used in the future for selecting buffalo resistant to diseases.
Results and conclusion
Analysis of CATH4 mRNA in river buffalo (Egyptian breed) revealed the presence of a novel variant (1073 bp) which includes unspliced part of intron 3 (469 bp) in addition to previously reported unspliced complete intron 1 (103) and intron 2 (137 bp). Identification of intron retention was conducted by comparing the amplified unspliced cDNA and DNA sequences. Analysis of the 3 retained intronic regions revealed the presence of the 4 splice signals, needed for splicing which include the 5′ (GT) and 3′ (AG) intron splice sites, the branch point, and the polypyrimidine tract. However, in the intron-retained sequence, the polypyrimidine tract was weak. It contained 6 and 4 non-continuous uridine stretch in introns 1 and 2, respectively, (intron 3 was partial) which may have caused introns retention. In addition, analysis of the unspliced sequence showed three unique exonic SNPS located close to the splice sites (1 to 22 nucleotides) and five SNPs in retained intronic regions located near the splice sites (18 to 246 nucleotides away from exon/intron boundaries) which may be related to the retention of the three introns.
Cathelicidin-4 (CATH4), popularly known as indolicidin, has broad and rapid microbicidal effect that may be critically important to clear tissues from pathogens and to prevent the onset of infection (Dorschner et al. 2001). The function of cathelicidins is not limited to microbial killing, but extends to other aspects of immunity and tissue repair (Gallo et al. 2002). CATH4 contains 4 exons and 3 introns: the first 3 exons comprise signal peptide and cathelin prodomain (N-terminal) while the fourth exon encodes the cleavage site and variable C-terminal antimicrobial peptide (Zanetti et al. 2000; Zaiou and Gallo 2002).
Water buffalo (Bubalis bubalis) population includes river (Bubalus bubalis bubalis) and swamp buffalo (Bubalus bubalis carabanesis), 77% of which are river buffalo (FAO, 2013). Buffalo are major source of meat, milk, and its bi-products. Buffalo surpass the cattle in its ability to adapt to the hot, humid areas of muddy and swampy lands (Marai and Habeeb 2010). Buffalo CATH4 was cloned by Das et al. (2006). The complete CATH4 coding region was found to be 92.9% similar to Bos taurus nucleotide sequence. Several cathelicidin genes were identified in cattle and buffaloes. In cattle, SNPs, insertions, and deletions have been reported in different breeds of Bos taurus and Bos indicus (Gillenwaters et al. 2009). In a study by Brahma et al. (2015), amplicons of cathelicidin genes of 5 breeds of cattle and buffalo were investigated. Buffalo CATH4 genes showed higher single-nucleotide variations compared to cattle genes.
Constitutive splicing of intronic sequences from RNA is the dominant form of gene expression. However, alternative splicing leading to intron retention (IR) has been reported in many bovine genes (Chacko and Ranganathan 2009). Examples of IR have been found in bovine growth hormone (Dirksen et al. 1995), CD46 (Wang et al. 2014), and NCF4 (Ju et al. 2015). A higher relative frequency of IR has been associated with genes with overall shorter intron lengths (~ 100–200 nt), higher expression levels, weaker splice sites, and particular densities of Cis-regulatory elements (Sakabe and de Souza 2007). Recent evidences suggest that single-nucleotide polymorphisms (SNPs) are the main factor that contribute to the generation of alternative splice variants, which can cause degenerative axonopathy (Drögemüller et al. 2011) and a congenital mechanobullous skin disorder (Menoud et al. 2012) in cattle.
In this study, we investigated the different splice variants of mRNA CATH4 in river buffalo (Egyptian breed) which can be used for selecting disease-resistant breeds of buffalo.
Materials and methods
1. Collecting buffalo samples
Blood samples collected on ethylene diamine tetra acetic acid (EDTA) from healthy river buffalo (Egyptian breed) were kindly provided by the veterinarian of the buffalo farm “United Farms Group Company.”
2. DNA extraction
Genomic DNA was extracted from whole blood using salting out method according to Miller et al. (1988). The DNA concentrations were measured using Nanodrop 1000 (Thermoscientific) and were adjusted to 50 ng/μL for polymerase chain reaction (PCR).
3. mRNA extraction and cDNA synthesis
Total RNA from blood was extracted using Easy–RED™ iNtRON Biotechnology, Inc. according to the manufacturer’s instructions. RNA was considered to be free of DNA and proteins with a 260/280 optical density ratio of ~ 2.0. cDNA synthesis was performed using Revert Aid First Strand Synthesis Kit according to the manufacturer’s instructions. To ensure that the RNA was not contaminated with genomic DNA, a PCR reaction was performed using RNA in absence of reverse transcriptase, as a negative control.
5. PCR amplification
PCR amplification for CATH4 was performed in 50 μl reaction volume which includes 20 μl of water (nuclease-free), 25 μl of PCR Master Mix (2X), 1 μl of forward primer (10 μM), 1 μl of reverse primers (10 μM), and 4 μl of 50 ng of DNA or cDNA template. The reaction mixture was run in a Q-Cycler, HVD LifeSciences. The thermal cycling program was initial denaturation at 95 °C for 3 min followed by 40 cycles of the following: denaturation at 95 °C for 30s, annealing for 30s (54 °C or 62 °C), extension at 75 °C for 1 min, and then final extension at 75 °C for 10 min. Detection of PCR products were performed by agarose gel electrophoresis according to the method described by Ausubel et al. (1990). The gels were inspected by Gel documentation system (In Genius, Syngenebioimaging). PCR products were purified using MEGAquick-spinTM Total Fragment DNA Purification Kit (iNtRON biotechnology) according to the kit’s instructions.
6. Sequence analysis and SNPs identification
Purified PCR products were sequenced by Macrogen (Korea) using reverse and forward primers. The specificity of the nucleotide sequences were verified by BLAST analysis (Basic Local Alignment Search Tool) (http://blast.ncbi.nlm.nih.gov/Blast.cgi) (Altschul et al. 1990). Sequences were analyzed by multiple alignments using Clustal Omega (http://www.ebi.ac.uk/Tools/msa/clustalo/). Polymorphic sites were determined by visual examination of sequence’s charts.
7. Determining the four potential splice sites
To predict the potential splice sites 5′ and 3′ (GT/AG), the Splice Port tool (http://spliceport.cbcb.umd.edu/) was used. Prediction of the branch points (BPs) and polypyrimidine tract (ppt) were carried out using SVM-BP finder tool (http://regulatorygenomics.upf.edu/Software/SVM_BP/) (Corvelo et al. 2010).
PCR was conducted on cDNA of buffalo. Only in three samples that the first primer pairs resulted in 526 bp amplicon which corresponded to CATH4 full-spliced coding region with complete 4 exons and part of 3′ untranslated region (Fig. 1). The second primer pair was used with the cDNA samples that were not amplified by the first primer. It amplified segments of 1067 bp (Fig. 2). In the alignment between the two amplified amplicons, 526 bp (full-spliced segment) and 1067 bp segments showed three separate matches (Fig. 3) covering exons1, 2, and 3. This left segments of 102 bp between exons 1 and 2, 136 bp between exons 2 and 3, and 469 bp after exon 3.
In order to determine the nature of the sequences between the exons and beyond, a PCR reaction was conducted using the second primer pair on genomic DNA buffalo sample. The resulted DNA amplicon was aligned with the cDNA (1067 bp segment) using Clustal Omega (https://www.ebi.ac.uk/Tools/msa/clustalo/) (Fig. 4).
The alignment showed that the 1067 bp cDNA (accession# MK507762.1) is an intron-retained amplicon with full retention of intron 1 (102 bp) and intron 2 (136 bp) and a large segment of 5′ of intron 3 (469 bp). In order to find out reasons for unsplicing of the introns, the cDNA intron-retained sequence was analyzed. The analysis includes identifying the locations of any occurring SNPs and the sequences of cis-acting 4 splice signs, needed for splicing. The latter includes the 5′ (GT) and 3′ (AG) intron splice sites, the branch points (BPs), and polypyrimidine tract (ppt).
Prediction of the potential 5′ and 3′ splice sites (GT-AG) in CATH4 mRNA intron-retained form was determined using the Splice Port tool (http://spliceport.cbcb.umd.edu/), and feature generation algorithm (FGA) scores were calculated (Table 2). The analysis showed that the donor splice sites in the three introns are the top splice site candidates which indicate that the three introns should have been spliced.
The potential branch points (BPs) and polypyrimidine tract (ppt) in introns 1 and 2 (intron 3 was only 5′ partial) of the intron-retained sequence were investigated using the SVM-BP finder (http://regulatorygenomics.upf.edu/Software/SVM_BP/).
The candid positions were the ones with svm_score of 1.3308 and 0.9529 located at distances of 35 nt and 15 nt from the 3′ splice site of intron 1 and intron 2, respectively, (Table 3).
Figure 5 shows all candidate branch points and polypyrimidine tract in intron 1 and intron 2 nucleotide sequences of the intron-retained cDNA. In intron 1 and intron 2, the uridine tracts had only 6 and 4 non-continuous uridines (t), respectively.
SNPs positions relative to splicing sites
Polymorphic sites and their distances from the GT splice site were determined (Table 4) in nucleotide sequences of the spliced cDNA (8 SNPs) and the intron-retained cDNA (13 SNPs; 8 in exonic regions and 5 in intron-retained segments).
Intron retention, a form of alternative splicing that affects the mechanism of gene expression control in mammals, is not fully yet studied. It enhances gene regulatory complexity in vertebrates (Schmitz et al. 2017) and plays an essential conserved role in normal physiology and in diverse diseases (Wong et al. 2016). It was in 1997 that Coolidge et al. reported the role of the cis-regulatory elements in intron retention. Galante et al. (2004) and Sakabe and de Souza (2007) reported that retained introns are on the average shorter, more C/G rich, and associated with weaker splice sites than constitutive introns.
In the present work, we have detected retention of the three introns in CATH4 of Egyptian buffalo. Accurate intron splicing occurs in the presence of strong cis-acting 4 splice signs which include the 5′ (GT) and 3′ (AG) intron splice sites, the branch points (BPs), and polypyrimidine tract (ppt) sequences which are essential for accurate splicing (Black 2003). However, these cis-sequences provide only one half of the information required for recognition by the splicing machinery (Lim and Burge 2001). The number of nucleotides between the branchpoint and the nearest 3′ acceptor site, ranging from 18 to 40 bp, was found to affect splice site selection (Taggart et al. 2012; Clancy 2008). In the present study, the branch sites in the intron-retained cDNA were located at distances of 35 nt and 15 nt from the 3′ splice site for introns 1 and 2, respectively, which are within the distances suggested by Clancy (2008). No information was available for intron 3 since it was only 5′ partially retained.
For intron-splicing, pyrimidine tracts play a role. Strong pyrimidine tracts contain 11 continuous uridines, whereas decreasing the continuous uridine stretch to five or six residues requires that the tract be located immediately adjacent to the AG for optimal competitive efficiency (Coolidge et al. 1997). In the present investigation, intron retention may have been caused by the presence of weak pyrimidine tracts which contained only 6 and 4 non-continuous uridine stretch for intron 1 and intron 2, respectively, and are not immediately next to AG. No results were available for intron 3 since only the 5′ segment of intron 3 was present in the sequence. It is worth mentioning that the fully retained introns in buffalo CATH4 gene were short. Retaining introns in genes with short introns have been reported (Sakabe and de Souza 2007).
Recently, SNPs have been considered to play a role in alternative splicing leading to intron retention (Wang et al. 2014; Ju et al. 2015). Exonic SNPs have direct effects on the properties of proteins, while SNPs within introns and untranslated region can affect the expression and splicing of mRNA (Wang et al. 2013, 2014). In a study by Estivill (2015), it was reported that 20,000 SNPs were located close to splice sites. However, there were cases where SNPs were > 30 nucleotides away from the splice sites that disrupted splicing in 10,000 exons with evidence of alternative splicing and that splicing mutations located deeper in intronic regions (within 300 nucleotides from splice sites) were associated with disease. In the present study, 8 SNPs were detected in buffalo intron-retained cDNA, located at distances ranging from 7 to 246 nt away from the GT splice site which may be related to the retention of the three introns.
The presence of different CATH4 variants has been reported in other breeds of Indian buffaloes (Brahma et al. 2015). Individual buffalo from Mehsana and Murrah breeds was reported to carry 4-6 variants of CATH4 gene and that the gene could be present in multiple copies. Differences in CATH4 copy number has been reported to be breed-specific in indicine (Nelore) relative to the taurine cattle (Bickhart et al. 2012).
CATH4 in river buffalo (Egyptian breed) mRNA is present in spliced variant and intron-retained variants (three introns were retained) despite the presence of the conserved cis-sequences (the 4 splice signals) essential for accurate splicing. Retention of introns 1, 2, and 3 may have occurred as a result of short introns, weak polypyrimidine tracts containing 6 and 4 non-continuous uridine stretch, and/or SNPs located close to AG splice site.
Availability of data and materials
We declare that all data analyzed during this study are included in this published article.
Cluster of differentiation46
Bovine neutrophil cytosolic factor 4
Polymerase chain reaction
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol. 215:403–410
Ausubel FM, Brent R, Kingston RE, Moore DD, Seidman JG, Smith JA, Struhl K (1990) Current protocols in molecular biology (editors). Greene Publishing and Wiley-Interscience, New York
Bickhart DM, Hou Y, Schroeder SG, Alkan C, Cardone MF, Matukumalli LK et al (2012) Copy number variation of individual cattle genomes using next-generation sequencing. Genome Res. 22:778–790
Black DL (2003) Mechanisms of alternative pre-mRNA splicing. Annu Rev Biochem. 27:291–336
Brahma B, Patra MC, Karri S, Chopra M, Mishra P et al (2015) Diversity, antimicrobial action and structure-activity relationship of buffalo cathelicidins. PLOS ONE. 10(12):e0144741
Chacko E, Ranganathan S (2009) Genome-wide analysis of alternative splicing in cow: implications in bovine as a model for human diseases. BMC Genomics. 10(Suppl3):S11
Clancy S (2008) RNA splicing: introns, exons and spliceosome. Nat Educ. 1(1):31
Coolidge CJ, Seely RJ, Patton JG (1997) Functional analysis of the polypyrimidine tract in pre-mRNA splicing. Nucleic Acids Res. 25(4):888–896
Corvelo A, Hallegger M, Smith CW, Eyras E (2010) Genome-wide association between branch point properties and alternative splicing. PLoS Comput Biol. 6(11):e1001016
Das H, Sharma B, Kumar A (2006) Cloning and characterization of novel cathelicidin cDNA sequence of Bubalus bubalis homologous to Bos taurus cathelicidin-4. DNA Seq. 17(6):407–414
Dirksen WP, Sun Q, Rottman FM (1995) Multiple splicing signals control alternative intron retention of bovine growth hormone pre-mRNA. J Biol Chem. 270:5346–5352
Dorschner RA, Pestonjamasp VK, Tamakuwala S, Ohtake T, Rudisill J, Nizet V et al (2001) Cutaneous injury induces the release of cathelicidin anti-microbial peptides active against group A Streptococcus. J Investig Dermatol. 117:91–97
Drögemüller C, Reichart U, Seuberlich T, Oevermann A, Baumgartner M, Kühni Boghenbor K et al (2011) An unusual splice defect in the mitofusin 2 gene (MFN2) is associated with degenerative axonopathy in Tyrolean Grey cattle. PLoS One 6:e18931
Estivill X (2015) Genetic variation and alternative splicing. Nat Biotechnol. 33(4):357–359
FAO (2013) FAO Statistical year book - world food and agriculture
Galante PA, Sakabe NJ, Kirschbaum-Slager N, de Souza SJ (2004) Detection and evaluation of intron retention events in the human transcriptome. RNA. 10:757–765
Gallo RL, Murakami M, Ohtake T, Zaiou M (2002) Biology and clinical relevance of naturally occurring antimicrobial peptides. J Allergy Clin Immunol. 110:823–831
Gillenwaters EN, Seabury CM, Elliott JS, Womack JE (2009) Sequence analysis and polymorphism discovery in 4 members of the bovine cathelicidin gene family. J Hered. 100(2):241–245
Ju Z, Wang C, Wang X, Yang C, Sun Y, Jiang Q et al (2015) Role of an SNP in alternative splicing of bovine NCF4 and mastitis susceptibility. PLoS ONE 10(11):e0143705
Lim LP, Burge CB (2001) A computational analysis of sequence features involved in recognition of short introns. Proc Natl Acad Sci USA. 98:11193–11198
Marai IFM, Habeeb AAM (2010) Buffaloes’ reproductive and productive traits as affected by heat stress. Tropical Subtropical Agroecosystems. 12:193–217
Menoud A, Welle M, Tetens J, Lichtner P, Drögemüller C (2012) A COL7A1 mutation causes dystrophic epidermolysis bullosa in Rotes Höhenvieh cattle. PLoS One. 7:e38823
Miller SA, Dykes DD, Polesky HF (1988) A simple salting out procedure for extracting DNA from human nucleated cells. Nucleic Acids Res. 16:1215
Sakabe NJ, de Souza SJ (2007) Sequence features responsible for intron retention in human. BMC Genomics. 8:59
Schmitz U, Pinello N, Jia F, Alasmari S, Ritchie W, Keightley MC, Shini S, Lieschke GJ, Wong JJ, Rasko JEJ (2017) Intron retention enhances gene regulatory complexity in vertebrates. Genome Biol. 18:216
Taggart AJ, DeSimone AM, Shih JS, Filloux ME, Fairbrother WG (2012) Large-scale mapping of branchpoints in human pre-mRNA transcripts in vivo. Nat Struct Mol Biol. 19(7):719–721
Untergasser A, Cutcutache I, koressaar T, ye J, Faircloth BC, Remm M, Rozen SG (2012) Primer 3 new capabilities and interfaces. Nucleic Acids Res. 40:e115
Wang X, Li T, Zhao HB, Khatib H (2013) A mutation in the 3′ untranslated region diminishes micro RNA binding and alters expression of the OLR1 gene. J Dairy Sci. 96:6525–6528
Wang X, Zhong J, Gao Y, Ju Z, Huang J (2014) A SNP in intron 8 of CD46 causes a novel transcript associated with mastitis in Holsteins. BMC Genomic. 15:630
Wong JJ, Au AY, Ritchie W, Rasko JE (2016) Intron retention in mRNA: no longer nonsense: known and putative roles of intron retention in normal and disease biology. Bioessays. 38(1):41–49
Zaiou M, Gallo RL (2002) Cathelicidins, essential gene-encoded mammalian antibiotics. J Mol Med. 80:549–561
Zanetti S, Deriu A, Volterra L, Falchi MP, Molicotti P, Fadda G, Sechi LA (2000) Virulence factors in Vibrio alginolyticus strains isolated from aquatic environments. Annali di Igiene. 12(6):487–491
No specific fund was supplied for this work.
Ethics approval and consent to participate
Blood samples used in this study was collected and provided by the buffalo farm experienced veterinary.
Consent for publication
The authors declare that they have no competing interests.
About this article
Cite this article
Abou Mossallam, A.A., El Nahas, S.M., Balabel, E.A. et al. Intron retention in Cathelicidin-4 in river buffalo. Bull Natl Res Cent 43, 116 (2019). https://doi.org/10.1186/s42269-019-0158-y