
Intron retention in Cathelicidin-4 in river buffalo



The function of cathelicidin-4 (CATH4) is not limited to microbial killing but extends to other aspects of immunity and tissue repair. The presence of different CATH4 variants, including intron-retained variants, affects the immune system. Intron retention in buffalo has not been fully studied. In this study, we investigated CATH4 mRNA and its variants in river buffalo, which could be used in the future for selecting disease-resistant buffalo.

Results and conclusion

Analysis of CATH4 mRNA in river buffalo (Egyptian breed) revealed a novel variant (1073 bp) that includes an unspliced part of intron 3 (469 bp) in addition to the previously reported unspliced complete intron 1 (103 bp) and intron 2 (137 bp). Intron retention was identified by comparing the amplified unspliced cDNA and DNA sequences. Analysis of the three retained intronic regions revealed the presence of the four splice signals needed for splicing: the 5′ (GT) and 3′ (AG) intron splice sites, the branch point, and the polypyrimidine tract. However, in the intron-retained sequence, the polypyrimidine tract was weak. It contained only 6 and 4 non-contiguous uridines in introns 1 and 2, respectively (intron 3 was partial), which may have caused intron retention. In addition, analysis of the unspliced sequence showed three unique exonic SNPs located close to the splice sites (1 to 22 nucleotides) and five SNPs in retained intronic regions located near the splice sites (18 to 246 nucleotides away from exon/intron boundaries), which may be related to the retention of the three introns.


Cathelicidin-4 (CATH4), popularly known as indolicidin, has a broad and rapid microbicidal effect that may be critically important for clearing pathogens from tissues and preventing the onset of infection (Dorschner et al. 2001). The function of cathelicidins is not limited to microbial killing but extends to other aspects of immunity and tissue repair (Gallo et al. 2002). CATH4 contains 4 exons and 3 introns: the first 3 exons encode the signal peptide and cathelin prodomain (N-terminal), while the fourth exon encodes the cleavage site and the variable C-terminal antimicrobial peptide (Zanetti et al. 2000; Zaiou and Gallo 2002).

The water buffalo (Bubalus bubalis) population includes river buffalo (Bubalus bubalis bubalis) and swamp buffalo (Bubalus bubalis carabanesis), 77% of which are river buffalo (FAO, 2013). Buffalo are a major source of meat, milk, and their by-products. Buffalo surpass cattle in their ability to adapt to the hot, humid conditions of muddy and swampy lands (Marai and Habeeb 2010). Buffalo CATH4 was cloned by Das et al. (2006). The complete CATH4 coding region was found to be 92.9% similar to the Bos taurus nucleotide sequence. Several cathelicidin genes have been identified in cattle and buffalo. In cattle, SNPs, insertions, and deletions have been reported in different breeds of Bos taurus and Bos indicus (Gillenwaters et al. 2009). Brahma et al. (2015) investigated amplicons of cathelicidin genes from 5 breeds of cattle and buffalo. Buffalo CATH4 genes showed more single-nucleotide variation than cattle genes.

Constitutive splicing of intronic sequences from RNA is the dominant form of gene expression. However, alternative splicing leading to intron retention (IR) has been reported in many bovine genes (Chacko and Ranganathan 2009). Examples of IR have been found in bovine growth hormone (Dirksen et al. 1995), CD46 (Wang et al. 2014), and NCF4 (Ju et al. 2015). A higher relative frequency of IR has been associated with genes that have shorter introns overall (~100–200 nt), higher expression levels, weaker splice sites, and particular densities of cis-regulatory elements (Sakabe and de Souza 2007). Recent evidence suggests that single-nucleotide polymorphisms (SNPs) are the main factor contributing to the generation of alternative splice variants, which can cause degenerative axonopathy (Drögemüller et al. 2011) and a congenital mechanobullous skin disorder (Menoud et al. 2012) in cattle.

In this study, we investigated the different splice variants of CATH4 mRNA in river buffalo (Egyptian breed), which could be used for selecting disease-resistant breeds of buffalo.

Materials and methods

1. Collecting buffalo samples

Blood samples collected in ethylenediaminetetraacetic acid (EDTA) from healthy river buffalo (Egyptian breed) were kindly provided by the veterinarian of the buffalo farm “United Farms Group Company.”

2. DNA extraction

Genomic DNA was extracted from whole blood using the salting-out method of Miller et al. (1988). DNA concentrations were measured using a Nanodrop 1000 (Thermo Scientific) and adjusted to 50 ng/μL for polymerase chain reaction (PCR).
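
Adjusting a measured stock to the 50 ng/μL working concentration follows the usual dilution relation C1·V1 = C2·V2. The sketch below is illustrative only: the function name `dilution_volumes`, the 100 μL final volume, and the 200 ng/μL stock reading are assumptions, not values from this study.

```python
def dilution_volumes(stock_ng_per_ul, target_ng_per_ul=50.0, final_volume_ul=100.0):
    """Return (stock volume, diluent volume) in microlitres, from C1*V1 = C2*V2."""
    if stock_ng_per_ul < target_ng_per_ul:
        raise ValueError("stock is more dilute than the target concentration")
    stock_ul = target_ng_per_ul * final_volume_ul / stock_ng_per_ul
    return stock_ul, final_volume_ul - stock_ul


# Hypothetical Nanodrop reading of 200 ng/uL, diluted to 50 ng/uL in 100 uL.
stock_ul, diluent_ul = dilution_volumes(200.0)
print(f"{stock_ul:.1f} uL stock + {diluent_ul:.1f} uL diluent")
```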

3. mRNA extraction and cDNA synthesis

Total RNA was extracted from blood using Easy-RED™ (iNtRON Biotechnology, Inc.) according to the manufacturer’s instructions. RNA with a 260/280 optical density ratio of ~2.0 was considered to be free of DNA and proteins. cDNA synthesis was performed using the RevertAid First Strand cDNA Synthesis Kit according to the manufacturer’s instructions. To ensure that the RNA was not contaminated with genomic DNA, a PCR reaction was performed on RNA in the absence of reverse transcriptase as a negative control.

4. Primers design

Two primer pairs were designed to investigate CATH4 in Egyptian buffalo using Primer3 software (Untergasser et al. 2012). Table 1 presents the primer pair sequences and the accession number from which they were designed.

Table 1 Sequences of primer pairs used

5. PCR amplification

PCR amplification of CATH4 was performed in a 50 μl reaction volume containing 20 μl of nuclease-free water, 25 μl of PCR Master Mix (2X), 1 μl of forward primer (10 μM), 1 μl of reverse primer (10 μM), and 4 μl of 50 ng of DNA or cDNA template. The reaction mixture was run in a Q-Cycler (HVD LifeSciences). The thermal cycling program consisted of an initial denaturation at 95 °C for 3 min, followed by 40 cycles of denaturation at 95 °C for 30 s, annealing for 30 s (54 °C or 62 °C), and extension at 75 °C for 1 min, and then a final extension at 75 °C for 10 min. PCR products were detected by agarose gel electrophoresis according to the method described by Ausubel et al. (1990). The gels were inspected with a gel documentation system (In Genius, Syngenebioimaging). PCR products were purified using the MEGAquick-spin™ Total Fragment DNA Purification Kit (iNtRON Biotechnology) according to the kit’s instructions.

6. Sequence analysis and SNPs identification

Purified PCR products were sequenced by Macrogen (Korea) using forward and reverse primers. The specificity of the nucleotide sequences was verified by BLAST (Basic Local Alignment Search Tool) analysis (Altschul et al. 1990). Sequences were analyzed by multiple alignment using Clustal Omega. Polymorphic sites were determined by visual examination of the sequence chromatograms.

7. Determining the four potential splice sites

To predict the potential 5′ and 3′ splice sites (GT/AG), the SplicePort tool was used. Prediction of the branch points (BPs) and polypyrimidine tract (ppt) was carried out using the SVM-BP finder tool (Corvelo et al. 2010).

Results


PCR was conducted on buffalo cDNA. In only three samples did the first primer pair yield a 526 bp amplicon, which corresponded to the fully spliced CATH4 coding region with the complete 4 exons and part of the 3′ untranslated region (Fig. 1). The second primer pair was used with the cDNA samples that were not amplified by the first primer pair. It amplified segments of 1067 bp (Fig. 2). Alignment of the 526 bp (fully spliced) and 1067 bp amplicons showed three separate matches (Fig. 3) covering exons 1, 2, and 3. This left segments of 102 bp between exons 1 and 2, 136 bp between exons 2 and 3, and 469 bp after exon 3.
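
The unmatched segment lengths follow from where each exon match sits inside the longer amplicon. The sketch below shows that bookkeeping with exact string matching on short made-up sequences. It is not the alignment procedure used in this study (which relied on sequence alignment), and the function name `gaps_between_matches` and the toy sequences are assumptions for illustration.

```python
def gaps_between_matches(unspliced, exons):
    """Locate each exon, in order, inside the unspliced sequence and return
    the lengths of the segments left between consecutive exons and after the
    last one."""
    gaps = []
    search_from = 0
    prev_end = None
    for exon in exons:
        start = unspliced.find(exon, search_from)
        if start == -1:
            raise ValueError("exon not found in unspliced sequence")
        if prev_end is not None:
            gaps.append(start - prev_end)  # segment between two exons
        prev_end = start + len(exon)
        search_from = prev_end
    gaps.append(len(unspliced) - prev_end)  # segment after the last exon
    return gaps


# Toy data: exons in lower case, retained segments in upper case.
exons = ["atggcc", "ttgcaa", "ccgtta"]
unspliced = "atggcc" + "GTAAGTCAG" + "ttgcaa" + "GTGAGCTTAG" + "ccgtta" + "GTAAGA"
print(gaps_between_matches(unspliced, exons))  # [9, 10, 6]
```

With the lengths reported here, the three unmatched segments total 102 + 136 + 469 = 707 bp, so 1067 − 707 = 360 bp of the longer amplicon is exonic sequence.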

Fig. 1

Buffalo CATH4 full-spliced cDNA sequence. Exons 1 and 3 are underlined, and 2 and 4 are in bold. 3′ Untranslated region is in italic. SNPs: M (A/C), R (G/A), K (T/G), Y(C/T), and W (T/A)

Fig. 2

Buffalo CATH4 unspliced cDNA sequence. Starting position is c.21. Introns are in italic with the 5′ and 3′ splice site (GT-AG). Exons 1, 2 and 3 are in small letters. SNPs: M(A/C), R(G/A), K(T/G), Y( C/T), S(G/C), and W(T/A)

Fig. 3

Alignment between the spliced and unspliced CATH4 amplicons showing three separate matches which correspond to exons 1 (a), 2 (b), and 3 (c)

To determine the nature of the sequences between and beyond the exons, PCR was conducted using the second primer pair on a buffalo genomic DNA sample. The resulting DNA amplicon was aligned with the cDNA (1067 bp segment) using Clustal Omega (Fig. 4).

Fig. 4

Multiple sequence alignment of buffalo CATH4 genomic DNA and 1067 bp cDNA. Introns are in capital; exons 1, 2, and 3 are in small letters. SNPs: M(A/C), R(G/A), K(T/G), Y( C/T), S(G/C), and W(T/A)

The alignment showed that the 1067 bp cDNA (accession# MK507762.1) is an intron-retained amplicon with full retention of intron 1 (102 bp) and intron 2 (136 bp) and a large 5′ segment of intron 3 (469 bp). To find out why the introns were not spliced, the intron-retained cDNA sequence was analyzed. The analysis included identifying the locations of any SNPs and the sequences of the four cis-acting splice signals needed for splicing: the 5′ (GT) and 3′ (AG) intron splice sites, the branch points (BPs), and the polypyrimidine tract (ppt).
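
The first two of these signals can be checked directly on an intron sequence. The sketch below only tests for the canonical GT...AG dinucleotides and is not a substitute for a scored splice-site predictor. The function name `has_canonical_splice_sites` and the example sequences are hypothetical.

```python
def has_canonical_splice_sites(intron):
    """Return True if the intron begins with GT (5' donor) and ends with AG
    (3' acceptor). Accepts DNA or RNA letters in either case."""
    seq = intron.upper().replace("U", "T")
    return len(seq) >= 4 and seq.startswith("GT") and seq.endswith("AG")


# Made-up sequences for illustration only.
print(has_canonical_splice_sites("GTAAGTCTGACTTTCTCCAG"))  # True
print(has_canonical_splice_sites("GCAAGTTTTCAG"))          # False: no GT donor
print(has_canonical_splice_sites("GTAAGTCC"))              # False: 3' end missing
```

A partially retained intron, such as a 5′ fragment, fails the acceptor test simply because its 3′ end is absent from the sequence, so a negative result there says nothing about the acceptor site itself.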

The potential 5′ and 3′ splice sites (GT-AG) in the intron-retained form of CATH4 mRNA were predicted using the SplicePort tool, and feature generation algorithm (FGA) scores were calculated (Table 2). The analysis showed that the donor splice sites in the three introns were the top splice site candidates, which indicates that the three introns would be expected to be spliced.

Table 2 The potential predicted 5′ and 3′ splice sites (GT-AG) in Buffalo CATH4 unspliced cDNA sequence

The potential branch points (BPs) and polypyrimidine tract (ppt) in introns 1 and 2 (intron 3 was only 5′ partial) of the intron-retained sequence were investigated using the SVM-BP finder.

The candidate positions were those with svm_score values of 1.3308 and 0.9529, located 35 nt and 15 nt from the 3′ splice site of intron 1 and intron 2, respectively (Table 3).

Table 3 The potential predicted branch point (BP) and polypyrimidine tract (ppt) in buffalo CATH4 unspliced cDNA sequence

Figure 5 shows all candidate branch points and polypyrimidine tract in intron 1 and intron 2 nucleotide sequences of the intron-retained cDNA. In intron 1 and intron 2, the uridine tracts had only 6 and 4 non-continuous uridines (t), respectively.
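
Counting uridines in a tract, and measuring the longest uninterrupted run, can be sketched as follows. The tract sequences are invented for illustration and are not the CATH4 tracts. The function name `uridine_stats` is likewise an assumption.

```python
import re


def uridine_stats(tract):
    """Return (total uridines, longest contiguous uridine run) for a tract.
    Uridine is written as t in cDNA, so u and t are treated alike."""
    seq = tract.lower().replace("u", "t")
    runs = re.findall(r"t+", seq)
    longest = max((len(run) for run in runs), default=0)
    return seq.count("t"), longest


# Hypothetical tracts: one interrupted, one continuous.
print(uridine_stats("ctcttcctgtct"))  # (6, 2)
print(uridine_stats("ttttttttttt"))   # (11, 11)
```

The two numbers separate a tract that merely contains several uridines from one in which they form a continuous stretch.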

Fig. 5

SVM-BP finder output showing all candidate branch points (BPs) in the Egyptian buffalo CATH4 intron-retained cDNA. BP adenines (A) are in capital letters, BP sequences (nonamers; from −5 to +3 relative to the BP adenine) are underlined, and polypyrimidine tract lengths (ppt_len) are in italic

SNPs positions relative to splicing sites

Polymorphic sites and their distances from the GT splice site were determined (Table 4) in nucleotide sequences of the spliced cDNA (8 SNPs) and the intron-retained cDNA (13 SNPs; 8 in exonic regions and 5 in intron-retained segments).
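
Since the SNPs are written as IUPAC ambiguity codes in the sequences (M, R, K, Y, S, W in the figure legends), their positions and distances can be listed programmatically. The sketch below is illustrative: the function name `snp_distances`, the toy sequence, and the convention of measuring distance from the G of the GT site are assumptions, not the procedure used for Table 4.

```python
# Ambiguity codes as given in the figure legends.
IUPAC_SNP = {"M": "A/C", "R": "G/A", "K": "T/G", "Y": "C/T", "S": "G/C", "W": "T/A"}


def snp_distances(seq, donor_index):
    """Return (index, code, alleles, distance) for every ambiguity code,
    where distance is the number of nucleotides between the SNP and the
    G of the GT donor site at donor_index (0-based)."""
    hits = []
    for i, base in enumerate(seq.upper()):
        if base in IUPAC_SNP:
            hits.append((i, base, IUPAC_SNP[base], abs(i - donor_index)))
    return hits


# Toy sequence with two ambiguity codes around a GT donor site.
seq = "ACGRTCAGTAAGYT"
donor = seq.find("GT")  # 7
print(snp_distances(seq, donor))
# [(3, 'R', 'G/A', 4), (12, 'Y', 'C/T', 5)]
```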

Table 4 Single-nucleotide polymorphisms in spliced and Intron-retained sequences of Egyptian buffalo CATH4 mRNA and their distances from GT splice site

Discussion


Intron retention, a form of alternative splicing that affects the mechanism of gene expression control in mammals, is not yet fully studied. It enhances gene regulatory complexity in vertebrates (Schmitz et al. 2017) and plays an essential conserved role in normal physiology and in diverse diseases (Wong et al. 2016). Coolidge et al. (1997) reported the role of cis-regulatory elements in intron retention. Galante et al. (2004) and Sakabe and de Souza (2007) reported that retained introns are on average shorter, more C/G rich, and associated with weaker splice sites than constitutive introns.

In the present work, we detected retention of the three introns in CATH4 of Egyptian buffalo. Accurate intron splicing requires four strong cis-acting splice signals: the 5′ (GT) and 3′ (AG) intron splice sites, the branch point (BP), and the polypyrimidine tract (ppt) (Black 2003). However, these cis-sequences provide only one half of the information required for recognition by the splicing machinery (Lim and Burge 2001). The number of nucleotides between the branch point and the nearest 3′ acceptor site, ranging from 18 to 40 nt, was found to affect splice site selection (Taggart et al. 2012; Clancy 2008). In the present study, the branch sites in the intron-retained cDNA were located 35 nt and 15 nt from the 3′ splice site for introns 1 and 2, respectively. The distance for intron 1 lies within this range, whereas that for intron 2 falls slightly below it. No information was available for intron 3 since only its 5′ part was retained.

Pyrimidine tracts also play a role in intron splicing. Strong pyrimidine tracts contain 11 contiguous uridines, whereas decreasing the contiguous uridine stretch to five or six residues requires that the tract be located immediately adjacent to the AG for optimal competitive efficiency (Coolidge et al. 1997). In the present investigation, intron retention may have been caused by weak pyrimidine tracts, which contained only 6 and 4 non-contiguous uridines in intron 1 and intron 2, respectively, and were not immediately adjacent to the AG. No results were available for intron 3 since only its 5′ segment was present in the sequence. It is worth mentioning that the fully retained introns in the buffalo CATH4 gene were short. Retention of introns in genes with short introns has been reported (Sakabe and de Souza 2007).

Recently, SNPs have been considered to play a role in alternative splicing leading to intron retention (Wang et al. 2014; Ju et al. 2015). Exonic SNPs have direct effects on the properties of proteins, while SNPs within introns and untranslated regions can affect the expression and splicing of mRNA (Wang et al. 2013, 2014). Estivill (2015) reported that 20,000 SNPs were located close to splice sites. However, SNPs more than 30 nucleotides away from splice sites also disrupted splicing in 10,000 exons with evidence of alternative splicing, and splicing mutations located deeper in intronic regions (within 300 nucleotides of splice sites) were associated with disease. In the present study, 8 SNPs were detected in the buffalo intron-retained cDNA, located 7 to 246 nt away from the GT splice site, which may be related to the retention of the three introns.

The presence of different CATH4 variants has been reported in other breeds of Indian buffalo (Brahma et al. 2015). Individual buffalo from the Mehsana and Murrah breeds were reported to carry 4 to 6 variants of the CATH4 gene, and the gene could be present in multiple copies. Differences in CATH4 copy number have been reported to be breed-specific in indicine (Nelore) relative to taurine cattle (Bickhart et al. 2012).

Conclusion


CATH4 mRNA in river buffalo (Egyptian breed) is present as a spliced variant and as intron-retained variants (three introns were retained) despite the presence of the conserved cis-sequences (the four splice signals) essential for accurate splicing. Retention of introns 1, 2, and 3 may have resulted from short introns, weak polypyrimidine tracts containing only 6 and 4 non-contiguous uridines, and/or SNPs located close to the AG splice site.

Availability of data and materials

We declare that all data analyzed during this study are included in this published article.



Abbreviations

BPs: Branch points
CD46: Cluster of differentiation 46
IR: Intron retention
NCF4: Bovine neutrophil cytosolic factor 4
PCR: Polymerase chain reaction
ppt: Polypyrimidine tract
SNPs: Single-nucleotide polymorphisms


References

  1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol. 215:403–410

  2. Ausubel FM, Brent R, Kingston RE, Moore DD, Seidman JG, Smith JA, Struhl K (1990) Current protocols in molecular biology (editors). Greene Publishing and Wiley-Interscience, New York

  3. Bickhart DM, Hou Y, Schroeder SG, Alkan C, Cardone MF, Matukumalli LK et al (2012) Copy number variation of individual cattle genomes using next-generation sequencing. Genome Res. 22:778–790

  4. Black DL (2003) Mechanisms of alternative pre-mRNA splicing. Annu Rev Biochem. 27:291–336

  5. Brahma B, Patra MC, Karri S, Chopra M, Mishra P et al (2015) Diversity, antimicrobial action and structure-activity relationship of buffalo cathelicidins. PLoS ONE. 10(12):e0144741

  6. Chacko E, Ranganathan S (2009) Genome-wide analysis of alternative splicing in cow: implications in bovine as a model for human diseases. BMC Genomics. 10(Suppl3):S11

  7. Clancy S (2008) RNA splicing: introns, exons and spliceosome. Nat Educ. 1(1):31

  8. Coolidge CJ, Seely RJ, Patton JG (1997) Functional analysis of the polypyrimidine tract in pre-mRNA splicing. Nucleic Acids Res. 25(4):888–896

  9. Corvelo A, Hallegger M, Smith CW, Eyras E (2010) Genome-wide association between branch point properties and alternative splicing. PLoS Comput Biol. 6(11):e1001016

  10. Das H, Sharma B, Kumar A (2006) Cloning and characterization of novel cathelicidin cDNA sequence of Bubalus bubalis homologous to Bos taurus cathelicidin-4. DNA Seq. 17(6):407–414

  11. Dirksen WP, Sun Q, Rottman FM (1995) Multiple splicing signals control alternative intron retention of bovine growth hormone pre-mRNA. J Biol Chem. 270:5346–5352

  12. Dorschner RA, Pestonjamasp VK, Tamakuwala S, Ohtake T, Rudisill J, Nizet V et al (2001) Cutaneous injury induces the release of cathelicidin anti-microbial peptides active against group A Streptococcus. J Investig Dermatol. 117:91–97

  13. Drögemüller C, Reichart U, Seuberlich T, Oevermann A, Baumgartner M, Kühni Boghenbor K et al (2011) An unusual splice defect in the mitofusin 2 gene (MFN2) is associated with degenerative axonopathy in Tyrolean Grey cattle. PLoS ONE. 6:e18931

  14. Estivill X (2015) Genetic variation and alternative splicing. Nat Biotechnol. 33(4):357–359

  15. FAO (2013) FAO Statistical year book - world food and agriculture

  16. Galante PA, Sakabe NJ, Kirschbaum-Slager N, de Souza SJ (2004) Detection and evaluation of intron retention events in the human transcriptome. RNA. 10:757–765

  17. Gallo RL, Murakami M, Ohtake T, Zaiou M (2002) Biology and clinical relevance of naturally occurring antimicrobial peptides. J Allergy Clin Immunol. 110:823–831

  18. Gillenwaters EN, Seabury CM, Elliott JS, Womack JE (2009) Sequence analysis and polymorphism discovery in 4 members of the bovine cathelicidin gene family. J Hered. 100(2):241–245

  19. Ju Z, Wang C, Wang X, Yang C, Sun Y, Jiang Q et al (2015) Role of an SNP in alternative splicing of bovine NCF4 and mastitis susceptibility. PLoS ONE. 10(11):e0143705

  20. Lim LP, Burge CB (2001) A computational analysis of sequence features involved in recognition of short introns. Proc Natl Acad Sci USA. 98:11193–11198

  21. Marai IFM, Habeeb AAM (2010) Buffaloes’ reproductive and productive traits as affected by heat stress. Tropical Subtropical Agroecosystems. 12:193–217

  22. Menoud A, Welle M, Tetens J, Lichtner P, Drögemüller C (2012) A COL7A1 mutation causes dystrophic epidermolysis bullosa in Rotes Höhenvieh cattle. PLoS ONE. 7:e38823

  23. Miller SA, Dykes DD, Polesky HF (1988) A simple salting out procedure for extracting DNA from human nucleated cells. Nucleic Acids Res. 16:1215

  24. Sakabe NJ, de Souza SJ (2007) Sequence features responsible for intron retention in human. BMC Genomics. 8:59

  25. Schmitz U, Pinello N, Jia F, Alasmari S, Ritchie W, Keightley MC, Shini S, Lieschke GJ, Wong JJ, Rasko JEJ (2017) Intron retention enhances gene regulatory complexity in vertebrates. Genome Biol. 18:216

  26. Taggart AJ, DeSimone AM, Shih JS, Filloux ME, Fairbrother WG (2012) Large-scale mapping of branchpoints in human pre-mRNA transcripts in vivo. Nat Struct Mol Biol. 19(7):719–721

  27. Untergasser A, Cutcutache I, Koressaar T, Ye J, Faircloth BC, Remm M, Rozen SG (2012) Primer3: new capabilities and interfaces. Nucleic Acids Res. 40:e115

  28. Wang X, Li T, Zhao HB, Khatib H (2013) A mutation in the 3′ untranslated region diminishes microRNA binding and alters expression of the OLR1 gene. J Dairy Sci. 96:6525–6528

  29. Wang X, Zhong J, Gao Y, Ju Z, Huang J (2014) A SNP in intron 8 of CD46 causes a novel transcript associated with mastitis in Holsteins. BMC Genomics. 15:630

  30. Wong JJ, Au AY, Ritchie W, Rasko JE (2016) Intron retention in mRNA: no longer nonsense: known and putative roles of intron retention in normal and disease biology. Bioessays. 38(1):41–49

  31. Zaiou M, Gallo RL (2002) Cathelicidins, essential gene-encoded mammalian antibiotics. J Mol Med. 80:549–561

  32. Zanetti S, Deriu A, Volterra L, Falchi MP, Molicotti P, Fadda G, Sechi LA (2000) Virulence factors in Vibrio alginolyticus strains isolated from aquatic environments. Annali di Igiene. 12(6):487–491


Acknowledgements

Not applicable


Funding

No specific funding was provided for this work.

Author information




Contributions

AAAM and SMEN designed the experiment, analyzed the data, and wrote the manuscript. EAB and ETS conducted the practical section of the work. NMO carried out the statistical analysis. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Ahlam A. Abou Mossallam.

Ethics declarations

Ethics approval and consent to participate

Blood samples used in this study were collected and provided by the buffalo farm’s experienced veterinarian.

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.


Cite this article

Abou Mossallam, A.A., El Nahas, S.M., Balabel, E.A. et al. Intron retention in Cathelicidin-4 in river buffalo. Bull Natl Res Cent 43, 116 (2019).



Keywords

  • Egyptian buffalo
  • Cathelicidin-4
  • CATH4 splice sites
  • cDNA
  • SNPs