The Pirarucu (Arapaima gigas) is among the world’s biggest freshwater fish and member of the particular superorder Osteoglossomorpha (bonytongues), a single of the oldest lineages of ray-finned fishes. This species is an obligate air-breather found in the particular basin of the Amazon . com River by having an attractive possible for aquaculture. Its phylogenetic position among bony these people own in makes the Pirarucu another subject for evolutionary scientific studies of early teleost diversity. Here, we present, the first time, a draft genome edition of the A. gigas genome, providing useful details for more functional and evolutionary studies. The A. gigas genome was assembled with 103-Gb raw reads sequenced within an Illumina platform. The particular final draft genome assemblage was ∼661 Mb, with the contig N50 corresponding to 51. 23 kb and scaffold N50 of 668 kb. Repeat sequences accounted for 21. 69% of the whole genome, and also a total of twenty-four, 655 protein-coding genes have been predicted from the genome assembly, with an average associated with nine exons per gene. Phylogenomic analysis based on 24 fish species supported the postulation that Osteoglossomorpha and Elopomorpha (eels, tarpons, and bonefishes) are sister groups, both forming the sister lineage with respect to Clupeocephala (remaining teleosts). Divergence period estimations suggested that Osteoglossomorpha and Elopomorpha lineages surfaced independently in a period of ∼30 Myr in the particular Jurassic. The draft genome of any. gigas provides a new valuable genetic resource for further investigations of major studies and may likewise give a valuable data with regard to monetary applications.
Arapaima gigas, also known as Pirarucu or Paiche, is a single of the world’s greatest freshwater fishes (Wijnstekers 2011) whose body length plus weight may attain 4. 5 m (15 ft) and 2 hundred Kg (440 lb), respectively (Nelson 1994; Froese and Pauly 2018). The genus Arapaima emerged in the Amazon online floodplain basin and is presently distributed in Brazilian, Colombia, Ecuador, and Peru (Hrbek et al. 2005, 2007; Froese and Pauly 2018), and also inside Thailand and Malaysia wherever it has been introduced for commercial fishing (Froese and Pauly 2018). Arapaima gigas local name (Pirarucu) derives through the indigenous Tupi words “pira” and “urucum” for “fish” and “red, ” respectively, presumably referring to its red end scales flecks in order to its reddish flesh (Marsden year 1994; Godinho et al. 2005). The peculiarity from the breathing apparatus is characteristic associated with this Amazonian fish, including gills and a lung-like tissue devised for air-breathing produced from a modified in addition to enlarged swim bladder (Burnie and Wilson 2001; Brauner et al. 2004). The Pirarucu has an appealing market value due to its low-fat and low bone content material. Overfishing practices in the Amazonian region led to the particular banning of Pirarucu commercialization by the Brazilian authorities in 2001, although consumption by the native population is usually currently permitted under rigid size and seasoning regulations (Bayley and Petrere 1989). Its main supply will be provided by wild-caught fish and fish farming performed by riverbank population regarding the Amazonas (Froese plus Pauly 2018). Aquaculture creation is attractive due to be able to high carcass yields in addition to rapid juvenile growth, along with yearlings reaching up to be able to 10 kg (22 lb) (Almeida et al. 2013).
Arapaima gigas : Pirarucu
Arapaima gigas belongs to the superorder Osteoglossomorpha of bony-tongued fishes whose tongue includes sharp bony teeth with regard to disabling and shredding preys (Sanford and Lauder 1990; Burnie and Wilson 2001). Together with Elopomorpha (eels and tarpons) and Clupeocephala (most of extant fish species), the Osteoglossomorpha comprises one of the three major teleosts groups whose phylogenetic position has been debatable (Le et al. 93; Inoue et al. 2003; Near et al. this year; Betancur-R 2013; Faircloth ainsi que al. 2013; Chen ainsi que al. 2015; Hughes ou al. 2018). Fossil information and some early molecular studies, including a current comprehensive analysis of > 300 Actinopterygii species (Hughes et al. 2018), positioned Osteoglossomorpha because the oldest teleost group (Greenwood 1970; Inoue et al. 2003), although other studies located Elopomorpha as the most ancestral one (Near et 's. 2012; Betancur-R 2013; Faircloth et al. 2013). Just lately, a phylogenetic study depending on whole genome sequencing in the bony-tongued Asian arowana (Scleropages formosus) suggested that the particular branching of Elopomorpha in addition to Osteoglossomorpha occurred almost simultaneously, positioning them as cousin lineages of Clupeocephala (Bian 2016). Within this framework, the genome of typically the Pirarucu provides new information to study the historical past of teleosts as nicely as providing useful information for sustainable exploration associated with this giant Amazon seafood. Here, we present typically the first whole genome assemblage, gene annotation, and phylogenomic inference of the Pirarucu that ought to facilitate the molecular characterization and conservation of this economically important species of fish species.
Aquarium Type:
Sample Collection and SequencingGenomic DNA was extracted from peripheral blood samples associated with four adult individuals (two males and two females) of Arapaima gigas: NCBI taxonomy ID 113544, FishBase ID: 2076. All selections were collected in accordance with the standards associated with the Federal University regarding Pará animal protocol. We applied a whole-genome shotgun sequencing strategy using a couple of short-insert libraries (400 plus 500 bp) in an Illumina HiSeq 2500 platform according in order to the manufacturer’s instructions (Illumina, San Diego, CA). HiSeq Rapid SBS Kits (FC-402-4021) and HiSeq Rapid Cluster Kits (PE-402-4002) were used to sequence paired-end go through of 2 × 250 base sets. Read quality was checked using FastQC, version zero. 11. 4 (Andrews 2010), and low-quality reads had been trimmed with Sickle paired-end (pe), version 1. 33 (Joshi and Fass 2011), under default parameters.Genome Size Estimation and Para Novo AssemblyGenome dimension was estimated based about the k-mer spectrum with the following formula: G= (N×(L−K + 1)−B)/D. Where N is the total read count, T will be the read length, K is k-mer length (K = 31), B is the complete low-frequency (frequency ≤1) k-mer count, D is the k-mer depth, and G is the genome dimension. Jellyfish 2. 2. six (Marçais and Kingsford 2011) was used to count k-mer frequencies of high-quality sequencing reads.Genome assembly was performed using SOAPdenovo2 (version 2. 04) (Luo et al. 2012) under default parameters (127mer version). Three assemblies were performed: 1) using all scans; 2) with reads through male samples; and 3) with reads from woman samples. Subsequently, gaps were filled using Redundants (Pryszcz and Gabaldón 2016) applying three-run scaffolding steps: first of all using the default value regarding minimum read pairs to joining contigs (5 pairs), subsequently rerunning with earlier data using a minimum benefit of four read pairs and, finally, utilizing a lowest of three read pairs. Assembly quality and statistics were assessed with QUAST (version 4. 4) (Gurevich et al. 2013).
THE FUCKING OCEAN YOU GUYS — Arapaima Gigas
Assessment of Genome CompletenessAssemblage quality was measured by assessing gene completeness along with Benchmarking Universal Single-Copy Orthologs (BUSCO) (Simão et ing. 2015) based on four, 584 BUSCO groups produced from Actinopterygii orthologs.Replicate AnalysisTransposable elements (TEs) and other repetitive factors of the Pirarucu genome were identified by a new combined, homology-based method and a de novo annotation approach. Initially, tandem repeats were identified with Conjunction Repeats Finder 4. 2009 (Benson 1999) with typically the following parameters: “Match=2, Mismatch=7, Delta=7, PM=80, PI=10, Minscore=50, and MaxPerid=2, 000. ” Additionally, a de novo repeat library was developed with RepeatModeler 1. zero. 9 and LTR_FINDER (Xu and Wang 2007), plus filtered with LTR_retriever (Ou and Jiang 2017) below default parameters. Subsequently, recognized and novel transposable elements were identified by umschlüsselung the assembled sequences towards the Repbase TE 22. 05 (Bao et al. 2015) and de novo repeat libraries using RepeatMasker four. 0 (Tarailo-Graovac and Chen 2009). In addition, all of us annotated TE-related proteins applying RepeatProteinMask 4. 0 (Tarailo-Graovac and Chen 2009).Gene Structure and Function AnnotationGenome annotation was carried out with the MAKER2 pipeline (Holt and Yandell 2011) within a two-pass iteration. 1st, homology annotation was carried out with protein data through Homo sapiens (human), Danio rerio (zebrafish), Takifugu rubripes (Japanese fugu), Tetraodon nigroviridis (spotted green pufferfish), Gasterosteus aculeatus (three-spined stickleback), Oryzias latipes (Japanese medaka), Latimeria chalumnae (coelacanth) (Ensembl launch 88), together with Scleropages formosus (Asian arowana) proteins sequences from NCBI RefSeq annotation data. Subsequently, de novo annotations were performed using the homology-based outcomes achieved in the first step. We also used RepeatModeller 1. 0. 9 (Smit and Hubley 2008) to build a de novo repeat library with default parameters. The GFF output from the first step was used to train the SNAP 20131129 (Korf 2004) and AUGUSTUS 3. 2. 3 (Stanke et al. 2008) predictors. GeneMark-ES 4. 32 (Lomsadze et al. 2005) was trained using the genome assembly itself. InterProScan 5. 24-63. 0 (Jones ainsi que al. 2014) was run on the protein output of MAKER, providing gene ontologies and classifying protein domain names and families. Protein result was compared using BLAST against the NCBI NR database (available on Might 29, 2017) for identifying putative gene names. Blast2GO v5 (Conesa et 's. 2005) was subsequently used to obtain Gene Ontology mapping and annotation (supplementary file S2, Supplementary Materials online).