introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
28 rows where transcript_id = 720747
This data as json, CSV (advanced)
Suggested facets: score, length, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821641 | GT-AG | 0 | 1.000000099473604e-05 | 174 | rna-gnl|I4U23|003024-T1 720747 | 1 | 9498223 | 9498396 | Adineta vaga 104782 | CAG|GTAAGCAGAT...ACATCTTTAAAA/AAAATAATAATT...TATAG|GTT | 1 | 1 | 0.78 |
3821642 | GT-AG | 0 | 1.000000099473604e-05 | 97 | rna-gnl|I4U23|003024-T1 720747 | 2 | 9497238 | 9497334 | Adineta vaga 104782 | GCG|GTAGGTGAAC...TTTACTTTAATA/AATTGTTTTACT...TACAG|GAT | 1 | 1 | 11.598 |
3821643 | GT-AG | 0 | 6.672152174951543e-05 | 52 | rna-gnl|I4U23|003024-T1 720747 | 3 | 9496298 | 9496349 | Adineta vaga 104782 | GCG|GTAAGTTTTA...ACTTCTTTTAAC/ACAAAATTGACT...GTTAG|ATC | 1 | 1 | 22.417 |
3821644 | GT-AG | 0 | 4.079233726148156e-05 | 54 | rna-gnl|I4U23|003024-T1 720747 | 4 | 9496217 | 9496270 | Adineta vaga 104782 | AAG|GTTCATTTTT...TAAATTTTGAAT/TTTGAATTAATT...TCTAG|AAC | 1 | 1 | 22.746 |
3821645 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|003024-T1 720747 | 5 | 9496097 | 9496150 | Adineta vaga 104782 | AAA|GTGAAATTCT...TACTTCTTAATT/TCTTAATTCATA...GTCAG|GTT | 1 | 1 | 23.55 |
3821646 | GT-AG | 0 | 2.8554319246864748e-05 | 58 | rna-gnl|I4U23|003024-T1 720747 | 6 | 9495575 | 9495632 | Adineta vaga 104782 | AGT|GTAAGTATAA...TTGTACTTAAAT/CTTCTTCTGATA...AACAG|GAT | 0 | 1 | 29.203 |
3821647 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|003024-T1 720747 | 7 | 9495391 | 9495444 | Adineta vaga 104782 | CAG|GTGTTAATGA...TTTTCTTTGAAA/AACTTTTTGACT...CGTAG|GTA | 1 | 1 | 30.787 |
3821648 | GT-AG | 0 | 1.548710735252888e-05 | 55 | rna-gnl|I4U23|003024-T1 720747 | 8 | 9495137 | 9495191 | Adineta vaga 104782 | CGG|GTAAATCTTT...TTGTTTATAATC/ATGTTGTTTATA...TTCAG|TTG | 2 | 1 | 33.212 |
3821649 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|I4U23|003024-T1 720747 | 9 | 9494863 | 9494924 | Adineta vaga 104782 | TCG|GTACGAAACA...CTTTTCTTTTCT/CTTTTCTTCAAA...ATTAG|GAA | 1 | 1 | 35.794 |
3821650 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|003024-T1 720747 | 10 | 9494555 | 9494607 | Adineta vaga 104782 | ATG|GTATGACAAA...CATGTCTTGTTT/TTCATACTAACA...TTTAG|GTT | 1 | 1 | 38.901 |
3821651 | GT-AG | 0 | 8.873659236736226e-05 | 51 | rna-gnl|I4U23|003024-T1 720747 | 11 | 9494156 | 9494206 | Adineta vaga 104782 | CTA|GTAAATAACA...CATCCTTTAATC/ATCTGTTTCATC...TGTAG|ATG | 1 | 1 | 43.141 |
3821652 | GT-AG | 0 | 0.0004023686947579 | 52 | rna-gnl|I4U23|003024-T1 720747 | 12 | 9494045 | 9494096 | Adineta vaga 104782 | AAT|GTTTGTATTT...TTGTTCATAGTC/TAATTATTTATC...TATAG|ATC | 0 | 1 | 43.86 |
3821653 | GT-AG | 0 | 1.000000099473604e-05 | 69 | rna-gnl|I4U23|003024-T1 720747 | 13 | 9493658 | 9493726 | Adineta vaga 104782 | ACG|GTAAGATTCT...GATACATTAATT/CATTAATTAATT...TATAG|GCG | 0 | 1 | 47.734 |
3821654 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl|I4U23|003024-T1 720747 | 14 | 9493483 | 9493542 | Adineta vaga 104782 | CAA|GTAAGATAAT...TTTTTCTTCTTT/AATGAATTAATC...AACAG|ATC | 1 | 1 | 49.135 |
3821655 | GT-AG | 0 | 2.011510951634679e-05 | 62 | rna-gnl|I4U23|003024-T1 720747 | 15 | 9493104 | 9493165 | Adineta vaga 104782 | CGA|GTCTGTGACT...TTTTCTTCAGTT/TTTTTCTTCAGT...TCTAG|TCG | 0 | 1 | 52.997 |
3821656 | GT-AG | 0 | 8.234643748765458e-05 | 58 | rna-gnl|I4U23|003024-T1 720747 | 16 | 9492637 | 9492694 | Adineta vaga 104782 | CAA|GTAATTTATC...CAGTTCATAGTT/TTTAGATTGATA...TTTAG|ATT | 1 | 1 | 57.98 |
3821657 | GT-AG | 0 | 2.0093230231380915e-05 | 61 | rna-gnl|I4U23|003024-T1 720747 | 17 | 9492313 | 9492373 | Adineta vaga 104782 | CTA|GTAAGTTAGT...AGTTCATTGAAA/TTTAAGTTCATT...TCAAG|GTT | 0 | 1 | 61.184 |
3821658 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|003024-T1 720747 | 18 | 9492031 | 9492084 | Adineta vaga 104782 | AAA|GTAAGAAAAT...AATATTCTAACA/AATATTCTAACA...TCTAG|TTG | 0 | 1 | 63.962 |
3821659 | GT-AG | 0 | 0.0001045225726667 | 51 | rna-gnl|I4U23|003024-T1 720747 | 19 | 9491762 | 9491812 | Adineta vaga 104782 | AAA|GTAGATTGAG...CTTTCTTTTTCA/AATATATTGATT...TTTAG|ATC | 2 | 1 | 66.618 |
3821660 | GT-AG | 0 | 1.7641636266589443e-05 | 52 | rna-gnl|I4U23|003024-T1 720747 | 20 | 9491544 | 9491595 | Adineta vaga 104782 | CAA|GTAAGTTGGT...CTTCTCTTGTTT/TCTTGTTTCACT...CATAG|ATT | 0 | 1 | 68.64 |
3821661 | GT-AG | 0 | 0.0014472239182762 | 61 | rna-gnl|I4U23|003024-T1 720747 | 21 | 9491192 | 9491252 | Adineta vaga 104782 | CAG|GTTTATTTAA...TTTGTTTTGATT/TTTGTTTTGATT...TTTAG|ATT | 0 | 1 | 72.186 |
3821662 | GT-AG | 0 | 0.0032808242755423 | 49 | rna-gnl|I4U23|003024-T1 720747 | 22 | 9490991 | 9491039 | Adineta vaga 104782 | ACT|GTACGTAGTA...TATTCCTTCTTT/TTTATTCAAATT...ATCAG|TGA | 2 | 1 | 74.038 |
3821663 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|003024-T1 720747 | 23 | 9490718 | 9490770 | Adineta vaga 104782 | AAA|GTAAATGATT...CATTTCTTTTCT/GTTTGTATCATT...CATAG|ATT | 0 | 1 | 76.718 |
3821664 | GT-AG | 0 | 0.0002855902555229 | 64 | rna-gnl|I4U23|003024-T1 720747 | 24 | 9490522 | 9490585 | Adineta vaga 104782 | AAT|GTTTGTTCTT...TTCACTCTAATA/AATAAATTCATT...TCTAG|GAT | 0 | 1 | 78.326 |
3821665 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|003024-T1 720747 | 25 | 9490212 | 9490263 | Adineta vaga 104782 | AAG|GTAAAAGTTT...TCTTCTTTGAAT/TTTGAATTTATA...TATAG|ATA | 0 | 1 | 81.469 |
3821666 | GT-AG | 0 | 0.0008105610009017 | 69 | rna-gnl|I4U23|003024-T1 720747 | 26 | 9489527 | 9489595 | Adineta vaga 104782 | AAC|GTAAGTTTTA...ATCTTTTTATCA/TTATTATTCATT...AATAG|TAT | 1 | 1 | 88.974 |
3821667 | GT-AG | 0 | 1.000000099473604e-05 | 48 | rna-gnl|I4U23|003024-T1 720747 | 27 | 9488732 | 9488779 | Adineta vaga 104782 | TAG|GTAGATTAAT...ATTTTGTTGAAA/ATTTTGTTGAAA...CTTAG|GTC | 1 | 1 | 98.075 |
3821668 | GT-AG | 0 | 0.0242699795232496 | 56 | rna-gnl|I4U23|003024-T1 720747 | 28 | 9488557 | 9488612 | Adineta vaga 104782 | AAT|GTATGTTAAT...TTCTTTTTAAAA/TTAAAATTGATT...ATTAG|GTT | 0 | 1 | 99.525 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);