introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotides of the intron (e.g., GT-AG)
- is_minor: INTEGER, whether the intron is a minor intron (1) or not (0)
- score: REAL, estimated probability, as a percentage (0-100), that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, whether the intron lies within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
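
In the rows shown on this page, `length` equals `end - start + 1`, which suggests that both coordinates are inclusive. The sketch below checks that relationship on three rows copied from this page, using an in-memory SQLite table. The table is a simplified subset of the real schema (no foreign keys), and `length_check` is a hypothetical helper name, not part of the database.

```python
import sqlite3

# Three rows copied from this page: (id, length, ordinal_index, start, end).
ROWS = [
    (115633320, 4728, 1, 98185267, 98189994),
    (115633321, 201, 2, 98184977, 98185177),
    (115633322, 536, 3, 98184276, 98184811),
]


def length_check(rows):
    """Return (id, stored_length, computed_length) for each row."""
    conn = sqlite3.connect(":memory:")
    # Simplified subset of the introns schema (assumption: foreign keys omitted).
    conn.execute(
        'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "length" INTEGER, '
        '"ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER)'
    )
    conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", rows)
    # "end" is an SQL keyword, so the column name is quoted.
    result = conn.execute(
        'SELECT "id", "length", "end" - "start" + 1 '
        'FROM introns ORDER BY "ordinal_index"'
    ).fetchall()
    conn.close()
    return result


for intron_id, stored, computed in length_check(ROWS):
    print(intron_id, stored, computed, stored == computed)
```

Quoting `"end"` matters when writing queries by hand, because `END` is a reserved word in SQLite.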
54 rows where transcript_id = 21436521
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115633320 | GT-AG | 0 | 1.000000099473604e-05 | 4728 | rna-XM_034060800.1 21436521 | 1 | 98185267 | 98189994 | Melopsittacus undulatus 13146 | AAG\|GTGAGGTCGG...GGGATCTGATTG/TGGGATCTGATT...TGCAG\|AAG | 0 | 1 | 1.296 |
| 115633321 | GT-AG | 0 | 1.000000099473604e-05 | 201 | rna-XM_034060800.1 21436521 | 2 | 98184977 | 98185177 | Melopsittacus undulatus 13146 | GAG\|GTGAGACCAC...TCTGTGTTGCCC/TGTGGGGTCACA...CACAG\|GCC | 2 | 1 | 2.364 |
| 115633322 | GT-AG | 0 | 1.000000099473604e-05 | 536 | rna-XM_034060800.1 21436521 | 3 | 98184276 | 98184811 | Melopsittacus undulatus 13146 | CAG\|GTGGGTGACA...ACCCCCGTACCT/CGATGTGTTATA...TGCAG\|GAA | 2 | 1 | 4.344 |
| 115633323 | GT-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_034060800.1 21436521 | 4 | 98184075 | 98184193 | Melopsittacus undulatus 13146 | AAG\|GTGACACAGG...GAAGGTTTAATT/GAAGGTTTAATT...GGCAG\|CTG | 0 | 1 | 5.328 |
| 115633324 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-XM_034060800.1 21436521 | 5 | 98183877 | 98183952 | Melopsittacus undulatus 13146 | CGG\|GTGAGTGGGG...TCCTCCTCATTT/CTCCTCCTCATT...TCCAG\|GGA | 2 | 1 | 6.791 |
| 115633325 | GT-AG | 0 | 1.000000099473604e-05 | 1011 | rna-XM_034060800.1 21436521 | 6 | 98182783 | 98183793 | Melopsittacus undulatus 13146 | AAG\|GTAAAAGGGG...TTTCCCTTCTCA/TCCCTTCTCATC...TGCAG\|AGA | 1 | 1 | 7.787 |
| 115633326 | GT-AG | 0 | 8.129218492032225e-05 | 487 | rna-XM_034060800.1 21436521 | 7 | 98182210 | 98182696 | Melopsittacus undulatus 13146 | AAG\|GTAGCACTGG...GGCACCATAAAA/CCCCCCATCATC...GACAG\|CCC | 0 | 1 | 8.819 |
| 115633327 | GT-AG | 0 | 1.000000099473604e-05 | 71 | rna-XM_034060800.1 21436521 | 8 | 98181853 | 98181923 | Melopsittacus undulatus 13146 | TGG\|GTAGGTGGAT...ACCATCTGAACT/TGGGTGCTGAAC...TGTAG\|ATG | 1 | 1 | 12.251 |
| 115633328 | GT-AG | 0 | 9.71329027217698e-05 | 205 | rna-XM_034060800.1 21436521 | 9 | 98181541 | 98181745 | Melopsittacus undulatus 13146 | AAG\|GTACCAGTGT...CACCCTGTACCG/CTGAAGCTGACA...CACAG\|CTC | 0 | 1 | 13.535 |
| 115633329 | GT-AG | 0 | 0.0078128347678529 | 94 | rna-XM_034060800.1 21436521 | 10 | 98181363 | 98181456 | Melopsittacus undulatus 13146 | CAG\|GTATCATGTG...AATCCCATGGCT/AGCAGACTCACC...CATAG\|GTC | 0 | 1 | 14.543 |
| 115633330 | GT-AG | 0 | 1.000000099473604e-05 | 138 | rna-XM_034060800.1 21436521 | 11 | 98181141 | 98181278 | Melopsittacus undulatus 13146 | AAG\|GTGGGCAGCG...TGGGGTTCAGCA/GTGGGGTTCAGC...TGCAG\|GAG | 0 | 1 | 15.551 |
| 115633331 | GT-AG | 0 | 1.000000099473604e-05 | 529 | rna-XM_034060800.1 21436521 | 12 | 98180513 | 98181041 | Melopsittacus undulatus 13146 | ATG\|GTGAGCATCC...AGGGTTGTACCA/GTTGTACCAACC...GGCAG\|GCA | 0 | 1 | 16.739 |
| 115633332 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_034060800.1 21436521 | 13 | 98179854 | 98179940 | Melopsittacus undulatus 13146 | CAG\|GTGATGCTCA...TGCTCCTTCCCC/ACCACCCTCACT...ACCAG\|CTT | 2 | 1 | 23.602 |
| 115633333 | GT-AG | 0 | 1.000000099473604e-05 | 100 | rna-XM_034060800.1 21436521 | 14 | 98179621 | 98179720 | Melopsittacus undulatus 13146 | TGG\|GTGAGCATCG...CCCTCCTTGTCC/GGATGCCTGATG...CCCAG\|CAC | 0 | 1 | 25.198 |
| 115633334 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_034060800.1 21436521 | 15 | 98179420 | 98179500 | Melopsittacus undulatus 13146 | GAG\|GTAGGAGACC...GGGATCTGAGCA/GCACAGTTCATC...TCCAG\|TCC | 0 | 1 | 26.638 |
| 115633335 | GT-AG | 0 | 2.022517405160177e-05 | 93 | rna-XM_034060800.1 21436521 | 16 | 98178996 | 98179088 | Melopsittacus undulatus 13146 | CAG\|GTATGTCAGG...AGGGTCTTCTCC/CCCCAGCTCATC...CTCAG\|GAC | 1 | 1 | 30.61 |
| 115633336 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_034060800.1 21436521 | 17 | 98178808 | 98178912 | Melopsittacus undulatus 13146 | CAG\|GTGAGGAGAG...ATGCTCTTATGG/GATGCTCTTATG...TCTAG\|GCC | 0 | 1 | 31.605 |
| 115633337 | GT-AG | 0 | 0.0001325050964768 | 457 | rna-XM_034060800.1 21436521 | 18 | 98178258 | 98178714 | Melopsittacus undulatus 13146 | AAG\|GTATGTTGGG...AGCCTCTCACCT/AAGCCTCTCACC...GGCAG\|GAT | 0 | 1 | 32.721 |
| 115633338 | GT-AG | 0 | 1.000000099473604e-05 | 556 | rna-XM_034060800.1 21436521 | 19 | 98177530 | 98178085 | Melopsittacus undulatus 13146 | CAG\|GTGAGGACCT...CTGCCCTGTGCC/ACAGGGCTCACA...TGCAG\|AGA | 1 | 1 | 34.785 |
| 115633339 | GT-AG | 0 | 2.660665709565589e-05 | 96 | rna-XM_034060800.1 21436521 | 20 | 98177300 | 98177395 | Melopsittacus undulatus 13146 | AAG\|GTGTGCTGGG...CAAGCCTCACCT/TCACCTCTCACC...GGCAG\|GTC | 0 | 1 | 36.393 |
| 115633340 | GT-AG | 0 | 1.000000099473604e-05 | 74 | rna-XM_034060800.1 21436521 | 21 | 98177066 | 98177139 | Melopsittacus undulatus 13146 | TAG\|GTGGGAGCTG...GCCTCCATGGCT/TGGCTACTCACA...TGCAG\|GTC | 1 | 1 | 38.313 |
| 115633341 | GT-AG | 0 | 1.000000099473604e-05 | 552 | rna-XM_034060800.1 21436521 | 22 | 98176411 | 98176962 | Melopsittacus undulatus 13146 | TGG\|GTGAGCTGCC...ATGCCCATGTAT/CAGTTGCACACA...ATCAG\|CAC | 2 | 1 | 39.549 |
| 115633342 | GT-AG | 0 | 1.000000099473604e-05 | 136 | rna-XM_034060800.1 21436521 | 23 | 98176121 | 98176256 | Melopsittacus undulatus 13146 | CAG\|GTACAGCACC...AGCCCCATGTCC/TGAACAATCAGC...TGCAG\|GTC | 0 | 1 | 41.397 |
| 115633343 | GT-AG | 0 | 1.000000099473604e-05 | 79 | rna-XM_034060800.1 21436521 | 24 | 98175883 | 98175961 | Melopsittacus undulatus 13146 | AAG\|GTAAGGCAGA...GGGGCTTTGCTG/CTCTTGCTCAGG...ACTAG\|GTC | 0 | 1 | 43.305 |
| 115633344 | GT-AG | 0 | 1.000000099473604e-05 | 173 | rna-XM_034060800.1 21436521 | 25 | 98175550 | 98175722 | Melopsittacus undulatus 13146 | CAG\|GTACTATGTG...GTGCTCTCTCCT/AGGTGTCCCACC...CATAG\|ACC | 1 | 1 | 45.224 |
| 115633345 | GT-AG | 0 | 1.000000099473604e-05 | 290 | rna-XM_034060800.1 21436521 | 26 | 98175174 | 98175463 | Melopsittacus undulatus 13146 | AAG\|GTGAGCCAGG...TCCCCCTTTGCT/GCTGGGGTAACA...TGCAG\|CTC | 0 | 1 | 46.256 |
| 115633346 | GT-AG | 0 | 1.000000099473604e-05 | 78 | rna-XM_034060800.1 21436521 | 27 | 98174427 | 98174504 | Melopsittacus undulatus 13146 | CAG\|GTGAGCCCCA...AACCCCTCACCC/GAACCCCTCACC...GGCAG\|CAA | 0 | 1 | 54.284 |
| 115633347 | GT-AG | 0 | 1.000000099473604e-05 | 74 | rna-XM_034060800.1 21436521 | 28 | 98174174 | 98174247 | Melopsittacus undulatus 13146 | CAG\|GTGAGTCCTG...GCTGCCTGCTCT/GATGGGTGCATG...TGCAG\|CCT | 2 | 1 | 56.431 |
| 115633348 | GT-AG | 0 | 1.000000099473604e-05 | 900 | rna-XM_034060800.1 21436521 | 29 | 98173114 | 98174013 | Melopsittacus undulatus 13146 | AAG\|GTGCATGGGC...CTCACTGTGCCC/GTGGGGCTCACT...CCCAG\|ATC | 0 | 1 | 58.351 |
| 115633349 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_034060800.1 21436521 | 30 | 98172895 | 98172975 | Melopsittacus undulatus 13146 | CAG\|GTGGGTCACC...CAGCCCTTGAGG/CCCCATCTGATG...CCCAG\|TTG | 0 | 1 | 60.007 |
| 115633350 | GT-AG | 0 | 1.000000099473604e-05 | 216 | rna-XM_034060800.1 21436521 | 31 | 98172373 | 98172588 | Melopsittacus undulatus 13146 | CAC\|GTGGGTGCCA...CCCCTCTTGTCC/TGGGTGTCCAGC...GGTAG\|GTG | 0 | 1 | 63.679 |
| 115633351 | GT-AG | 0 | 1.000000099473604e-05 | 862 | rna-XM_034060800.1 21436521 | 32 | 98171361 | 98172222 | Melopsittacus undulatus 13146 | CAG\|GTATTGCCTG...TGTCACTTCACT/TGTCACTTCACT...GGCAG\|GAG | 0 | 1 | 65.479 |
| 115633352 | GT-AG | 0 | 1.000000099473604e-05 | 79 | rna-XM_034060800.1 21436521 | 33 | 98171124 | 98171202 | Melopsittacus undulatus 13146 | CCG\|GTGAGCCCCA...GGTGCCATGACT/GGGTGCCTCACC...CCCAG\|GAA | 2 | 1 | 67.375 |
| 115633353 | GT-AG | 0 | 1.000000099473604e-05 | 187 | rna-XM_034060800.1 21436521 | 34 | 98170812 | 98170998 | Melopsittacus undulatus 13146 | TGG\|GTGAGCCTGA...TCCCCCCCAGCT/AGCCTGCTGACC...TGCAG\|GAG | 1 | 1 | 68.874 |
| 115633354 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_034060800.1 21436521 | 35 | 98170603 | 98170690 | Melopsittacus undulatus 13146 | CCA\|GTGAGTGTTC...GGGGTCTTACCC/TGGGGTCTTACC...CCCAG\|GGC | 2 | 1 | 70.326 |
| 115633355 | GT-AG | 0 | 1.000000099473604e-05 | 219 | rna-XM_034060800.1 21436521 | 36 | 98170226 | 98170444 | Melopsittacus undulatus 13146 | GAG\|GTAACAGCGT...GTGTCCACATCT/TGTGTCCACATC...GGCAG\|GGA | 1 | 1 | 72.222 |
| 115633356 | GT-AG | 0 | 1.000000099473604e-05 | 94 | rna-XM_034060800.1 21436521 | 37 | 98169998 | 98170091 | Melopsittacus undulatus 13146 | AAG\|GTGGTGGTGA...TGCTCCATGTCC/CTCCCCTCCATC...CACAG\|GTG | 0 | 1 | 73.83 |
| 115633357 | GT-AG | 0 | 0.0411766479745787 | 413 | rna-XM_034060800.1 21436521 | 38 | 98169477 | 98169889 | Melopsittacus undulatus 13146 | CAG\|GTACCCCACA...CCACCCTTCCTT/TCTCTGCCAACC...CCCAG\|AAA | 0 | 1 | 75.126 |
| 115633358 | GT-AG | 0 | 1.000000099473604e-05 | 77 | rna-XM_034060800.1 21436521 | 39 | 98169304 | 98169380 | Melopsittacus undulatus 13146 | GTG\|GTGAGCCCCA...CCAGCCTCACCC/TCCAGCCTCACC...CCCAG\|TTC | 0 | 1 | 76.278 |
| 115633359 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_034060800.1 21436521 | 40 | 98169095 | 98169178 | Melopsittacus undulatus 13146 | AAA\|GTGAGTACCC...CCCCCCTCCACC/TCCTGGTGGACC...CCCAG\|GTA | 2 | 1 | 77.778 |
| 115633360 | GT-AG | 0 | 1.000000099473604e-05 | 73 | rna-XM_034060800.1 21436521 | 41 | 98168881 | 98168953 | Melopsittacus undulatus 13146 | CAG\|GTGGGTGCCC...CAGGCCTCAACC/ACAGGCCTCAAC...TCCAG\|GTT | 2 | 1 | 79.47 |
| 115633361 | GT-AG | 0 | 1.000000099473604e-05 | 94 | rna-XM_034060800.1 21436521 | 42 | 98168656 | 98168749 | Melopsittacus undulatus 13146 | ACG\|GTGAGGGGCA...CATCCCTCACCT/GCATCCCTCACC...TGCAG\|GCT | 1 | 1 | 81.042 |
| 115633362 | GT-AG | 0 | 1.000000099473604e-05 | 109 | rna-XM_034060800.1 21436521 | 43 | 98168437 | 98168545 | Melopsittacus undulatus 13146 | CTG\|GTGAGGTGGC...CCCATCCTGATG/CCCATCCTGATG...GACAG\|GAG | 0 | 1 | 82.361 |
| 115633363 | GT-AG | 0 | 1.000000099473604e-05 | 71 | rna-XM_034060800.1 21436521 | 44 | 98168248 | 98168318 | Melopsittacus undulatus 13146 | AAG\|GTGAGCCATG...CCCCACATGACC/ACCGTCCCCATC...CCCAG\|GGG | 1 | 1 | 83.777 |
| 115633364 | GT-AG | 0 | 0.0003105761200061 | 80 | rna-XM_034060800.1 21436521 | 45 | 98168070 | 98168149 | Melopsittacus undulatus 13146 | AAG\|GTAGCTCATA...CCCTCCTTCTCA/CTCCTTCTCATG...TCCAG\|GAG | 0 | 1 | 84.953 |
| 115633365 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-XM_034060800.1 21436521 | 46 | 98167860 | 98167952 | Melopsittacus undulatus 13146 | GAG\|GTGAGGACAA...GTTGCTGTCTCC/AGAGTACCCACC...TGCAG\|GGC | 0 | 1 | 86.357 |
| 115633366 | GT-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_034060800.1 21436521 | 47 | 98167651 | 98167769 | Melopsittacus undulatus 13146 | CTG\|GTGAGTGCAG...CTCTCCATCTCT/AGCTATGTCACC...CCCAG\|GTC | 0 | 1 | 87.437 |
| 115633367 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_034060800.1 21436521 | 48 | 98167457 | 98167540 | Melopsittacus undulatus 13146 | AAA\|GTGAGTTGGG...ATGTCCTTCCCC/GTGCCAGTCATG...ACCAG\|GAC | 2 | 1 | 88.757 |
| 115633368 | GT-AG | 0 | 6.329575463301784e-05 | 272 | rna-XM_034060800.1 21436521 | 49 | 98167012 | 98167283 | Melopsittacus undulatus 13146 | TAG\|GTATGTCCTC...AGCACCTCACAG/GAGCACCTCACA...CACAG\|ATG | 1 | 1 | 90.833 |
| 115633369 | GT-AG | 0 | 1.000000099473604e-05 | 92 | rna-XM_034060800.1 21436521 | 50 | 98166825 | 98166916 | Melopsittacus undulatus 13146 | CAG\|GTAGGGGCTT...AGGTCACTGACG/AGGTCACTGACG...GCCAG\|GGT | 0 | 1 | 91.973 |
| 115633370 | GT-AG | 0 | 1.000000099473604e-05 | 75 | rna-XM_034060800.1 21436521 | 51 | 98166633 | 98166707 | Melopsittacus undulatus 13146 | AAG\|GTGAGCTGGG...TCATCATTGCCT/ACCATCATCATT...CTTAG\|GAT | 0 | 1 | 93.377 |
| 115633371 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_034060800.1 21436521 | 52 | 98166390 | 98166470 | Melopsittacus undulatus 13146 | AAG\|GTATGGCCAA...TTCCCTCTGACT/TTCCCTCTGACT...TTTAG\|GAC | 0 | 1 | 95.32 |
| 115633372 | GT-AG | 0 | 1.000000099473604e-05 | 95 | rna-XM_034060800.1 21436521 | 53 | 98166146 | 98166240 | Melopsittacus undulatus 13146 | GAG\|GTGAAGGCTG...CCCATCCTATCA/CATCCTATCACC...CACAG\|CCT | 2 | 1 | 97.108 |
| 115633373 | GT-AG | 0 | 1.000000099473604e-05 | 694 | rna-XM_034060800.1 21436521 | 54 | 98165316 | 98166009 | Melopsittacus undulatus 13146 | GAG\|GTGAGCGTGG...GGGTGCTCATCC/TGGGTGCTCATC...TCCAG\|GTG | 0 | 1 | 98.74 |
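
In every row above, the intron portion of `scored_motifs` starts with GT and ends with AG, matching the `dinucleotide_pair` column. The sketch below derives the pair from the motif string. It assumes the layout is `upstream_flank|intron_motifs|downstream_flank`, which is inferred from the rows on this page rather than documented, and `terminal_dinucleotides` is a hypothetical helper name.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a pair such as 'GT-AG' from a scored_motifs string.

    Assumed layout (inferred, not documented):
        upstream_flank|intron_motifs|downstream_flank
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    # First two and last two bases of the intron portion.
    return f"{intron[:2]}-{intron[-2:]}"


# scored_motifs value of intron 115633320 (first row of the table).
example = "AAG|GTGAGGTCGG...GGGATCTGATTG/TGGGATCTGATT...TGCAG|AAG"
print(terminal_dinucleotides(example))  # prints GT-AG
```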
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
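
The `idx_introns_transcript_id` index is what lets a filter such as `transcript_id = 21436521` avoid a full table scan. As a rough illustration, the sketch below builds a cut-down, empty copy of the table in memory and asks SQLite for its query plan. The reduced column set and the helper name `plan_for_transcript_lookup` are assumptions made for this example, and the exact plan wording varies between SQLite versions.

```python
import sqlite3


def plan_for_transcript_lookup(transcript_id=21436521):
    """Return the 'detail' strings of SQLite's plan for a transcript_id filter."""
    conn = sqlite3.connect(":memory:")
    # Cut-down copy of the introns table (assumption: only three columns kept).
    conn.execute(
        'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, '
        '"transcript_id" INTEGER, "ordinal_index" INTEGER)'
    )
    conn.execute(
        "CREATE INDEX idx_introns_transcript_id ON introns (transcript_id)"
    )
    rows = conn.execute(
        "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
        (transcript_id,),
    ).fetchall()
    conn.close()
    # The human-readable description is the fourth column of each plan row.
    return [row[3] for row in rows]


for detail in plan_for_transcript_lookup():
    print(detail)
```

If the plan mentions `idx_introns_transcript_id`, the lookup is served by the index instead of scanning every intron.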