introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
34 rows where transcript_id = 19079907
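
The filter above corresponds to a simple parameterized query on `transcript_id`. The following is a minimal sketch, assuming an in-memory SQLite database seeded with three rows of this result set and only a subset of the columns; the real database file is not named here, and `introns_for_transcript` is a hypothetical helper, not part of the site. It also checks that, in these rows, `length` equals `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates.

```python
import sqlite3

# Three rows copied from the listing for transcript 19079907.
# Columns: id, length, transcript_id, ordinal_index, start, end
ROWS = [
    (101756442, 8437, 19079907, 1, 6073765, 6082201),
    (101756443, 2805, 19079907, 2, 6082469, 6085273),
    (101756444, 247, 19079907, 3, 6085370, 6085616),
]

# Assumption: an in-memory database stands in for the real file.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "length" INTEGER, '
    '"transcript_id" INTEGER, "ordinal_index" INTEGER, '
    '"start" INTEGER, "end" INTEGER)'
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", ROWS)


def introns_for_transcript(conn, transcript_id):
    """Return the introns of one transcript, ordered by position in it."""
    cur = conn.execute(
        'SELECT "id", "ordinal_index", "start", "end", "length" '
        'FROM introns WHERE "transcript_id" = ? ORDER BY "ordinal_index"',
        (transcript_id,),
    )
    return cur.fetchall()


result = introns_for_transcript(conn, 19079907)
for intron_id, ordinal, start, end, length in result:
    # Observed in the listed rows: length covers both end coordinates.
    assert length == end - start + 1
    print(ordinal, intron_id, start, end, length)
```

Quoting `"end"` keeps the query valid, since `END` is an SQL keyword.
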
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 101756442 | GT-AG | 0 | 1.000000099473604e-05 | 8437 | rna-XM_042886301.1 19079907 | 1 | 6073765 | 6082201 | Lagopus leucura 30410 | CAG\|GTAAGGATCA...CAGATTTTAGTG/ACAGATTTTAGT...TTTAG\|TTT | 1 | 1 | 0.622 |
| 101756443 | GT-AG | 0 | 0.0396440160348242 | 2805 | rna-XM_042886301.1 19079907 | 2 | 6082469 | 6085273 | Lagopus leucura 30410 | CAG\|GTATACTGTT...CATTCTCTAATT/TTTACATTCATT...TGCAG\|AAA | 1 | 1 | 7.26 |
| 101756444 | GT-AG | 0 | 1.000000099473604e-05 | 247 | rna-XM_042886301.1 19079907 | 3 | 6085370 | 6085616 | Lagopus leucura 30410 | AAG\|GTAATAAAGA...GAAGTCCTAACT/GAAGTCCTAACT...TGCAG\|TTC | 1 | 1 | 9.647 |
| 101756445 | GT-AG | 0 | 1.000000099473604e-05 | 148 | rna-XM_042886301.1 19079907 | 4 | 6085704 | 6085851 | Lagopus leucura 30410 | TTG\|GTGAGTACAA...ATAATCTTATTT/CATAATCTTATT...ACAAG\|GTG | 1 | 1 | 11.81 |
| 101756446 | GT-AG | 0 | 1.000000099473604e-05 | 948 | rna-XM_042886301.1 19079907 | 5 | 6086001 | 6086948 | Lagopus leucura 30410 | AAG\|GTGCAATTGA...CATTTTCTAACT/CATTTTCTAACT...TCTAG\|GTG | 0 | 1 | 15.515 |
| 101756447 | GT-AG | 0 | 1.000000099473604e-05 | 771 | rna-XM_042886301.1 19079907 | 6 | 6087067 | 6087837 | Lagopus leucura 30410 | ATG\|GTAAGATGAA...ATTATTTTGACT/ATTATTTTGACT...ATCAG\|AGG | 1 | 1 | 18.449 |
| 101756448 | GT-AG | 0 | 1.000000099473604e-05 | 666 | rna-XM_042886301.1 19079907 | 7 | 6087887 | 6088552 | Lagopus leucura 30410 | CAC\|GTGAGTATGT...CTGTGTTTGATA/CTGTGTTTGATA...CTCAG\|GAG | 2 | 1 | 19.667 |
| 101756449 | GT-AG | 0 | 0.000346041098775 | 1905 | rna-XM_042886301.1 19079907 | 8 | 6088598 | 6090502 | Lagopus leucura 30410 | CAG\|GTAGTCTCTG...TTTCTGTTACTA/CGGAATCTAATT...CACAG\|TGC | 2 | 1 | 20.786 |
| 101756450 | GT-AG | 0 | 1.000000099473604e-05 | 700 | rna-XM_042886301.1 19079907 | 9 | 6090560 | 6091259 | Lagopus leucura 30410 | GAG\|GTGAGTAAAA...TTCTTTTTACCA/TTTCTTTTTACC...ATCAG\|AGA | 2 | 1 | 22.203 |
| 101756451 | GT-AG | 0 | 6.939762182551816e-05 | 4216 | rna-XM_042886301.1 19079907 | 10 | 6091263 | 6095478 | Lagopus leucura 30410 | AGA\|GTAAGATTTC...CTTTTCTGAACA/CTTCTGTTCATC...TTCAG\|CAG | 2 | 1 | 22.277 |
| 101756452 | GT-AG | 0 | 1.000000099473604e-05 | 620 | rna-XM_042886301.1 19079907 | 11 | 6095503 | 6096122 | Lagopus leucura 30410 | TCG\|GTAAGACAAC...TTCATCTTTTTT/AGACTATTTATT...TGCAG\|AGG | 2 | 1 | 22.874 |
| 101756453 | GT-AG | 0 | 1.000000099473604e-05 | 802 | rna-XM_042886301.1 19079907 | 12 | 6096290 | 6097091 | Lagopus leucura 30410 | CAG\|GTTGGCAAAC...GTTTTCTAAAAA/ATAGTATTAATA...CTTAG\|CAT | 1 | 1 | 27.026 |
| 101756454 | GT-AG | 0 | 1.000000099473604e-05 | 227 | rna-XM_042886301.1 19079907 | 13 | 6097225 | 6097451 | Lagopus leucura 30410 | CAG\|GTAAACAAGA...TGTGCTCTATCT/GGGATACTCACG...CCTAG\|CAA | 2 | 1 | 30.333 |
| 101756455 | GT-AG | 0 | 0.0002197890616141 | 460 | rna-XM_042886301.1 19079907 | 14 | 6097455 | 6097914 | Lagopus leucura 30410 | CAA\|GTAAGTTCTC...TTACCCTTACAT/TACTCTCTGATT...TTCAG\|ATA | 2 | 1 | 30.408 |
| 101756456 | GT-AG | 0 | 1.000000099473604e-05 | 575 | rna-XM_042886301.1 19079907 | 15 | 6098040 | 6098614 | Lagopus leucura 30410 | AAG\|GTGAGCAATG...AGTGTTTTAATC/AGTGTTTTAATC...TCTAG\|AGC | 1 | 1 | 33.516 |
| 101756457 | GT-AG | 0 | 1.000000099473604e-05 | 2955 | rna-XM_042886301.1 19079907 | 16 | 6098721 | 6101675 | Lagopus leucura 30410 | ATG\|GTAAGGGAAG...CCACCCATAGAA/AAATGACTCAAC...TGCAG\|GGA | 2 | 1 | 36.151 |
| 101756458 | GT-AG | 0 | 1.000000099473604e-05 | 366 | rna-XM_042886301.1 19079907 | 17 | 6101843 | 6102208 | Lagopus leucura 30410 | AAG\|GTAAGTGCAG...CTATTCTTATTA/CTTATTCTCACA...ATCAG\|AGA | 1 | 1 | 40.303 |
| 101756459 | GT-AG | 0 | 1.000000099473604e-05 | 941 | rna-XM_042886301.1 19079907 | 18 | 6102375 | 6103315 | Lagopus leucura 30410 | CAG\|GTTAGTCTTC...TTTTCCTGGAAA/TATTTGTTCATG...TCCAG\|GAT | 2 | 1 | 44.431 |
| 101756460 | GT-AG | 0 | 4.5391778178976586e-05 | 326 | rna-XM_042886301.1 19079907 | 19 | 6103423 | 6103748 | Lagopus leucura 30410 | TGG\|GTAAGTTATC...TTTTTCCTAACT/TTTTTCCTAACT...TGCAG\|AGG | 1 | 1 | 47.091 |
| 101756461 | GT-AG | 0 | 1.000000099473604e-05 | 142 | rna-XM_042886301.1 19079907 | 20 | 6103779 | 6103920 | Lagopus leucura 30410 | AAG\|GTAAGGACAA...TTTTCTTTGCCT/CTTCCTCTCATA...TAAAG\|AAC | 1 | 1 | 47.837 |
| 101756462 | GT-AG | 0 | 1.000000099473604e-05 | 2201 | rna-XM_042886301.1 19079907 | 21 | 6104058 | 6106258 | Lagopus leucura 30410 | AAG\|GTAATACGTT...CTTTCTGTGATT/GAAATATTTATC...TTTAG\|AAA | 0 | 1 | 51.243 |
| 101756463 | GT-AG | 0 | 1.000000099473604e-05 | 1166 | rna-XM_042886301.1 19079907 | 22 | 6106337 | 6107502 | Lagopus leucura 30410 | AAG\|GTGAGTTCCT...TTCCTTTTACTT/TTTACTTTCACT...CACAG\|CCT | 0 | 1 | 53.182 |
| 101756464 | GT-AG | 0 | 1.000000099473604e-05 | 1389 | rna-XM_042886301.1 19079907 | 23 | 6107663 | 6109051 | Lagopus leucura 30410 | TTG\|GTAAGTGAAA...TGGTTTGTAAAC/TGCATTCTGACG...TTCAG\|ATG | 1 | 1 | 57.161 |
| 101756465 | GT-AG | 0 | 1.000000099473604e-05 | 3288 | rna-XM_042886301.1 19079907 | 24 | 6109157 | 6112444 | Lagopus leucura 30410 | TGG\|GTAAGAAGTG...GTTCTTTTATAT/TTTTATATCATA...TTCAG\|GCT | 1 | 1 | 59.771 |
| 101756466 | GT-AG | 0 | 1.000000099473604e-05 | 1957 | rna-XM_042886301.1 19079907 | 25 | 6112634 | 6114590 | Lagopus leucura 30410 | TTG\|GTGAGTTCTA...TACTTCTAAATG/TTACTTCTAAAT...TACAG\|CTC | 1 | 1 | 64.47 |
| 101756467 | GT-AG | 0 | 1.000000099473604e-05 | 1114 | rna-XM_042886301.1 19079907 | 26 | 6114726 | 6115839 | Lagopus leucura 30410 | GAT\|GTAAGTAAGC...AAATCCTGTACT/GGAAAATTAATA...TGCAG\|CTG | 1 | 1 | 67.827 |
| 101756468 | GT-AG | 0 | 1.000000099473604e-05 | 1297 | rna-XM_042886301.1 19079907 | 27 | 6116008 | 6117304 | Lagopus leucura 30410 | TGC\|GTGAGTATTC...TTTGTATTAACA/TTTGTATTAACA...TGCAG\|AGC | 1 | 1 | 72.004 |
| 101756469 | GT-AG | 0 | 1.000000099473604e-05 | 3755 | rna-XM_042886301.1 19079907 | 28 | 6117394 | 6121148 | Lagopus leucura 30410 | CAG\|GTTTGTACTG...ATTATTTTTTCT/AAAAAAATTATT...CTCAG\|GGT | 0 | 1 | 74.217 |
| 101756470 | GT-AG | 0 | 1.000000099473604e-05 | 3710 | rna-XM_042886301.1 19079907 | 29 | 6121345 | 6125054 | Lagopus leucura 30410 | TAG\|GTAAGAACCC...TTTTTTTTCATG/TTTTTTTTCATG...TCCAG\|ATA | 1 | 1 | 79.09 |
| 101756471 | GT-AG | 0 | 8.118376245079748e-05 | 671 | rna-XM_042886301.1 19079907 | 30 | 6125195 | 6125865 | Lagopus leucura 30410 | ATG\|GTAAGCTGCA...GACCTCTTAGAA/CCATGTCTGACC...TGCAG\|GAA | 0 | 1 | 82.571 |
| 101756472 | GT-AG | 0 | 0.00329670622009 | 540 | rna-XM_042886301.1 19079907 | 31 | 6126026 | 6126565 | Lagopus leucura 30410 | CAG\|GTAACATTTA...GTTTCTTTGAAT/GTTTCTTTGAAT...TGCAG\|GCA | 1 | 1 | 86.549 |
| 101756473 | GT-AG | 0 | 1.000000099473604e-05 | 1845 | rna-XM_042886301.1 19079907 | 32 | 6126703 | 6128547 | Lagopus leucura 30410 | AAG\|GTACAAAATA...CTCCTTTTATAA/GTAAATTTCATT...CTCAG\|CCA | 0 | 1 | 89.955 |
| 101756474 | GT-AG | 0 | 1.000000099473604e-05 | 972 | rna-XM_042886301.1 19079907 | 33 | 6128735 | 6129706 | Lagopus leucura 30410 | GAG\|GTAATCAGAT...ATCTTCCTAATA/ATCTTCCTAATA...TCCAG\|TAC | 1 | 1 | 94.605 |
| 101760770 | GT-AG | 0 | 4.446434968873484e-05 | 478 | rna-XM_042886301.1 19079907 | 34 | 6129748 | 6130225 | Lagopus leucura 30410 | AAG\|GTATGAATCA...AAATCCTTCTCC/ATGTATTTGACA...CTCAG\|CCC |  | 0 | 95.624 |
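
The `start`, `end`, and `phase` columns in the rows above can be cross-checked against each other. The following is a rough sketch, assuming that the transcript lies on the forward strand (coordinates increase with `ordinal_index`), that the exons between these introns are entirely coding, and that `phase` counts the codon bases preceding the intron. None of these assumptions is stated on this page; they are only consistent with the listed values. The function names are hypothetical.

```python
# (ordinal_index, start, end, phase) for the first six introns listed above.
INTRONS = [
    (1, 6073765, 6082201, 1),
    (2, 6082469, 6085273, 1),
    (3, 6085370, 6085616, 1),
    (4, 6085704, 6085851, 1),
    (5, 6086001, 6086948, 0),
    (6, 6087067, 6087837, 1),
]


def exon_lengths(introns):
    """Length of each exon lying between two consecutive introns."""
    # With inclusive coordinates, the exon spans prev_end + 1 .. next_start - 1.
    return [nxt[1] - prev[2] - 1 for prev, nxt in zip(introns, introns[1:])]


def phases_consistent(introns):
    """Check that each phase follows from the previous phase and exon length."""
    for prev, nxt in zip(introns, introns[1:]):
        exon_len = nxt[1] - prev[2] - 1
        if (prev[3] + exon_len) % 3 != nxt[3]:
            return False
    return True


lengths = exon_lengths(INTRONS)
consistent = phases_consistent(INTRONS)
print(lengths)     # [267, 96, 87, 149, 118]
print(consistent)  # True
```

For example, the exon between introns 4 and 5 is 149 bases long, and (1 + 149) mod 3 = 0, which matches the phase of intron 5.
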
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
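
To illustrate what the indexes are for, the sketch below loads the table definition and one index into an empty in-memory SQLite database and asks the planner how it would run a lookup by `transcript_id`. This assumes Python's built-in `sqlite3` module; the exact wording of the plan text varies between SQLite versions, so the example only looks for the index name.

```python
import sqlite3

# Table definition and one index, as given in the schema above.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
"""

# Assumption: an empty in-memory database is enough to inspect the plan.
conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (19079907,),
).fetchall()

# The human-readable description is the fourth column of each plan row.
details = [row[3] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in details)
print(details)
print(uses_index)
```

The referenced `transcripts` and `genomes` tables are not created here; SQLite accepts the foreign key clauses anyway and does not enforce them unless `PRAGMA foreign_keys` is switched on.
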