introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
34 rows where transcript_id = 21436592
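The filter above can be sketched as a parameterized query. This is a minimal illustration using Python's standard `sqlite3` module against an in-memory table, not the database's own tooling: the trimmed column set, the helper name `introns_for_transcript`, and the extra row for transcript 999 are assumptions made for the example.

```python
import sqlite3

# Assumption: a trimmed-down, in-memory copy of the introns table.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
    """
)

# Three rows copied from the table on this page, plus one made-up row
# (id 1, transcript 999) that exists only to show the filter working.
rows = [
    (115635298, "GT-AG", 0, 2245, 21436592, 1),
    (115635299, "GT-AG", 0, 1272, 21436592, 2),
    (115635300, "GT-AG", 0, 1180, 21436592, 3),
    (1, "GT-AG", 0, 100, 999, 1),  # hypothetical row, not in the database
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", rows)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, length) for one transcript, in intron order."""
    cur = conn.execute(
        "SELECT id, ordinal_index, length FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


result = introns_for_transcript(conn, 21436592)
for row in result:
    print(row)
```

Binding `transcript_id` as a parameter rather than formatting it into the SQL string keeps the query safe to reuse with arbitrary identifiers.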
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115635298 | GT-AG | 0 | 1.000000099473604e-05 | 2245 | rna-XM_034067449.1 21436592 | 1 | 151769797 | 151772041 | Melopsittacus undulatus 13146 | AAG\|GTAAGGGGCG...ATAACTTTATCT/TTCATTTTTATT...TCTAG\|GAG | 1 | 1 | 1.16 |
| 115635299 | GT-AG | 0 | 1.000000099473604e-05 | 1272 | rna-XM_034067449.1 21436592 | 2 | 151768485 | 151769756 | Melopsittacus undulatus 13146 | TAG\|GTAAAATAAT...GTATTTTTAAGT/GTATTTTTAAGT...TTTAG\|TGC | 2 | 1 | 2.107 |
| 115635300 | GT-AG | 0 | 1.000000099473604e-05 | 1180 | rna-XM_034067449.1 21436592 | 3 | 151767121 | 151768300 | Melopsittacus undulatus 13146 | CAG\|GTAATGACTT...TTTCTCTTGTCT/CAAGACTTGAAT...CTCAG\|GAG | 0 | 1 | 6.463 |
| 115635301 | GT-AG | 0 | 1.0039876683262393e-05 | 1038 | rna-XM_034067449.1 21436592 | 4 | 151766006 | 151767043 | Melopsittacus undulatus 13146 | ATA\|GTAAGTCATT...TTTTTCTTGTTG/CATTATCTGACT...GTTAG\|TGG | 2 | 1 | 8.286 |
| 115635302 | GT-AG | 0 | 1.000000099473604e-05 | 909 | rna-XM_034067449.1 21436592 | 5 | 151765059 | 151765967 | Melopsittacus undulatus 13146 | TGG\|GTAAGTGCAT...TTGGTTTTAATT/TTGGTTTTAATT...TTTAG\|GAC | 1 | 1 | 9.186 |
| 115635303 | GT-AG | 0 | 0.000266405855031 | 369 | rna-XM_034067449.1 21436592 | 6 | 151764592 | 151764960 | Melopsittacus undulatus 13146 | AAG\|GTACATTTAT...CCAGACTTAATA/CATCTGTTTATA...TTCAG\|GCT | 0 | 1 | 11.506 |
| 115635304 | GT-AG | 0 | 0.0004320184261434 | 551 | rna-XM_034067449.1 21436592 | 7 | 151763861 | 151764411 | Melopsittacus undulatus 13146 | CAG\|GTATGTCTTG...CCTTTTTTTATT/CCTTTTTTTATT...CTTAG\|GTA | 0 | 1 | 15.767 |
| 115635305 | GT-AG | 0 | 1.000000099473604e-05 | 96 | rna-XM_034067449.1 21436592 | 8 | 151763555 | 151763650 | Melopsittacus undulatus 13146 | AAG\|GTTAGTTCTG...TTCACTTTTACT/AATCAATTCACT...CTTAG\|GAA | 0 | 1 | 20.739 |
| 115635306 | GT-AG | 0 | 1.000000099473604e-05 | 1086 | rna-XM_034067449.1 21436592 | 9 | 151762343 | 151763428 | Melopsittacus undulatus 13146 | CGG\|GTAATACAAA...CTAAAGTTAACT/CTAAAGTTAACT...AACAG\|GAT | 0 | 1 | 23.722 |
| 115635307 | GT-AG | 0 | 1.000000099473604e-05 | 605 | rna-XM_034067449.1 21436592 | 10 | 151761615 | 151762219 | Melopsittacus undulatus 13146 | AAG\|GTAAGAGAGT...GTATCTTTAGTA/TCTTTAGTAATT...GAAAG\|GCT | 0 | 1 | 26.634 |
| 115635308 | GT-AG | 0 | 1.000000099473604e-05 | 2267 | rna-XM_034067449.1 21436592 | 11 | 151759224 | 151761490 | Melopsittacus undulatus 13146 | AAG\|GTAAGAGTTG...ATTTCTTTGACT/CTTTGACTTACC...TACAG\|GTG | 1 | 1 | 29.569 |
| 115635309 | GT-AG | 0 | 1.000000099473604e-05 | 852 | rna-XM_034067449.1 21436592 | 12 | 151758280 | 151759131 | Melopsittacus undulatus 13146 | AAG\|GTATGGAAGT...GTGACCATGACT/ACTTTTTTCACT...CTAAG\|AAT | 0 | 1 | 31.747 |
| 115635310 | GT-AG | 0 | 1.000000099473604e-05 | 1044 | rna-XM_034067449.1 21436592 | 13 | 151757026 | 151758069 | Melopsittacus undulatus 13146 | CAG\|GTAAGTTCTT...ATTGCCTGGAAA/AACCTGGTGAAT...TACAG\|GTG | 0 | 1 | 36.719 |
| 115635311 | GT-AG | 0 | 2.6371816246596683e-05 | 426 | rna-XM_034067449.1 21436592 | 14 | 151756422 | 151756847 | Melopsittacus undulatus 13146 | GAG\|GTAACAATAT...TACTCTGTGATT/TACTCTGTGATT...TCTAG\|GTC | 1 | 1 | 40.933 |
| 115635312 | GT-AG | 0 | 1.000000099473604e-05 | 2065 | rna-XM_034067449.1 21436592 | 15 | 151754215 | 151756279 | Melopsittacus undulatus 13146 | CAA\|GTGAGACAAA...GTGGTTTTGACA/GTGGTTTTGACA...AACAG\|GAA | 2 | 1 | 44.295 |
| 115635313 | GT-AG | 0 | 3.4465265995842854e-05 | 271 | rna-XM_034067449.1 21436592 | 16 | 151753802 | 151754072 | Melopsittacus undulatus 13146 | AAG\|GTATGGTAAG...AGCATTTTATTT/GAGCATTTTATT...AACAG\|AAC | 0 | 1 | 47.656 |
| 115635314 | GT-AG | 0 | 1.000000099473604e-05 | 894 | rna-XM_034067449.1 21436592 | 17 | 151752707 | 151753600 | Melopsittacus undulatus 13146 | CAG\|GTGAGTGTTA...AGTGCTCTAATT/AGTGCTCTAATT...CACAG\|GAG | 0 | 1 | 52.415 |
| 115635315 | GT-AG | 0 | 1.000000099473604e-05 | 765 | rna-XM_034067449.1 21436592 | 18 | 151751834 | 151752598 | Melopsittacus undulatus 13146 | GAG\|GTGAGAGAAC...TTCCATTTAAAA/TTAAAACTTATA...TCAAG\|CTT | 0 | 1 | 54.972 |
| 115635316 | GT-AG | 0 | 1.000000099473604e-05 | 554 | rna-XM_034067449.1 21436592 | 19 | 151751174 | 151751727 | Melopsittacus undulatus 13146 | ATA\|GTAAGTCTGT...TTGTACCTAGCA/TTCCAACTAACA...GACAG\|TAT | 1 | 1 | 57.481 |
| 115635317 | GT-AG | 0 | 0.0017486407862981 | 1379 | rna-XM_034067449.1 21436592 | 20 | 151749629 | 151751007 | Melopsittacus undulatus 13146 | CAG\|GTATGTTGGC...AAACTCTTAATG/TTTGGTTTTATT...GACAG\|GCT | 2 | 1 | 61.411 |
| 115635318 | GT-AG | 0 | 1.000000099473604e-05 | 1164 | rna-XM_034067449.1 21436592 | 21 | 151748323 | 151749486 | Melopsittacus undulatus 13146 | CGT\|GTAAGACAGT...TGTATTTTGATA/TGTATTTTGATA...TCCAG\|AAT | 0 | 1 | 64.773 |
| 115635319 | GT-AG | 0 | 0.0004798042342175 | 194 | rna-XM_034067449.1 21436592 | 22 | 151748049 | 151748242 | Melopsittacus undulatus 13146 | AAA\|GTAAGTTTTA...TATGTCTTACTA/ATATGTCTTACT...TCCAG\|GGA | 2 | 1 | 66.667 |
| 115635320 | GT-AG | 0 | 1.000000099473604e-05 | 266 | rna-XM_034067449.1 21436592 | 23 | 151747707 | 151747972 | Melopsittacus undulatus 13146 | CAG\|GTACAACAGC...CTTTCCTTCACT/CTTTCCTTCACT...TCCAG\|GAA | 0 | 1 | 68.466 |
| 115635321 | GT-AG | 0 | 1.000000099473604e-05 | 1083 | rna-XM_034067449.1 21436592 | 24 | 151746537 | 151747619 | Melopsittacus undulatus 13146 | AAG\|GTTTGGGCTT...TATGTATTATCA/TTAAGATTAACT...TGCAG\|GAA | 0 | 1 | 70.526 |
| 115635322 | GT-AG | 0 | 0.0011948880802965 | 839 | rna-XM_034067449.1 21436592 | 25 | 151745593 | 151746431 | Melopsittacus undulatus 13146 | AAT\|GTATGTCCTG...TAAAATTTAATA/TAAAATTTAATA...TGTAG\|TGC | 0 | 1 | 73.011 |
| 115635323 | GT-AG | 0 | 1.000000099473604e-05 | 1670 | rna-XM_034067449.1 21436592 | 26 | 151743800 | 151745469 | Melopsittacus undulatus 13146 | GAG\|GTAAAAACAA...GTAACTTTGACA/TCTCTGCTTATC...AATAG\|AGA | 0 | 1 | 75.923 |
| 115635324 | GT-AG | 0 | 1.000000099473604e-05 | 750 | rna-XM_034067449.1 21436592 | 27 | 151742903 | 151743652 | Melopsittacus undulatus 13146 | CAG\|GTAAAACAAG...TCACTTCTAATT/TCACTTCTAATT...AATAG\|TTG | 0 | 1 | 79.403 |
| 115635325 | GT-AG | 0 | 7.534264299031725e-05 | 249 | rna-XM_034067449.1 21436592 | 28 | 151742549 | 151742797 | Melopsittacus undulatus 13146 | CAG\|GTATGGCATT...TTTCTTTTGACT/TTTCTTTTGACT...ATCAG\|TGT | 0 | 1 | 81.889 |
| 115635326 | GT-AG | 0 | 1.000000099473604e-05 | 446 | rna-XM_034067449.1 21436592 | 29 | 151741938 | 151742383 | Melopsittacus undulatus 13146 | AAG\|GTAATATGAG...AGGTCTTCAACT/TCCAATTTCACT...CTTAG\|GAA | 0 | 1 | 85.795 |
| 115635327 | GT-AG | 0 | 1.000000099473604e-05 | 645 | rna-XM_034067449.1 21436592 | 30 | 151741186 | 151741830 | Melopsittacus undulatus 13146 | CAG\|GTAAAATAAC...TACAACTTGATT/CTCTTTCTGAGA...CTTAG\|TAA | 2 | 1 | 88.329 |
| 115635328 | GT-AG | 0 | 4.453901804896216e-05 | 551 | rna-XM_034067449.1 21436592 | 31 | 151740562 | 151741112 | Melopsittacus undulatus 13146 | AAG\|GTATTACCCT...ACATTTTTGAAC/ACATTTTTGAAC...TACAG\|AAT | 0 | 1 | 90.057 |
| 115635329 | GT-AG | 0 | 0.0002121732183681 | 1734 | rna-XM_034067449.1 21436592 | 32 | 151738790 | 151740523 | Melopsittacus undulatus 13146 | AGA\|GTAAGTTTAA...GATTCTTTTACA/GGGGTTCTAATT...GACAG\|ATT | 2 | 1 | 90.956 |
| 115635330 | GT-AG | 0 | 1.000000099473604e-05 | 380 | rna-XM_034067449.1 21436592 | 33 | 151738244 | 151738623 | Melopsittacus undulatus 13146 | AAG\|GTTATACATG...ATGTCATTATTA/AATGTCATTATT...TGCAG\|GAA | 0 | 1 | 94.886 |
| 115635331 | GT-AG | 0 | 0.0001394680607898 | 293 | rna-XM_034067449.1 21436592 | 34 | 151737819 | 151738111 | Melopsittacus undulatus 13146 | GCG\|GTAAGTTGTA...ACATCCTTAATG/CACATCCTTAAT...TGCAG\|GAG | 0 | 1 | 98.011 |
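The rows above suggest that `start` and `end` are inclusive coordinates, so that `length = end - start + 1`. The short check below tests that reading against the first three rows; the helper name `inclusive_length` is hypothetical, and the inclusive convention is an inference from the displayed values rather than something the column descriptions state.

```python
# (start, end, length) copied from the first three rows of the table above.
rows = [
    (151769797, 151772041, 2245),
    (151768485, 151769756, 1272),
    (151767121, 151768300, 1180),
]


def inclusive_length(start, end):
    """Length of an interval whose start and end positions are both included."""
    return end - start + 1


# Collect any row whose stored length disagrees with the inclusive formula.
mismatches = [(s, e, n) for s, e, n in rows if inclusive_length(s, e) != n]
print("mismatches:", mismatches)

# Rows are listed in ordinal_index order; check whether start decreases.
starts_descend = all(a[0] > b[0] for a, b in zip(rows, rows[1:]))
print("start decreases with ordinal_index:", starts_descend)
```

In these rows `start` falls as `ordinal_index` rises, which would be consistent with a transcript annotated on the reverse strand, although the table has no strand column to confirm this.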
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
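One way to see what these indexes are for is to ask SQLite for its query plan. The sketch below rebuilds a reduced version of the table in memory (only a few columns and two of the indexes, which is an assumption made for brevity) and uses `EXPLAIN QUERY PLAN`, a standard SQLite statement, to inspect a lookup by `transcript_id`. The helper name `query_plan` is hypothetical, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

# Assumption: a reduced, in-memory version of the schema above.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        is_minor INTEGER,
        score REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_is_minor ON introns (is_minor);
    """
)


def query_plan(conn, sql, params=()):
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The detail text is the last column of each plan row.
    return [row[-1] for row in rows]


plan = query_plan(
    conn,
    "SELECT id FROM introns WHERE transcript_id = ?",
    (21436592,),
)
for step in plan:
    print(step)
```

If the plan mentions `idx_introns_transcript_id`, the filter is served by an index search rather than a full table scan.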