introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
53 rows where transcript_id = 9059417
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48954330 | GT-AG | 0 | 1.000000099473604e-05 | 32378 | rna-XM_036577461.1 9059417 | 1 | 10071891 | 10104268 | Colossoma macropomum 42526 | GTG|GTAAGTTGAA...AGGCTTTTATTT/CATCTGCTCACC...TGTAG|GAG | 1 | 1 | 2.033 |
| 48954331 | GT-AG | 0 | 5.798562820469158e-05 | 140 | rna-XM_036577461.1 9059417 | 2 | 10104491 | 10104630 | Colossoma macropomum 42526 | ACG|GTATGAACCA...TGTTTTTTACAT/CATGTACTCATC...TTCAG|ACA | 1 | 1 | 6.993 |
| 48954332 | GT-AG | 0 | 0.1125879166188585 | 119 | rna-XM_036577461.1 9059417 | 3 | 10104645 | 10104763 | Colossoma macropomum 42526 | AAG|GTAACCTTTA...TCTACCATAATC/TTTGTGGTAAAT...TTCAG|GGT | 0 | 1 | 7.306 |
| 48954333 | GT-AG | 0 | 0.0002478105599774 | 1097 | rna-XM_036577461.1 9059417 | 4 | 10104797 | 10105893 | Colossoma macropomum 42526 | ATT|GTACGTATAA...CTGTATTTAATG/TTAAATTTAATT...TACAG|GTC | 0 | 1 | 8.043 |
| 48954334 | GT-AG | 0 | 8.444498560368492e-05 | 2138 | rna-XM_036577461.1 9059417 | 5 | 10105927 | 10108064 | Colossoma macropomum 42526 | ATG|GTAAGCCTGT...ACTGTCTTACCA/CAATCTCTAATT...AACAG|GGA | 0 | 1 | 8.78 |
| 48954335 | GT-AG | 0 | 1.000000099473604e-05 | 116 | rna-XM_036577461.1 9059417 | 6 | 10108119 | 10108234 | Colossoma macropomum 42526 | CCT|GTAAGAAACA...GTACTTTTAATG/CCATTGTTTATA...TGTAG|GGT | 0 | 1 | 9.987 |
| 48954336 | GT-AG | 0 | 1.000000099473604e-05 | 480 | rna-XM_036577461.1 9059417 | 7 | 10108322 | 10108801 | Colossoma macropomum 42526 | GGA|GTGAGTACAG...TTCTCTTTGTCT/CAGATATTTAGA...CTTAG|GGT | 0 | 1 | 11.93 |
| 48954337 | GT-AG | 0 | 1.000000099473604e-05 | 2474 | rna-XM_036577461.1 9059417 | 8 | 10108880 | 10111353 | Colossoma macropomum 42526 | AGG|GTAAGGGGCA...ACTATTCTAATA/ATAATACTCATC...TTTAG|GGG | 0 | 1 | 13.673 |
| 48954338 | GT-AG | 0 | 2.601688171207284e-05 | 83 | rna-XM_036577461.1 9059417 | 9 | 10111399 | 10111481 | Colossoma macropomum 42526 | CCA|GTAAGTGTGA...TTTCCATTAACT/TTTCCATTAACT...CTTAG|GGG | 0 | 1 | 14.678 |
| 48954339 | GT-AG | 0 | 1.000000099473604e-05 | 393 | rna-XM_036577461.1 9059417 | 10 | 10111536 | 10111928 | Colossoma macropomum 42526 | ATG|GTGAGTGAAT...TTAGCCATAAAC/AAACTTCTGATC...TACAG|GGT | 0 | 1 | 15.885 |
| 48954340 | GT-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_036577461.1 9059417 | 11 | 10111983 | 10112101 | Colossoma macropomum 42526 | GAT|GTGAGTACAG...GAAGTTTCAATC/TTCAATCTCACC...CGTAG|GGA | 0 | 1 | 17.091 |
| 48954341 | GT-AG | 0 | 0.0001242368091326 | 116 | rna-XM_036577461.1 9059417 | 12 | 10112156 | 10112271 | Colossoma macropomum 42526 | GCG|GTAAACAGCA...ATTACTTTAAAA/TAAGTTTTCAGT...TTTAG|GGA | 0 | 1 | 18.298 |
| 48954342 | GT-AG | 0 | 1.000000099473604e-05 | 539 | rna-XM_036577461.1 9059417 | 13 | 10112326 | 10112864 | Colossoma macropomum 42526 | AGA|GTGAGTACAT...TTAAGCTTGATT/TTGATTCTGACA...TTTAG|GGA | 0 | 1 | 19.504 |
| 48954343 | GT-AG | 0 | 0.0010776816152066 | 186 | rna-XM_036577461.1 9059417 | 14 | 10112919 | 10113104 | Colossoma macropomum 42526 | AAA|GTACACAGCT...GACATTTTACTT/CCTGTTCTGACA...CTTAG|GGT | 0 | 1 | 20.71 |
| 48954344 | GT-AG | 0 | 1.000000099473604e-05 | 120 | rna-XM_036577461.1 9059417 | 15 | 10113150 | 10113269 | Colossoma macropomum 42526 | ATG|GTAAAGTGAA...TGTTTCATAATG/TTATGTTTCATA...TGTAG|GGC | 0 | 1 | 21.716 |
| 48954345 | GT-AG | 0 | 1.000000099473604e-05 | 204 | rna-XM_036577461.1 9059417 | 16 | 10113324 | 10113527 | Colossoma macropomum 42526 | CCT|GTAAGTACAA...CTCATATTAATA/ATTGCACTCATA...TTCAG|GGT | 0 | 1 | 22.922 |
| 48954346 | GT-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_036577461.1 9059417 | 17 | 10113573 | 10113673 | Colossoma macropomum 42526 | ATG|GTAAGACACC...TAATCTTTGAAC/TTTAAATTGATA...TAAAG|GGT | 0 | 1 | 23.928 |
| 48954347 | GT-AG | 0 | 1.000000099473604e-05 | 196 | rna-XM_036577461.1 9059417 | 18 | 10113728 | 10113923 | Colossoma macropomum 42526 | AAG|GTAAGCCATG...TGTTTATTAATA/TGTTTATTAATA...GATAG|GGA | 0 | 1 | 25.134 |
| 48954348 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna-XM_036577461.1 9059417 | 19 | 10114023 | 10114135 | Colossoma macropomum 42526 | CAA|GTCCGTACTC...TTTGCTTTTATG/TTGAATCTAATT...TATAG|GGT | 0 | 1 | 27.346 |
| 48954349 | GT-AG | 0 | 4.7628158108820834e-05 | 107 | rna-XM_036577461.1 9059417 | 20 | 10114181 | 10114287 | Colossoma macropomum 42526 | GTG|GTATGTAAAC...GAATCATTAGAG/CACAGAATCATT...TCCAG|GGC | 0 | 1 | 28.351 |
| 48954350 | GT-AG | 0 | 1.000000099473604e-05 | 245 | rna-XM_036577461.1 9059417 | 21 | 10114387 | 10114631 | Colossoma macropomum 42526 | GTG|GTAAGATCTT...GGTTTCTTAGAT/CTTAGATTCACC...TTCAG|GGA | 0 | 1 | 30.563 |
| 48954351 | GT-AG | 0 | 3.9221658116970726e-05 | 158 | rna-XM_036577461.1 9059417 | 22 | 10114686 | 10114843 | Colossoma macropomum 42526 | CAA|GTAAGTTGTG...AATGTCTGAACT/TAATGTCTGAAC...CGCAG|GGG | 0 | 1 | 31.769 |
| 48954352 | GT-AG | 0 | 1.000000099473604e-05 | 122 | rna-XM_036577461.1 9059417 | 23 | 10114952 | 10115073 | Colossoma macropomum 42526 | AGA|GTGAGTCAGA...TTTTCATTAATA/TTTTTTTTCATT...TTCAG|GGA | 0 | 1 | 34.182 |
| 48954353 | GT-AG | 0 | 1.000000099473604e-05 | 669 | rna-XM_036577461.1 9059417 | 24 | 10115128 | 10115796 | Colossoma macropomum 42526 | AAG|GTAAGTACAT...GTCACTGTAATT/TAAAGTGTCACT...CTAAG|GGT | 0 | 1 | 35.389 |
| 48954354 | GT-AG | 0 | 1.000000099473604e-05 | 120 | rna-XM_036577461.1 9059417 | 25 | 10115896 | 10116015 | Colossoma macropomum 42526 | AGG|GTAAAACATA...CATGTTTTGACA/ACAGTATTCATT...TGTAG|GGT | 0 | 1 | 37.601 |
| 48954355 | GT-AG | 0 | 0.0310027176272765 | 793 | rna-XM_036577461.1 9059417 | 26 | 10116070 | 10116862 | Colossoma macropomum 42526 | CCG|GTATGTTTGA...TTACCTTTAAAA/CTGAAATTAAGT...TGTAG|GGT | 0 | 1 | 38.807 |
| 48954356 | GT-AG | 0 | 1.000000099473604e-05 | 100 | rna-XM_036577461.1 9059417 | 27 | 10116962 | 10117061 | Colossoma macropomum 42526 | AGC|GTAAGTCAAG...GTTTTCATAGTG/AGTGCTCTCATT...TTCAG|GGG | 0 | 1 | 41.019 |
| 48954357 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_036577461.1 9059417 | 28 | 10117116 | 10117198 | Colossoma macropomum 42526 | AGG|GTAGGTAACA...CAAGTCATGACA/ATATTAGTAATC...TGTAG|GGG | 0 | 1 | 42.225 |
| 48954358 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-XM_036577461.1 9059417 | 29 | 10117253 | 10117358 | Colossoma macropomum 42526 | CCT|GTAAGTGCTT...AATGTTTCATCT/AAATGTTTCATC...CACAG|GGA | 0 | 1 | 43.432 |
| 48954359 | GT-AG | 0 | 1.000000099473604e-05 | 150 | rna-XM_036577461.1 9059417 | 30 | 10117413 | 10117562 | Colossoma macropomum 42526 | CAG|GTGAGCTGTT...ACCTCTTTATCC/ACATTACTGAGT...CTCAG|GGA | 0 | 1 | 44.638 |
| 48954360 | GT-AG | 0 | 1.000000099473604e-05 | 127 | rna-XM_036577461.1 9059417 | 31 | 10117617 | 10117743 | Colossoma macropomum 42526 | CAG|GTAAGATTAT...ACTATTTTAAAC/CATAGACTCACT...TTTAG|GGT | 0 | 1 | 45.845 |
| 48954361 | GT-AG | 0 | 4.5319502838760936e-05 | 621 | rna-XM_036577461.1 9059417 | 32 | 10117789 | 10118409 | Colossoma macropomum 42526 | AGG|GTAAGTTTGC...TAAGTTTTAATG/TAAGTTTTAATG...TACAG|GGA | 0 | 1 | 46.85 |
| 48954362 | GT-AG | 0 | 1.000000099473604e-05 | 144 | rna-XM_036577461.1 9059417 | 33 | 10118509 | 10118652 | Colossoma macropomum 42526 | AAG|GTAAATGACT...TAGTCCTAAATG/GTAGTCCTAAAT...CACAG|GGA | 0 | 1 | 49.062 |
| 48954363 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_036577461.1 9059417 | 34 | 10118761 | 10118843 | Colossoma macropomum 42526 | CGA|GTAAGTACAA...ACATATTTATTG/TATTGTTTCACA...TACAG|GGG | 0 | 1 | 51.475 |
| 48954364 | GT-AG | 0 | 0.0066513540474003 | 420 | rna-XM_036577461.1 9059417 | 35 | 10118898 | 10119317 | Colossoma macropomum 42526 | AGA|GTAAGCTTGA...TTTCTTTTAATA/TTTCTTTTAATA...CACAG|GGG | 0 | 1 | 52.681 |
| 48954365 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-XM_036577461.1 9059417 | 36 | 10119372 | 10119456 | Colossoma macropomum 42526 | AAG|GTAAGCAGAT...CTATTGTTAACA/CTATTGTTAACA...AACAG|GGG | 0 | 1 | 53.887 |
| 48954366 | GT-AG | 0 | 0.0024782668605959 | 92 | rna-XM_036577461.1 9059417 | 37 | 10119511 | 10119602 | Colossoma macropomum 42526 | CCT|GTATGTCAGC...TATCCTTGGACT/CACATTTTCATA...TTTAG|GGA | 0 | 1 | 55.094 |
| 48954367 | GT-AG | 0 | 0.0004853829649808 | 164 | rna-XM_036577461.1 9059417 | 38 | 10119657 | 10119820 | Colossoma macropomum 42526 | CCT|GTAAGTTCAT...CTATTCTTATCA/GCTATTCTTATC...TTCAG|GGT | 0 | 1 | 56.3 |
| 48954368 | GT-AG | 0 | 1.000000099473604e-05 | 89 | rna-XM_036577461.1 9059417 | 39 | 10119929 | 10120017 | Colossoma macropomum 42526 | GTG|GTAGGTGACA...CATGCCTGGATG/CAACATCTGAAA...TATAG|GGC | 0 | 1 | 58.713 |
| 48954369 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-XM_036577461.1 9059417 | 40 | 10120072 | 10120177 | Colossoma macropomum 42526 | CCT|GTAAGTGAAG...AATGCTTCATTC/AAATGCTTCATT...TTCAG|GGC | 0 | 1 | 59.92 |
| 48954370 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_036577461.1 9059417 | 41 | 10120232 | 10120335 | Colossoma macropomum 42526 | ACT|GTGAGTATCC...AAAATATTGACC/AAAATATTGACC...TTCAG|GGT | 0 | 1 | 61.126 |
| 48954371 | GT-AG | 0 | 0.0001463273827415 | 262 | rna-XM_036577461.1 9059417 | 42 | 10120498 | 10120759 | Colossoma macropomum 42526 | ACA|GTAAGCATTC...AATTAATTAATT/AATTAATTAATT...TGAAG|GGT | 0 | 1 | 64.745 |
| 48954372 | GT-AG | 0 | 7.048195517770208e-05 | 1112 | rna-XM_036577461.1 9059417 | 43 | 10120868 | 10121979 | Colossoma macropomum 42526 | GCT|GTAAGTATCC...ATTGCCTTTGCA/GTCTTCCTCACA...TACAG|GGT | 0 | 1 | 67.158 |
| 48954373 | GT-AG | 0 | 8.36605793551655e-05 | 2847 | rna-XM_036577461.1 9059417 | 44 | 10122088 | 10124934 | Colossoma macropomum 42526 | GAT|GTAAGTTTAA...GATAACTTAATG/TAATGATTAACC...GATAG|GGA | 0 | 1 | 69.571 |
| 48954374 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna-XM_036577461.1 9059417 | 45 | 10124989 | 10125095 | Colossoma macropomum 42526 | CGG|GTAAGATCTG...TTTTCCATGTCT/CCATGTCTAACT...TGTAG|GGT | 0 | 1 | 70.777 |
| 48954375 | GT-AG | 0 | 0.0161847836512704 | 120 | rna-XM_036577461.1 9059417 | 46 | 10125204 | 10125323 | Colossoma macropomum 42526 | CCA|GTATGTATTG...TTATTTTGGACA/TTAATATTTATT...AACAG|GGT | 0 | 1 | 73.19 |
| 48954376 | GT-AG | 0 | 1.000000099473604e-05 | 143 | rna-XM_036577461.1 9059417 | 47 | 10125378 | 10125520 | Colossoma macropomum 42526 | CCA|GTAAGACATT...AATCTGTTGACA/AATCTGTTGACA...TATAG|GGA | 0 | 1 | 74.397 |
| 48954377 | GT-AG | 0 | 1.000000099473604e-05 | 137 | rna-XM_036577461.1 9059417 | 48 | 10125629 | 10125765 | Colossoma macropomum 42526 | CCT|GTAAGTATCA...AATTCATCAATA/GATTAATTCATC...TTTAG|GGT | 0 | 1 | 76.81 |
| 48954378 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-XM_036577461.1 9059417 | 49 | 10125820 | 10125925 | Colossoma macropomum 42526 | AGA|GTAAGTGCTA...TAATCTTTTGTA/GACCAATTCATT...TGTAG|GGA | 0 | 1 | 78.016 |
| 48954379 | GT-AG | 0 | 0.0052260453324729 | 2829 | rna-XM_036577461.1 9059417 | 50 | 10126034 | 10128862 | Colossoma macropomum 42526 | GAG|GTATGCACAT...CTCTTTTTAATT/TTTAGTTTCATT...AACAG|GGA | 0 | 1 | 80.429 |
| 48954380 | GT-AG | 0 | 1.7613279950400442e-05 | 114 | rna-XM_036577461.1 9059417 | 51 | 10129170 | 10129283 | Colossoma macropomum 42526 | GTG|GTAAGCCAAT...TGTGTTTTGACG/TGTGTTTTGACG...CACAG|GTG | 1 | 1 | 87.288 |
| 48954381 | GT-AG | 0 | 1.000000099473604e-05 | 1724 | rna-XM_036577461.1 9059417 | 52 | 10129463 | 10131186 | Colossoma macropomum 42526 | TAC|GTGAGTACAT...AAGGCCTTGTGT/ACGAGTCTGACA...TTCAG|TTC | 0 | 1 | 91.287 |
| 48954382 | GT-AG | 0 | 0.0001502580065237 | 269 | rna-XM_036577461.1 9059417 | 53 | 10131430 | 10131698 | Colossoma macropomum 42526 | TCC|GTAAGTCTTT...CTCTCTTTCTCC/TCTAAAATCATA...CACAG|CAA | 0 | 1 | 96.716 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);