introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, 1 if the intron is a minor intron, 0 otherwise
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, 1 if the intron is within the coding sequence, 0 otherwise (e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
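
The relationship between phase and in_cds described above can be sketched with Python's built-in sqlite3 module. This is a minimal illustration under stated assumptions, not the database's own tooling: the table is cut down to four columns, the first three rows reuse values from the listing on this page, and the fourth row (id -1) is a hypothetical UTR intron invented only to show how a null phase behaves in queries.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        ordinal_index INTEGER,
        phase INTEGER,   -- 0, 1, 2, or NULL outside the coding sequence
        in_cds INTEGER   -- 1 inside the coding sequence, 0 otherwise
    )
    """
)
rows = [
    (48954239, 1, 0, 1),  # values taken from the listing on this page
    (48954240, 2, 2, 1),
    (48954241, 3, 1, 1),
    (-1, 0, None, 0),     # HYPOTHETICAL UTR intron, not a real database row
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", rows)

# GROUP BY puts all NULL phases in one group; SQLite sorts NULL first.
counts = conn.execute(
    "SELECT phase, COUNT(*) FROM introns GROUP BY phase ORDER BY phase"
).fetchall()

# NULL must be tested with IS NULL; "= NULL" never matches any row.
utr_ids = [r[0] for r in conn.execute("SELECT id FROM introns WHERE phase IS NULL")]
eq_null_rows = conn.execute("SELECT id FROM introns WHERE phase = NULL").fetchall()

print(counts, utr_ids, eq_null_rows)
```

Filtering on in_cds = 0 is likely the more direct way to select introns outside the coding sequence, since it avoids null comparisons altogether.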
40 rows where transcript_id = 9059413
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48954239 | GT-AG | 0 | 1.000000099473604e-05 | 2913 | rna-XM_036571540.1 9059413 | 1 | 40629333 | 40632245 | Colossoma macropomum 42526 | CGG\|GTAAGATAAC...GCATCTTCGAAT/GGTCAGCTGATC...CACAG\|TGT | 0 | 1 | 0.519 |
| 48954240 | GT-AG | 0 | 1.7136795688198906e-05 | 2074 | rna-XM_036571540.1 9059413 | 2 | 40627128 | 40629201 | Colossoma macropomum 42526 | CAG\|GTAAGCTGCT...GCTGTCTTGGCA/ACTGATTTAATC...TTTAG\|CGA | 2 | 1 | 3.351 |
| 48954241 | GT-AG | 0 | 1.000000099473604e-05 | 329 | rna-XM_036571540.1 9059413 | 3 | 40626716 | 40627044 | Colossoma macropomum 42526 | GAG\|GTGAGTGCGG...GGAACTTTAATC/TATATTTTCATT...CCCAG\|AAT | 1 | 1 | 5.145 |
| 48954242 | GT-AG | 0 | 1.000000099473604e-05 | 2033 | rna-XM_036571540.1 9059413 | 4 | 40624552 | 40626584 | Colossoma macropomum 42526 | CAG\|GTGAGACCTC...TTGTTTTTATTC/CTTGTTTTTATT...TGCAG\|ATT | 0 | 1 | 7.977 |
| 48954243 | GT-AG | 0 | 1.6113271493317204e-05 | 1785 | rna-XM_036571540.1 9059413 | 5 | 40622654 | 40624438 | Colossoma macropomum 42526 | CAG\|GTAACGCGCT...TATGCCTAAACA/TTATGCCTAAAC...TACAG\|GCC | 2 | 1 | 10.419 |
| 48954244 | GT-AG | 0 | 1.000000099473604e-05 | 140 | rna-XM_036571540.1 9059413 | 6 | 40622336 | 40622475 | Colossoma macropomum 42526 | GCT\|GTGAGTACCT...ACTTACGTGATT/ATTCTGTTCATC...TACAG\|GAC | 0 | 1 | 14.267 |
| 48954245 | GT-AG | 0 | 0.0022924511044716 | 116 | rna-XM_036571540.1 9059413 | 7 | 40622123 | 40622238 | Colossoma macropomum 42526 | AAG\|GTATATATTA...AGATTTTTAGAG/TATTTATTTATT...TTAAG\|ATT | 1 | 1 | 16.364 |
| 48954246 | GT-AG | 0 | 1.000000099473604e-05 | 1569 | rna-XM_036571540.1 9059413 | 8 | 40620481 | 40622049 | Colossoma macropomum 42526 | AAA\|GTAAGTATAT...ATGCTCTTCGTT/TCTTCGTTTACC...TGTAG\|GGA | 2 | 1 | 17.942 |
| 48954247 | GT-AG | 0 | 6.38443604029931e-05 | 667 | rna-XM_036571540.1 9059413 | 9 | 40619713 | 40620379 | Colossoma macropomum 42526 | AAG\|GTAGGCCTGC...ATTGCCTTTTCT/CTGTGGTTAATT...AACAG\|ACC | 1 | 1 | 20.125 |
| 48954248 | GT-AG | 0 | 1.000000099473604e-05 | 1860 | rna-XM_036571540.1 9059413 | 10 | 40617753 | 40619612 | Colossoma macropomum 42526 | GAG\|GTAAAGAAAT...CATGTTTTACTG/GCATGTTTTACT...CACAG\|ATC | 2 | 1 | 22.287 |
| 48954249 | GT-AG | 0 | 8.42015120736351e-05 | 1147 | rna-XM_036571540.1 9059413 | 11 | 40616545 | 40617691 | Colossoma macropomum 42526 | AAG\|GTGCACTTTT...CTGATCTTGTCT/GTTAAACTGATC...TGCAG\|CGG | 0 | 1 | 23.606 |
| 48954250 | GT-AG | 0 | 1.000000099473604e-05 | 2210 | rna-XM_036571540.1 9059413 | 12 | 40614243 | 40616452 | Colossoma macropomum 42526 | TAG\|GTAGGAACTC...TTTTCTTTCTTT/ACTTTGCCCACC...AAAAG\|CAT | 2 | 1 | 25.594 |
| 48954251 | GT-AG | 0 | 1.000000099473604e-05 | 352 | rna-XM_036571540.1 9059413 | 13 | 40613740 | 40614091 | Colossoma macropomum 42526 | ATG\|GTGAGGGGAA...TTTCTCTTCATT/TTTCTCTTCATT...TACAG\|TGG | 0 | 1 | 28.859 |
| 48954252 | GT-AG | 0 | 1.000000099473604e-05 | 2043 | rna-XM_036571540.1 9059413 | 14 | 40611577 | 40613619 | Colossoma macropomum 42526 | TTG\|GTTGGTAGAA...CAGATTTTACCA/ACAGATTTTACC...TCCAG\|GTT | 0 | 1 | 31.453 |
| 48954253 | GT-AG | 0 | 1.000000099473604e-05 | 915 | rna-XM_036571540.1 9059413 | 15 | 40610576 | 40611490 | Colossoma macropomum 42526 | CAG\|GTGAGTCCTT...ATTTCCTTCTTT/AGTAAAGTCATT...TTTAG\|GAT | 2 | 1 | 33.312 |
| 48954254 | GT-AG | 0 | 1.000000099473604e-05 | 1661 | rna-XM_036571540.1 9059413 | 16 | 40608779 | 40610439 | Colossoma macropomum 42526 | TCT\|GTGAGTAGCC...GAGATCTAAAAT/GACTTGTTTATG...TGCAG\|ATG | 0 | 1 | 36.252 |
| 48954255 | GT-AG | 0 | 1.000000099473604e-05 | 2398 | rna-XM_036571540.1 9059413 | 17 | 40606249 | 40608646 | Colossoma macropomum 42526 | AAG\|GTATGAAAAG...TGTTCCTTCATT/CATTTTCTAACC...TTTAG\|GCT | 0 | 1 | 39.105 |
| 48954256 | GT-AG | 0 | 0.2799139786199134 | 853 | rna-XM_036571540.1 9059413 | 18 | 40605231 | 40606083 | Colossoma macropomum 42526 | ATG\|GTATGCTTGT...TAAATTTTAACT/TAAATTTTAACT...CGCAG\|GGT | 0 | 1 | 42.672 |
| 48954257 | GT-AG | 0 | 0.0027691289334392 | 5253 | rna-XM_036571540.1 9059413 | 19 | 40599854 | 40605106 | Colossoma macropomum 42526 | CAG\|GTAACCGTGA...TCCCGCTTGACA/TTTGTGCTAAAC...GACAG\|TTG | 1 | 1 | 45.352 |
| 48954258 | GT-AG | 0 | 1.000000099473604e-05 | 440 | rna-XM_036571540.1 9059413 | 20 | 40599315 | 40599754 | Colossoma macropomum 42526 | ATG\|GTGAGTGTCT...GCCTTTTCAAAT/TAGTAAGTAATT...TGCAG\|AAG | 1 | 1 | 47.492 |
| 48954259 | GT-AG | 0 | 1.000000099473604e-05 | 295 | rna-XM_036571540.1 9059413 | 21 | 40598925 | 40599219 | Colossoma macropomum 42526 | ATG\|GTGAGCTGCT...TTTGTTTTCATG/TTTGTTTTCATG...CACAG\|AAG | 0 | 1 | 49.546 |
| 48954260 | GT-AG | 0 | 1.000000099473604e-05 | 3529 | rna-XM_036571540.1 9059413 | 22 | 40595186 | 40598714 | Colossoma macropomum 42526 | CAG\|GTTAGTGGCG...TTCCCTTTATCT/TTTCCCTTTATC...TGCAG\|GGA | 0 | 1 | 54.086 |
| 48954261 | GT-AG | 0 | 1.000000099473604e-05 | 5914 | rna-XM_036571540.1 9059413 | 23 | 40589140 | 40595053 | Colossoma macropomum 42526 | AGG\|GTGAGACACT...GTTTCTTTCATC/GTTTCTTTCATC...TCTAG\|ATC | 0 | 1 | 56.939 |
| 48954262 | GT-AG | 0 | 0.0043007349241167 | 1231 | rna-XM_036571540.1 9059413 | 24 | 40587778 | 40589008 | Colossoma macropomum 42526 | AAA\|GTATGCATTT...GTATGTGTATTT/CAATGAATCAAG...TCCAG\|GGC | 2 | 1 | 59.771 |
| 48954263 | GT-AG | 0 | 1.000000099473604e-05 | 159 | rna-XM_036571540.1 9059413 | 25 | 40587523 | 40587681 | Colossoma macropomum 42526 | CAG\|GTGAATTCCA...TGTACCTGATTT/TTGTACCTGATT...TGCAG\|TGT | 2 | 1 | 61.846 |
| 48954264 | GT-AG | 0 | 1.000000099473604e-05 | 2086 | rna-XM_036571540.1 9059413 | 26 | 40585334 | 40587419 | Colossoma macropomum 42526 | ATT\|GTAAGTGCCT...CATGTCCTGACA/CATGTCCTGACA...TGCAG\|GTG | 0 | 1 | 64.073 |
| 48954265 | GT-AG | 0 | 1.000000099473604e-05 | 5094 | rna-XM_036571540.1 9059413 | 27 | 40580160 | 40585253 | Colossoma macropomum 42526 | GAG\|GTGAGGCACA...TGGTGCTTGATC/TGGTGCTTGATC...GGCAG\|CTT | 2 | 1 | 65.802 |
| 48954266 | GT-AG | 0 | 0.0541439474002858 | 1601 | rna-XM_036571540.1 9059413 | 28 | 40578369 | 40579969 | Colossoma macropomum 42526 | GAG\|GTACCTATGC...ATTGCTTTGAAG/ATTGCTTTGAAG...CTCAG\|GTC | 0 | 1 | 69.909 |
| 48954267 | GT-AG | 0 | 1.000000099473604e-05 | 1144 | rna-XM_036571540.1 9059413 | 29 | 40577066 | 40578209 | Colossoma macropomum 42526 | CTG\|GTGAGGCTAA...TCTACTGTATTT/GAGGTTTGCATT...TGCAG\|GGT | 0 | 1 | 73.346 |
| 48954268 | GT-AG | 0 | 1.000000099473604e-05 | 2301 | rna-XM_036571540.1 9059413 | 30 | 40574679 | 40576979 | Colossoma macropomum 42526 | AGC\|GTGAGTGTTG...GTGTTTTTATGC/TGTGTTTTTATG...TCCAG\|GCA | 2 | 1 | 75.205 |
| 48954269 | GT-AG | 0 | 1.000000099473604e-05 | 1242 | rna-XM_036571540.1 9059413 | 31 | 40573352 | 40574593 | Colossoma macropomum 42526 | AGA\|GTGAGTATTT...CACTTGTCAACA/CAACATTTCAGA...TTCAG\|GTT | 0 | 1 | 77.043 |
| 48954270 | GT-AG | 0 | 1.000000099473604e-05 | 337 | rna-XM_036571540.1 9059413 | 32 | 40572932 | 40573268 | Colossoma macropomum 42526 | GAG\|GTGAGTCCAG...ACTGTGTTAACT/TCTTCGCTCACT...TGAAG\|TGT | 2 | 1 | 78.837 |
| 48954271 | GT-AG | 0 | 1.000000099473604e-05 | 6660 | rna-XM_036571540.1 9059413 | 33 | 40566248 | 40572907 | Colossoma macropomum 42526 | AAG\|GTAGGGAGCT...AAAAACGTATGT/AAAAAACGTATG...TTCAG\|TCT | 2 | 1 | 79.356 |
| 48954272 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_036571540.1 9059413 | 34 | 40566058 | 40566145 | Colossoma macropomum 42526 | GAG\|GTAGGACGCC...TCTGCTTTCTCT/TCTTTTGTCATT...TTCAG\|GAT | 2 | 1 | 81.561 |
| 48954273 | GT-AG | 0 | 1.000000099473604e-05 | 1706 | rna-XM_036571540.1 9059413 | 35 | 40564267 | 40565972 | Colossoma macropomum 42526 | AAG\|GTAAAACCGC...ACTGCATTGACT/ACTGCATTGACT...TATAG\|GAG | 0 | 1 | 83.398 |
| 48954274 | GT-AG | 0 | 1.000000099473604e-05 | 298 | rna-XM_036571540.1 9059413 | 36 | 40563786 | 40564083 | Colossoma macropomum 42526 | CAG\|GTAAGTACAA...AATTCCTTCTTT/TGTTTACTGAAT...CGCAG\|ATT | 0 | 1 | 87.354 |
| 48954275 | GT-AG | 0 | 1.000000099473604e-05 | 3019 | rna-XM_036571540.1 9059413 | 37 | 40560606 | 40563624 | Colossoma macropomum 42526 | CAG\|GTTGAGAAAA...AGTTTCATATTG/TGCAGTTTCATA...CTCAG\|TGC | 2 | 1 | 90.834 |
| 48954276 | GT-AG | 0 | 1.000000099473604e-05 | 98 | rna-XM_036571540.1 9059413 | 38 | 40560393 | 40560490 | Colossoma macropomum 42526 | CAG\|GTCAGTACCC...TGCGTTTCAACA/GTGTTACTGATG...GACAG\|GAC | 0 | 1 | 93.32 |
| 48954277 | GT-AG | 0 | 0.0006978156793319 | 6924 | rna-XM_036571540.1 9059413 | 39 | 40553352 | 40560275 | Colossoma macropomum 42526 | CTG\|GTATGTGTCC...GGTGTTTTAGTG/TGTTTGGTGATT...TGCAG\|TGT | 0 | 1 | 95.85 |
| 48954278 | GT-AG | 0 | 1.000000099473604e-05 | 1006 | rna-XM_036571540.1 9059413 | 40 | 40552180 | 40553185 | Colossoma macropomum 42526 | CAG\|GTGAGTATGT...TCTTCTTTTGTT/CATTTTCTGAAG...TACAG\|GCA | 1 | 1 | 99.438 |
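
The rows above suggest two regularities that are easy to check in code: length appears to equal end - start + 1 (inclusive coordinates), and start falls as ordinal_index rises. The second would be consistent with a transcript annotated on the reverse strand, although this page does not state the strand. The sketch below loads four of the listed rows into an in-memory SQLite table with a reduced column set (an assumption made for brevity) and tests both observations.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# "start" and "end" are quoted defensively because END is an SQL keyword.
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        "start" INTEGER,
        "end" INTEGER
    )
    """
)
# Four rows copied from the listing (introns 1, 2, 3 and 40).
sample = [
    (48954239, 2913, 9059413, 1, 40629333, 40632245),
    (48954240, 2074, 9059413, 2, 40627128, 40629201),
    (48954241, 329, 9059413, 3, 40626716, 40627044),
    (48954278, 1006, 9059413, 40, 40552180, 40553185),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample)

rows = conn.execute(
    'SELECT ordinal_index, "start", "end", length FROM introns '
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (9059413,),
).fetchall()

# Inclusive coordinates: length == end - start + 1 for every sampled row.
lengths_consistent = all(end - start + 1 == length for _, start, end, length in rows)

# Start positions strictly decrease as the ordinal index increases.
starts = [start for _, start, _, _ in rows]
starts_descending = all(a > b for a, b in zip(starts, starts[1:]))

print(lengths_consistent, starts_descending)
```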
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
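
The index on transcript_id is what should make a per-transcript lookup, such as the one shown on this page, cheap. As a rough check, the sketch below builds a reduced copy of the table (three columns only, an assumption for brevity) in an in-memory SQLite database and asks the planner how it would run the lookup. The exact wording of the plan varies between SQLite versions, so the code only looks for the index name in the plan text.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        score REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# EXPLAIN QUERY PLAN describes the access path without running the query.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (9059413,),
).fetchall()

# The human-readable description is the last column of each plan row.
details = [row[-1] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in details)

for d in details:
    print(d)
```

On a real copy of the database the planner may choose differently once table statistics exist, so this is best read as a way to inspect the plan rather than a guarantee about it.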