introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
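The coordinate columns can be cross-checked against `length`. In the rows listed for transcript 9059403, `length` equals `end - start + 1`, which suggests 1-based inclusive coordinates; that reading is inferred from the sample values and is not stated by the database itself. A minimal sketch using Python's standard `sqlite3` module and an in-memory table holding three of those rows (the reduced four-column table is an illustration, not the real schema):

```python
import sqlite3

# Three rows copied from the transcript 9059403 listing: (id, start, end, length).
SAMPLE = [
    (48953988, 21466035, 21492688, 26654),
    (48953989, 21492746, 21494137, 1392),
    (48953990, 21494228, 21494345, 118),
]

conn = sqlite3.connect(":memory:")
# Reduced, illustrative table: only the columns needed for this check.
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "start" INTEGER, '
    '"end" INTEGER, "length" INTEGER)'
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", SAMPLE)

# "end" is an SQL keyword, so the column names are quoted throughout.
mismatches = conn.execute(
    'SELECT "id" FROM introns WHERE "length" <> "end" - "start" + 1'
).fetchall()
print(mismatches)  # an empty list means every sampled row is consistent
```

Run against the full table, the same `WHERE` clause would list any introns whose stored length disagrees with their coordinates.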
51 rows where transcript_id = 9059403
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48953988 | GT-AG | 0 | 0.0001295348323083 | 26654 | rna-XM_036594868.1 9059403 | 1 | 21466035 | 21492688 | Colossoma macropomum 42526 | GAG\|GTAAGCTCGT...CACTTTTTAATT/CACTTTTTAATT...TTCAG\|GGA | 0 | 1 | 1.465 |
| 48953989 | GT-AG | 0 | 0.0011770726732018 | 1392 | rna-XM_036594868.1 9059403 | 2 | 21492746 | 21494137 | Colossoma macropomum 42526 | AAG\|GTACCGCTCT...CCTGTTTTATCT/TTCATTTTCAAT...TATAG\|GGT | 0 | 1 | 2.625 |
| 48953990 | GT-AG | 0 | 0.0003047881808559 | 118 | rna-XM_036594868.1 9059403 | 3 | 21494228 | 21494345 | Colossoma macropomum 42526 | AAG\|GTATGCCACA...GTACTGTTGATC/TTCATGTTTATT...CATAG\|GGG | 0 | 1 | 4.457 |
| 48953991 | GT-AG | 0 | 1.2751655780864945e-05 | 381 | rna-XM_036594868.1 9059403 | 4 | 21494391 | 21494771 | Colossoma macropomum 42526 | CGT\|GTAAGTAATG...TGATTCTTGTCT/CAAATTGTGATT...CTTAG\|GGA | 0 | 1 | 5.372 |
| 48953992 | GT-AG | 0 | 1.000000099473604e-05 | 1309 | rna-XM_036594868.1 9059403 | 5 | 21494817 | 21496125 | Colossoma macropomum 42526 | CCG\|GTAAGTTCCA...TATGCCTAATTC/GTATGCCTAATT...CTCAG\|GGT | 0 | 1 | 6.288 |
| 48953993 | GT-AG | 0 | 0.0003611181116802 | 122 | rna-XM_036594868.1 9059403 | 6 | 21496189 | 21496310 | Colossoma macropomum 42526 | AAA\|GTAAGCATTT...TTAGTCTTAAAG/AATTGGTTGATT...TGCAG\|GGG | 0 | 1 | 7.57 |
| 48953994 | GT-AG | 0 | 1.000000099473604e-05 | 539 | rna-XM_036594868.1 9059403 | 7 | 21496365 | 21496903 | Colossoma macropomum 42526 | CCA\|GTAAGGAAAA...TTTTTTTTATAT/TTTTTTTTTATA...CACAG\|GGT | 0 | 1 | 8.669 |
| 48953995 | GT-AG | 0 | 1.000000099473604e-05 | 193 | rna-XM_036594868.1 9059403 | 8 | 21496931 | 21497123 | Colossoma macropomum 42526 | AAG\|GTAGGAAAAC...GTAACCTGAAAT/AGTAACCTGAAA...TTTAG\|GGA | 0 | 1 | 9.219 |
| 48953996 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna-XM_036594868.1 9059403 | 9 | 21497202 | 21497308 | Colossoma macropomum 42526 | CCA\|GTAAGTGTGG...TGGCTGTTACCC/AAGCATCTGATT...CTAAG\|GGA | 0 | 1 | 10.806 |
| 48953997 | GT-AG | 0 | 1.000000099473604e-05 | 182 | rna-XM_036594868.1 9059403 | 10 | 21497372 | 21497553 | Colossoma macropomum 42526 | AAA\|GTGAGTGGTC...TTTCTCTTCTCT/TGTAGCATCAAT...CTTAG\|GGA | 0 | 1 | 12.088 |
| 48953998 | GT-AG | 0 | 2.82784734104628e-05 | 102 | rna-XM_036594868.1 9059403 | 11 | 21497590 | 21497691 | Colossoma macropomum 42526 | CCT\|GTAGGTGTTT...TAATCAGTAACT/TAGTAAGTAATC...TGCAG\|GGA | 0 | 1 | 12.821 |
| 48953999 | GT-AG | 0 | 1.000000099473604e-05 | 898 | rna-XM_036594868.1 9059403 | 12 | 21497728 | 21498625 | Colossoma macropomum 42526 | AAG\|GTGAGTCAGA...TAATGCTTCCTT/TAACAATTCAGA...ACTAG\|GGT | 0 | 1 | 13.553 |
| 48954000 | GT-AG | 0 | 1.000000099473604e-05 | 506 | rna-XM_036594868.1 9059403 | 13 | 21498719 | 21499224 | Colossoma macropomum 42526 | CCG\|GTAGGAGGGC...CTGTTTTTGCTT/TAATTACTAAAT...TGTAG\|GGG | 0 | 1 | 15.446 |
| 48954001 | GT-AG | 0 | 1.000000099473604e-05 | 189 | rna-XM_036594868.1 9059403 | 14 | 21499261 | 21499449 | Colossoma macropomum 42526 | AAA\|GTAAGTGCTT...TGGGCACTAATT/TGGGCACTAATT...GCCAG\|TGC | 0 | 1 | 16.178 |
| 48954002 | GT-AG | 0 | 2.271663568156184e-05 | 177 | rna-XM_036594868.1 9059403 | 15 | 21499504 | 21499680 | Colossoma macropomum 42526 | CCA\|GTAAGTAGCA...TTTTCCTTTTCT/TGCAAACTAAAT...CTTAG\|GGA | 0 | 1 | 17.277 |
| 48954003 | GT-AG | 0 | 1.000000099473604e-05 | 324 | rna-XM_036594868.1 9059403 | 16 | 21499717 | 21500040 | Colossoma macropomum 42526 | ACG\|GTAAGATATG...AAGTCTTTGTTT/AAATATCTCACA...AAAAG\|GGG | 0 | 1 | 18.01 |
| 48954004 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_036594868.1 9059403 | 17 | 21500086 | 21500196 | Colossoma macropomum 42526 | AAA\|GTAAGTGATG...TATGTTTTGATA/TCAATACTCATC...AATAG\|GGG | 0 | 1 | 18.926 |
| 48954005 | GT-AG | 0 | 1.000000099473604e-05 | 167 | rna-XM_036594868.1 9059403 | 18 | 21500230 | 21500396 | Colossoma macropomum 42526 | TAT\|GTAAGTGGGC...ATTTTTTTATTT/TTTTTATTTATT...TAAAG\|GGT | 0 | 1 | 19.597 |
| 48954006 | GT-AG | 0 | 1.000000099473604e-05 | 2501 | rna-XM_036594868.1 9059403 | 19 | 21500458 | 21502958 | Colossoma macropomum 42526 | GAG\|GTAGGTTAGA...GTTTGTTTAGCA/TTGCAATTCATG...TGCAG\|GAT | 1 | 1 | 20.838 |
| 48954007 | GT-AG | 0 | 1.000000099473604e-05 | 760 | rna-XM_036594868.1 9059403 | 20 | 21502995 | 21503754 | Colossoma macropomum 42526 | CAG\|GTAAGAGCAG...GTATTCCTAATT/GTATTCCTAATT...TATAG\|GCA | 1 | 1 | 21.571 |
| 48954008 | GT-AG | 0 | 1.000000099473604e-05 | 370 | rna-XM_036594868.1 9059403 | 21 | 21503902 | 21504271 | Colossoma macropomum 42526 | TAA\|GTAAGTGTTG...GTTATGTTGACA/GTTATGTTGACA...TGTAG\|GTG | 1 | 1 | 24.562 |
| 48954009 | GT-AG | 0 | 1.000000099473604e-05 | 2279 | rna-XM_036594868.1 9059403 | 22 | 21504350 | 21506628 | Colossoma macropomum 42526 | AAG\|GTGAGAAGGG...TTAACTTTAAAA/GGTAACTTAACT...CTTAG\|GTG | 1 | 1 | 26.15 |
| 48954010 | GT-AG | 0 | 1.000000099473604e-05 | 288 | rna-XM_036594868.1 9059403 | 23 | 21506713 | 21507000 | Colossoma macropomum 42526 | AAG\|GTGAGGTTCC...CTTTTCTTGTTT/ACATTATTCATC...ACTAG\|GTT | 1 | 1 | 27.859 |
| 48954011 | GT-AG | 0 | 1.000000099473604e-05 | 925 | rna-XM_036594868.1 9059403 | 24 | 21507072 | 21507996 | Colossoma macropomum 42526 | CCT\|GTAAGTGGTG...TTTTTCTTTGAC/GTTTCTCTCACA...TTCAG\|GGT | 0 | 1 | 29.304 |
| 48954012 | GT-AG | 0 | 2.484407177703753e-05 | 175 | rna-XM_036594868.1 9059403 | 25 | 21508189 | 21508363 | Colossoma macropomum 42526 | CCA\|GTGAGTTTTA...CAAACCTTATTT/ATATTTGTTATT...CAAAG\|GCC | 0 | 1 | 33.211 |
| 48954013 | GC-AG | 0 | 1.000000099473604e-05 | 143 | rna-XM_036594868.1 9059403 | 26 | 21508533 | 21508675 | Colossoma macropomum 42526 | CAG\|GCTAGCATTT...CTCACTTTCAAA/AAAAATCTCACT...TCTAG\|GTC | 1 | 1 | 36.65 |
| 48954014 | GT-AG | 0 | 0.0006110390340296 | 103 | rna-XM_036594868.1 9059403 | 27 | 21508769 | 21508871 | Colossoma macropomum 42526 | AAG\|GTAACTTGTG...GTGATTATGATT/GTGATTATGATT...TTAAG\|GTG | 1 | 1 | 38.543 |
| 48954015 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_036594868.1 9059403 | 28 | 21508977 | 21509087 | Colossoma macropomum 42526 | AAG\|GTAAATCCAG...AGTGTTTTAAAT/AGTGTTTTAAAT...TTCAG\|GAT | 1 | 1 | 40.68 |
| 48954016 | GT-AG | 0 | 0.0008821524492751 | 102 | rna-XM_036594868.1 9059403 | 29 | 21509186 | 21509287 | Colossoma macropomum 42526 | AAA\|GTATGATGGT...TCTATCTTAAAT/TTTTTATTGATT...AAAAG\|GGT | 0 | 1 | 42.674 |
| 48954017 | GT-AG | 0 | 1.000000099473604e-05 | 195 | rna-XM_036594868.1 9059403 | 30 | 21509439 | 21509633 | Colossoma macropomum 42526 | AAG\|GTGAGTTAAT...TCATCCTTACCT/TTTTTTTTCATA...TCCAG\|GTG | 1 | 1 | 45.747 |
| 48954018 | GT-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_036594868.1 9059403 | 31 | 21509748 | 21509848 | Colossoma macropomum 42526 | TTG\|GTGAGTTCCT...TATCTCTGGATA/AAACTACTCATG...TGTAG\|GTC | 1 | 1 | 48.067 |
| 48954019 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_036594868.1 9059403 | 32 | 21510017 | 21510127 | Colossoma macropomum 42526 | GAG\|GTGAGTGCTC...GTTGCTTTTGCT/TGTGAATTTATT...TATAG\|GTC | 1 | 1 | 51.486 |
| 48954020 | GT-AG | 0 | 1.000000099473604e-05 | 181 | rna-XM_036594868.1 9059403 | 33 | 21510218 | 21510398 | Colossoma macropomum 42526 | GAG\|GTAAAGCATC...CTTTCATTAATC/TGGTCTTTCATT...GGCAG\|GTG | 1 | 1 | 53.317 |
| 48954021 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_036594868.1 9059403 | 34 | 21510552 | 21510650 | Colossoma macropomum 42526 | GAG\|GTTTGAGATT...ATTGTCTGAAAT/AACTAACTGATT...TTCAG\|GTG | 1 | 1 | 56.431 |
| 48954022 | GT-AG | 0 | 1.000000099473604e-05 | 128 | rna-XM_036594868.1 9059403 | 35 | 21510750 | 21510877 | Colossoma macropomum 42526 | CAG\|GTAATGTTTC...CATGTTTTAATC/CATGTTTTAATC...CTTAG\|GTG | 1 | 1 | 58.445 |
| 48954023 | GT-AG | 0 | 1.5693450759280464e-05 | 317 | rna-XM_036594868.1 9059403 | 36 | 21510968 | 21511284 | Colossoma macropomum 42526 | CAG\|GTACAATGGA...TTCCTTTTGATT/TTTGATTTTATT...TGCAG\|GAG | 1 | 1 | 60.277 |
| 48954024 | GT-AG | 0 | 1.000000099473604e-05 | 135 | rna-XM_036594868.1 9059403 | 37 | 21511425 | 21511559 | Colossoma macropomum 42526 | AAG\|GTGCAGTATT...GGTACCTTTGCC/TCTCCATACATC...CACAG\|GGT | 0 | 1 | 63.126 |
| 48954025 | GC-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_036594868.1 9059403 | 38 | 21511687 | 21511769 | Colossoma macropomum 42526 | CAG\|GCAAGTTCTG...AAAATGTTAATT/AAAATGTTAATT...AACAG\|GTA | 1 | 1 | 65.71 |
| 48954026 | GT-AG | 0 | 1.000000099473604e-05 | 169 | rna-XM_036594868.1 9059403 | 39 | 21511851 | 21512019 | Colossoma macropomum 42526 | CAG\|GTTGGTAAAA...CACTTTTTACAG/CCACTTTTTACA...TCCAG\|GTG | 1 | 1 | 67.359 |
| 48954027 | GT-AG | 0 | 1.000000099473604e-05 | 303 | rna-XM_036594868.1 9059403 | 40 | 21512119 | 21512421 | Colossoma macropomum 42526 | CAG\|GTTAGTGATT...GTTATGTTGACC/GTTATGTTGACC...TTCAG\|GTG | 1 | 1 | 69.373 |
| 48954028 | GT-AG | 0 | 1.000000099473604e-05 | 3369 | rna-XM_036594868.1 9059403 | 41 | 21512473 | 21515841 | Colossoma macropomum 42526 | AAG\|GTCAGAGTCA...TTAGCCAAAATC/ACCAGGCTCAGT...CTCAG\|GTG | 1 | 1 | 70.411 |
| 48954029 | GT-AG | 0 | 1.000000099473604e-05 | 536 | rna-XM_036594868.1 9059403 | 42 | 21516028 | 21516563 | Colossoma macropomum 42526 | CAG\|GTAAATCTCC...CTTTCCATATCA/TTCATATTTAAT...TGCAG\|GGC | 1 | 1 | 74.196 |
| 48954030 | GT-AG | 0 | 3.9147037815174254e-05 | 86 | rna-XM_036594868.1 9059403 | 43 | 21516698 | 21516783 | Colossoma macropomum 42526 | CCA\|GTAAGTTAAA...CATTTCATGATT/CATTTCATGATT...TTCAG\|GGT | 0 | 1 | 76.923 |
| 48954031 | GT-AG | 0 | 1.000000099473604e-05 | 171 | rna-XM_036594868.1 9059403 | 44 | 21516857 | 21517027 | Colossoma macropomum 42526 | CAG\|GTAAGACTCA...TGTACTTTACTC/TTGTACTTTACT...GTCAG\|GAA | 1 | 1 | 78.409 |
| 48954032 | GT-AG | 0 | 1.000000099473604e-05 | 149 | rna-XM_036594868.1 9059403 | 45 | 21517100 | 21517248 | Colossoma macropomum 42526 | AAG\|GTTTGTAGAA...AAATTATTAATT/ATTAATTTCATT...TCTAG\|GTA | 1 | 1 | 79.874 |
| 48954033 | GT-AG | 0 | 2.8110290314767367e-05 | 181 | rna-XM_036594868.1 9059403 | 46 | 21517378 | 21517558 | Colossoma macropomum 42526 | AAG\|GTTGGTTTTC...CTTTCTTTAACA/CTTTCTTTAACA...TCCAG\|GAT | 1 | 1 | 82.499 |
| 48954034 | GT-AG | 0 | 0.0001708448056125 | 108 | rna-XM_036594868.1 9059403 | 47 | 21517658 | 21517765 | Colossoma macropomum 42526 | CTG\|GTATGTGTGC...TACACCTTGTCA/TTGGATTTAATC...TTTAG\|GAC | 1 | 1 | 84.514 |
| 48954035 | GT-AG | 0 | 1.000000099473604e-05 | 222 | rna-XM_036594868.1 9059403 | 48 | 21517979 | 21518200 | Colossoma macropomum 42526 | TGG\|GTTAGTATCA...TCAATTTTAACA/TCAATTTTAACA...CATAG\|GTA | 1 | 1 | 88.848 |
| 48954036 | GT-AG | 0 | 1.000000099473604e-05 | 441 | rna-XM_036594868.1 9059403 | 49 | 21518379 | 21518819 | Colossoma macropomum 42526 | CAG\|GTCAGTGAGT...GGTGTTTTGTAT/TGTACGTTCATC...TATAG\|ATG | 2 | 1 | 92.47 |
| 48954037 | GT-AG | 0 | 1.000000099473604e-05 | 119 | rna-XM_036594868.1 9059403 | 50 | 21518935 | 21519053 | Colossoma macropomum 42526 | ATG\|GTGAGAGAAA...CTCTCTTTCTCT/TAAGCTCTGATT...CTCAG\|CAC | 0 | 1 | 94.811 |
| 48954038 | GT-AG | 0 | 0.0012919555945562 | 2603 | rna-XM_036594868.1 9059403 | 51 | 21519227 | 21521829 | Colossoma macropomum 42526 | TAG\|GTACGCTTAC...TGCTGCTTATTT/TTGCTGCTTATT...TCCAG\|GAA | 2 | 1 | 98.331 |
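The `phase` values in this listing change between consecutive introns in step with the length of the exon that separates them. Assuming the transcript lies on the forward strand and that the bases between two listed introns are all coding (both are assumptions; neither strand nor exon coordinates appear in this table), the next phase would be `(phase + exon_length) mod 3`, with `exon_length = next_start - end - 1`. The sketch below checks that reading on introns 18 to 21; the helper name `predicted_phases` is made up for this illustration.

```python
# (ordinal_index, start, end, phase) for introns 18-21, copied from the table above.
ROWS = [
    (18, 21500230, 21500396, 0),
    (19, 21500458, 21502958, 1),
    (20, 21502995, 21503754, 1),
    (21, 21503902, 21504271, 1),
]


def predicted_phases(rows):
    """Predict each following intron's phase from the exon that precedes it."""
    predictions = []
    for (_, _, prev_end, prev_phase), (ordinal, start, _, _) in zip(rows, rows[1:]):
        exon_length = start - prev_end - 1  # bases strictly between the two introns
        predictions.append((ordinal, exon_length, (prev_phase + exon_length) % 3))
    return predictions


result = predicted_phases(ROWS)
observed = {ordinal: phase for ordinal, _, _, phase in ROWS}
for ordinal, exon_length, phase in result:
    # Last field is True when the prediction matches the phase column.
    print(ordinal, exon_length, phase, phase == observed[ordinal])
```

For these rows the exon lengths come out as 61, 36, and 147 bases, and each predicted phase matches the listed value of 1.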
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
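The row filter shown on this page (`transcript_id = 9059403`) can be reproduced against a local copy of the schema. The sketch below is a minimal illustration with Python's standard `sqlite3` module: it creates the table and one of the indexes in memory, inserts two rows from the listing plus one made-up row, runs the filter, and asks SQLite for its query plan. The `transcripts` and `genomes` tables are left out, which SQLite tolerates as long as foreign-key enforcement stays at its default (off). The exact query issued by the page is not shown in the source, so the `SELECT` here is an assumption.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
""")

conn.executemany(
    'INSERT INTO introns ("id", "dinucleotide_pair", "transcript_id", "ordinal_index") '
    "VALUES (?, ?, ?, ?)",
    [
        (48953989, "GT-AG", 9059403, 2),
        (48953988, "GT-AG", 9059403, 1),
        (1, "GT-AG", 1, 1),  # hypothetical row for another transcript, not real data
    ],
)

# Assumed form of the page's filter, ordered by id as in the listing.
QUERY = "SELECT id, ordinal_index FROM introns WHERE transcript_id = ? ORDER BY id"
rows = conn.execute(QUERY, (9059403,)).fetchall()

# Each plan row is (id, parent, notused, detail); only the detail text is kept.
plan = conn.execute("EXPLAIN QUERY PLAN " + QUERY, (9059403,)).fetchall()
plan_details = [row[3] for row in plan]

print(rows)
print(plan_details)
```

The plan text should mention `idx_introns_transcript_id`, which is the reason a per-transcript lookup stays cheap; the precise wording of the plan varies between SQLite versions.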