introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
47 rows where transcript_id = 9059369
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48952920 | GT-AG | 0 | 1.000000099473604e-05 | 3373 | rna-XM_036585626.1 9059369 | 2 | 18574526 | 18577898 | Colossoma macropomum 42526 | CAG|GTCAGTGGCC...ATTCCCTGAAAT/TAGTGTCTGATT...CCCAG|ACA | 0 | 1 | 4.158 |
| 48952921 | GT-AG | 0 | 0.0003524595840526 | 113 | rna-XM_036585626.1 9059369 | 3 | 18574267 | 18574379 | Colossoma macropomum 42526 | CAG|GTACATTTAT...GATGTGTTAATG/CATTTGCTTACA...TCCAG|ACC | 2 | 1 | 5.97 |
| 48952922 | GT-AG | 0 | 1.000000099473604e-05 | 529 | rna-XM_036585626.1 9059369 | 4 | 18573658 | 18574186 | Colossoma macropomum 42526 | AAG|GTAAAACAAA...CAGCCTTTAATA/TTTTTCTTTACA...CCAAG|GAC | 1 | 1 | 6.963 |
| 48952923 | GT-AG | 0 | 1.000000099473604e-05 | 454 | rna-XM_036585626.1 9059369 | 5 | 18573091 | 18573544 | Colossoma macropomum 42526 | CAC|GTGAGTTCCA...TGTTTTTTGTCC/ATGTTACTGACC...CATAG|AGG | 0 | 1 | 8.365 |
| 48952924 | GT-AG | 0 | 1.000000099473604e-05 | 640 | rna-XM_036585626.1 9059369 | 6 | 18572232 | 18572871 | Colossoma macropomum 42526 | AAG|GTAAGAGACT...ATTTATTTATTT/TATTTATTTATT...TGCAG|GGC | 0 | 1 | 11.084 |
| 48952925 | GT-AG | 0 | 3.8065781839331146e-05 | 117 | rna-XM_036585626.1 9059369 | 7 | 18571999 | 18572115 | Colossoma macropomum 42526 | CAA|GTAACGACAC...TAAACATTGATT/GATTTATTTATT...TTCAG|GCC | 2 | 1 | 12.523 |
| 48952926 | GT-AG | 0 | 1.000000099473604e-05 | 95 | rna-XM_036585626.1 9059369 | 8 | 18571652 | 18571746 | Colossoma macropomum 42526 | TAG|GTGAGAAATT...TTTTTCTTATTA/GTTTTTCTTATT...CACAG|GTT | 2 | 1 | 15.651 |
| 48952927 | GT-AG | 0 | 0.0008394137157713 | 182 | rna-XM_036585626.1 9059369 | 9 | 18571331 | 18571512 | Colossoma macropomum 42526 | GCA|GTAAGTTGTG...GAGTCTTTATCT/TGAGTCTTTATC...TCCAG|GAG | 0 | 1 | 17.376 |
| 48952928 | GT-AG | 0 | 1.000000099473604e-05 | 257 | rna-XM_036585626.1 9059369 | 10 | 18570921 | 18571177 | Colossoma macropomum 42526 | CAG|GTAGAGAAGG...TTCAGCTTAGTT/TTAGTTGTCAAT...CTTAG|GCT | 0 | 1 | 19.275 |
| 48952929 | GT-AG | 0 | 8.535852984084633e-05 | 623 | rna-XM_036585626.1 9059369 | 11 | 18570193 | 18570815 | Colossoma macropomum 42526 | AAG|GTTTGTTTTT...TAGTTCTGATTT/GTAGTTCTGATT...TACAG|GAA | 0 | 1 | 20.578 |
| 48952930 | GT-AG | 0 | 1.000000099473604e-05 | 142 | rna-XM_036585626.1 9059369 | 12 | 18569844 | 18569985 | Colossoma macropomum 42526 | CAG|GTACAAGTTA...ACATCTTTAAGT/ACATCTTTAAGT...TTCAG|GAC | 0 | 1 | 23.148 |
| 48952931 | GT-AG | 0 | 1.000000099473604e-05 | 313 | rna-XM_036585626.1 9059369 | 13 | 18569394 | 18569706 | Colossoma macropomum 42526 | CAG|GTAAAGAGCC...TATATCGTAAGA/GTAAGAGTAATA...TTCAG|AAA | 2 | 1 | 24.848 |
| 48952932 | GT-AG | 0 | 1.000000099473604e-05 | 521 | rna-XM_036585626.1 9059369 | 14 | 18568837 | 18569357 | Colossoma macropomum 42526 | AGG|GTAAGTGTAA...TTTGTTTTATTT/GTTTGTTTTATT...TTTAG|TCA | 2 | 1 | 25.295 |
| 48952933 | GT-AG | 0 | 1.000000099473604e-05 | 159 | rna-XM_036585626.1 9059369 | 15 | 18568544 | 18568702 | Colossoma macropomum 42526 | AAG|GTTGGTTATT...TAGTTTTTGTCG/TCATGGTTAATG...TTTAG|CAG | 1 | 1 | 26.958 |
| 48952934 | GT-AG | 0 | 1.000000099473604e-05 | 1041 | rna-XM_036585626.1 9059369 | 16 | 18567412 | 18568452 | Colossoma macropomum 42526 | GAG|GTATGAGGAT...GCATTTTTGATT/GCATTTTTGATT...CACAG|GTT | 2 | 1 | 28.087 |
| 48952935 | GT-AG | 0 | 0.0001582956973011 | 968 | rna-XM_036585626.1 9059369 | 17 | 18566101 | 18567068 | Colossoma macropomum 42526 | AGG|GTAGGTATTG...AAACCCTTAATT/AAACCCTTAATT...CTTAG|GTT | 0 | 1 | 32.345 |
| 48952936 | GT-AG | 0 | 1.000000099473604e-05 | 688 | rna-XM_036585626.1 9059369 | 18 | 18565317 | 18566004 | Colossoma macropomum 42526 | CAG|GTAAGACTTG...TTTGTTTTAAAT/TTTGTTTTAAAT...TGCAG|GTA | 0 | 1 | 33.536 |
| 48952937 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna-XM_036585626.1 9059369 | 19 | 18564998 | 18565104 | Colossoma macropomum 42526 | CAG|GTGCTTCACA...CTTTTTTTAAAC/CTTTTTTTAAAC...TGCAG|AGC | 2 | 1 | 36.167 |
| 48952938 | GT-AG | 0 | 1.000000099473604e-05 | 187 | rna-XM_036585626.1 9059369 | 20 | 18564570 | 18564756 | Colossoma macropomum 42526 | ACG|GTAAAAATCT...TTTGTTTTGCTT/TTTTCTATCATT...CTCAG|CTC | 0 | 1 | 39.158 |
| 48952939 | GT-AG | 0 | 1.000000099473604e-05 | 219 | rna-XM_036585626.1 9059369 | 21 | 18564201 | 18564419 | Colossoma macropomum 42526 | GTG|GTGAGGAACA...TATTCCTTGCCA/CGAGTGTTCATT...TGCAG|ATA | 0 | 1 | 41.02 |
| 48952940 | GT-AG | 0 | 1.000000099473604e-05 | 382 | rna-XM_036585626.1 9059369 | 22 | 18563698 | 18564079 | Colossoma macropomum 42526 | CAG|GTGTGCGACA...TTGCATTTAGAA/CTGTTGTTCATT...TCCAG|ATA | 1 | 1 | 42.522 |
| 48952941 | GT-AG | 0 | 1.000000099473604e-05 | 148 | rna-XM_036585626.1 9059369 | 23 | 18563419 | 18563566 | Colossoma macropomum 42526 | GAG|GTTTGTCTGT...TCAGCGTTGAAA/GCAAGTATGATT...CTCAG|GTG | 0 | 1 | 44.148 |
| 48952942 | GT-AG | 0 | 0.0009601681022307 | 663 | rna-XM_036585626.1 9059369 | 24 | 18562477 | 18563139 | Colossoma macropomum 42526 | CCT|GTAAGTTTCC...GTCTCTTTGTCT/CTCTCTCTCACT...CATAG|ATA | 0 | 1 | 47.611 |
| 48952943 | GT-AG | 0 | 1.000000099473604e-05 | 3093 | rna-XM_036585626.1 9059369 | 25 | 18559258 | 18562350 | Colossoma macropomum 42526 | GAG|GTATGAGCGA...AGTAACTTGACT/ACTTGACTTATC...CTCAG|AAC | 0 | 1 | 49.175 |
| 48952944 | GT-AG | 0 | 1.000000099473604e-05 | 291 | rna-XM_036585626.1 9059369 | 26 | 18558829 | 18559119 | Colossoma macropomum 42526 | AAG|GTGCCAGATC...AAGTTTTTATAT/AAAGTTTTTATA...CTTAG|ACT | 0 | 1 | 50.887 |
| 48952945 | GT-AG | 0 | 1.000000099473604e-05 | 109 | rna-XM_036585626.1 9059369 | 27 | 18558553 | 18558661 | Colossoma macropomum 42526 | CAA|GTGAGTAGCA...ACTTTTTTATAT/GACTTTTTTATA...TTCAG|GTC | 2 | 1 | 52.96 |
| 48952946 | GT-AG | 0 | 4.1716452275742816e-05 | 118 | rna-XM_036585626.1 9059369 | 28 | 18558326 | 18558443 | Colossoma macropomum 42526 | GGT|GTAAGTGTTT...TTGTTGTTAATG/GCTTGTTTTATT...TCTAG|AGC | 0 | 1 | 54.313 |
| 48952947 | GT-AG | 0 | 7.892851119569995e-05 | 91 | rna-XM_036585626.1 9059369 | 29 | 18558088 | 18558178 | Colossoma macropomum 42526 | AAG|GTATGGTAAC...TTTTACTTGACT/TTTTACTTGACT...TTCAG|GAT | 0 | 1 | 56.138 |
| 48952948 | GT-AG | 0 | 0.0044534604268632 | 238 | rna-XM_036585626.1 9059369 | 30 | 18557703 | 18557940 | Colossoma macropomum 42526 | AAG|GTAACCCTAG...CTCCTTTTATCC/GATTTGCTCACT...TGCAG|GAG | 0 | 1 | 57.962 |
| 48952949 | GT-AG | 0 | 1.000000099473604e-05 | 412 | rna-XM_036585626.1 9059369 | 31 | 18557083 | 18557494 | Colossoma macropomum 42526 | CAG|GTATGGACAG...TTTGTTTTAAAT/ATATGTTTAATT...CGTAG|GTT | 1 | 1 | 60.544 |
| 48952950 | GT-AG | 0 | 1.000000099473604e-05 | 444 | rna-XM_036585626.1 9059369 | 32 | 18556418 | 18556861 | Colossoma macropomum 42526 | GAG|GTATTGCCTG...TTGTTCTCATGT/CTTGTTCTCATG...TTCAG|AGT | 0 | 1 | 63.287 |
| 48952951 | GT-AG | 0 | 1.000000099473604e-05 | 1043 | rna-XM_036585626.1 9059369 | 33 | 18555184 | 18556226 | Colossoma macropomum 42526 | TAG|GTAAATATGC...AGTGACTTAACA/CTCTTTGTTATT...TGCAG|GTT | 2 | 1 | 65.657 |
| 48952952 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-XM_036585626.1 9059369 | 34 | 18554899 | 18555009 | Colossoma macropomum 42526 | CAG|GTAAGCGTAG...TTGCTTATGATA/TTGGAATTAACA...CCCAG|GTA | 2 | 1 | 67.817 |
| 48952953 | GT-AG | 0 | 0.01119306076264 | 97 | rna-XM_036585626.1 9059369 | 35 | 18554660 | 18554756 | Colossoma macropomum 42526 | AAA|GTATGTTCTT...TGTGTTGTAACC/TGTGTTGTAACC...TGCAG|GTA | 0 | 1 | 69.579 |
| 48952954 | GT-AG | 0 | 1.1150697698455364e-05 | 97 | rna-XM_036585626.1 9059369 | 36 | 18554359 | 18554455 | Colossoma macropomum 42526 | CAG|GTACTACTAC...TGTCTTTTGATT/TGTCTTTTGATT...TCCAG|CAG | 0 | 1 | 72.111 |
| 48952955 | GT-AG | 0 | 1.000000099473604e-05 | 843 | rna-XM_036585626.1 9059369 | 37 | 18552957 | 18553799 | Colossoma macropomum 42526 | CAG|GTAAAAGCAG...TTTACCTTGATT/TTTTGTTTTACC...ATTAG|GCC | 1 | 1 | 79.049 |
| 48952956 | GT-AG | 0 | 6.949646012267661e-05 | 118 | rna-XM_036585626.1 9059369 | 38 | 18552715 | 18552832 | Colossoma macropomum 42526 | TTG|GTACGTGTAT...TGCCTTTTGATT/TGCCTTTTGATT...CCTAG|GTA | 2 | 1 | 80.588 |
| 48952957 | GT-AG | 0 | 1.000000099473604e-05 | 116 | rna-XM_036585626.1 9059369 | 39 | 18552373 | 18552488 | Colossoma macropomum 42526 | CAG|GTGAGCTTGT...TCTCTCTTGATG/CTGCTTTTTATT...GTTAG|GGC | 0 | 1 | 83.393 |
| 48952958 | GT-AG | 0 | 1.000000099473604e-05 | 636 | rna-XM_036585626.1 9059369 | 40 | 18551607 | 18552242 | Colossoma macropomum 42526 | TTG|GTAAGGAACC...TAACTTCTGATG/CTTTATGTAACT...TGCAG|GCT | 1 | 1 | 85.007 |
| 48952959 | GT-AG | 0 | 1.000000099473604e-05 | 252 | rna-XM_036585626.1 9059369 | 41 | 18551169 | 18551420 | Colossoma macropomum 42526 | ACG|GTGAGTCTCT...ACGTTTTTTTCT/CTGGTACTCACC...AATAG|GTA | 1 | 1 | 87.315 |
| 48952960 | GT-AG | 0 | 1.000000099473604e-05 | 763 | rna-XM_036585626.1 9059369 | 42 | 18550185 | 18550947 | Colossoma macropomum 42526 | CAA|GTAAGAAACA...CCCTTTTTATTT/CCCCTTTTTATT...AACAG|GTT | 0 | 1 | 90.058 |
| 48952961 | GT-AG | 0 | 1.319500948307649e-05 | 105 | rna-XM_036585626.1 9059369 | 43 | 18549991 | 18550095 | Colossoma macropomum 42526 | CAG|GTCAGCTTAA...AAACTCTTAATT/TTAAATCTAAAT...ACCAG|GAT | 2 | 1 | 91.163 |
| 48952962 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_036585626.1 9059369 | 44 | 18549730 | 18549833 | Colossoma macropomum 42526 | CAG|GTGTAACAGA...ATTCCTATAACC/TATATTATCATT...TACAG|AGT | 0 | 1 | 93.112 |
| 48952963 | GT-AG | 0 | 1.000000099473604e-05 | 769 | rna-XM_036585626.1 9059369 | 45 | 18548748 | 18549516 | Colossoma macropomum 42526 | GAG|GTAGGAGAAA...GCATTATTAACT/GCATTATTAACT...ACTAG|TGT | 0 | 1 | 95.755 |
| 48952964 | GT-AG | 0 | 3.554848347056602e-05 | 875 | rna-XM_036585626.1 9059369 | 46 | 18547765 | 18548639 | Colossoma macropomum 42526 | ACT|GTAAGATTCG...TTCTCTTTGAGC/GTTATAATTACT...TTCAG|GAG | 0 | 1 | 97.096 |
| 48952965 | GT-AG | 0 | 0.0008332973294237 | 901 | rna-XM_036585626.1 9059369 | 47 | 18546768 | 18547668 | Colossoma macropomum 42526 | CAG|GTACCACTGA...CTTTCTTTGAGG/ATGCTCTTTATC...TATAG|CAG | 0 | 1 | 98.287 |
| 48961095 | GT-AG | 0 | 1.000000099473604e-05 | 4120 | rna-XM_036585626.1 9059369 | 1 | 18578159 | 18582278 | Colossoma macropomum 42526 | GTG|GTGAGTGGTG...TTGATTTGAATC/TATGTACTGATT...TTCAG|GTT | 0 | 2.892 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);