introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
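The `length`, `start`, and `end` columns appear to be linked by 1-based, inclusive coordinates, so that `length = end - start + 1`. That convention is an inference from the rows listed on this page, not something the column descriptions state. A minimal sketch of the check, using Python's standard `sqlite3` module and a reduced in-memory table (the four-column table here is illustrative, not the real schema):

```python
import sqlite3

# Two rows copied from the listing for transcript_id = 9059387:
# (id, start, end, length)
rows = [
    (48953487, 21840859, 21861574, 20716),
    (48953488, 21818005, 21840597, 22593),
]

conn = sqlite3.connect(":memory:")
# Reduced, illustrative version of the introns table (four columns only).
# "end" is an SQL keyword, so the column names are quoted.
conn.execute(
    'CREATE TABLE introns ('
    '"id" INTEGER PRIMARY KEY, "start" INTEGER, "end" INTEGER, "length" INTEGER)'
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", rows)

# Assumption: coordinates are 1-based and inclusive.
mismatches = conn.execute(
    'SELECT id FROM introns WHERE "length" != "end" - "start" + 1'
).fetchall()
print(mismatches)  # an empty list means every row satisfies the relation
```

Running the same `WHERE` clause against the full table would show whether the relation holds beyond these two rows.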
34 rows where transcript_id = 9059387
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48953487 | GT-AG | 0 | 1.000000099473604e-05 | 20716 | rna-XM_036595285.1 9059387 | 1 | 21840859 | 21861574 | Colossoma macropomum 42526 | GCT\|GTGAGTAACG...CTTTTTTTGATG/CTTTTTTTGATG...CTCAG\|GTT | 1 | 1 | 0.488 |
| 48953488 | GT-AG | 0 | 1.000000099473604e-05 | 22593 | rna-XM_036595285.1 9059387 | 2 | 21818005 | 21840597 | Colossoma macropomum 42526 | AGG\|GTGAGGATGG...GTCCTCTAGATG/TAGATGCTAAGA...TGCAG\|TGC | 1 | 1 | 5.036 |
| 48953489 | GT-AG | 0 | 1.000000099473604e-05 | 1634 | rna-XM_036595285.1 9059387 | 3 | 21816300 | 21817933 | Colossoma macropomum 42526 | CTG\|GTAAGAACAC...TTACCCATAATC/CCCGCTCTGATC...TGCAG\|TGC | 0 | 1 | 6.273 |
| 48953490 | GT-AG | 0 | 1.000000099473604e-05 | 13562 | rna-XM_036595285.1 9059387 | 4 | 21802594 | 21816155 | Colossoma macropomum 42526 | CTC\|GTGAGGATTA...TATACCTTGACC/TATACCTTGACC...GTAAG\|GCT | 0 | 1 | 8.782 |
| 48953491 | GT-AG | 0 | 1.000000099473604e-05 | 3844 | rna-XM_036595285.1 9059387 | 5 | 21798641 | 21802484 | Colossoma macropomum 42526 | ATG\|GTAAGTTCAA...GTGTACTTATCG/TGTGTACTTATC...GTCAG\|GAG | 1 | 1 | 10.681 |
| 48953492 | GT-AG | 0 | 1.000000099473604e-05 | 2052 | rna-XM_036595285.1 9059387 | 6 | 21796464 | 21798515 | Colossoma macropomum 42526 | CTG\|GTAAAGATTC...TGTTATTTATTT/TTGTTATTTATT...TATAG\|CTA | 0 | 1 | 12.859 |
| 48953493 | GT-AG | 0 | 1.000000099473604e-05 | 3994 | rna-XM_036595285.1 9059387 | 7 | 21792344 | 21796337 | Colossoma macropomum 42526 | CAG\|GTCAGTATCT...TGTGTTTTATCT/GTGTGTTTTATC...TGTAG\|ACT | 0 | 1 | 15.055 |
| 48953494 | GT-AG | 0 | 1.000000099473604e-05 | 4697 | rna-XM_036595285.1 9059387 | 8 | 21787571 | 21792267 | Colossoma macropomum 42526 | CAG\|GTAAGACACA...TAACCTTTTTCT/CTAGCACTTATA...TTCAG\|ACA | 1 | 1 | 16.379 |
| 48953495 | GT-AG | 0 | 0.0010052831549695 | 625 | rna-XM_036595285.1 9059387 | 9 | 21786795 | 21787419 | Colossoma macropomum 42526 | ACT\|GTATGTACAT...AGTTGTTTATTG/GAGTTGTTTATT...TGCAG\|GAC | 2 | 1 | 19.01 |
| 48953496 | GT-AG | 0 | 1.000000099473604e-05 | 1824 | rna-XM_036595285.1 9059387 | 10 | 21784820 | 21786643 | Colossoma macropomum 42526 | CAG\|GTAAAGGAAG...ATTTCTTTACTG/TATTTCTTTACT...TTCAG\|GTG | 0 | 1 | 21.641 |
| 48953497 | GT-AG | 0 | 0.0021871533320362 | 4956 | rna-XM_036595285.1 9059387 | 11 | 21779753 | 21784708 | Colossoma macropomum 42526 | TAT\|GTACGTATAT...CTTTCCTTCTTT/CTTCTTTTCTTT...TGCAG\|ACT | 0 | 1 | 23.576 |
| 48953498 | GT-AG | 0 | 1.000000099473604e-05 | 2684 | rna-XM_036595285.1 9059387 | 12 | 21777003 | 21779686 | Colossoma macropomum 42526 | CTG\|GTAAGTGTAT...GTCTCCATATTT/CATATTTCCATC...ATCAG\|GTC | 0 | 1 | 24.726 |
| 48953499 | GT-AG | 0 | 1.000000099473604e-05 | 4873 | rna-XM_036595285.1 9059387 | 13 | 21772002 | 21776874 | Colossoma macropomum 42526 | CAG\|GTACAGCACA...GGATTTTTGCTT/AAAAGACCAACA...TTCAG\|TGG | 2 | 1 | 26.956 |
| 48953500 | GT-AG | 0 | 1.000000099473604e-05 | 1194 | rna-XM_036595285.1 9059387 | 14 | 21770702 | 21771895 | Colossoma macropomum 42526 | CAT\|GTGAGTACTT...ATTTGTTTATTT/AATTTGTTTATT...TTCAG\|GTG | 0 | 1 | 28.803 |
| 48953501 | GT-AG | 0 | 1.000000099473604e-05 | 1811 | rna-XM_036595285.1 9059387 | 15 | 21768773 | 21770583 | Colossoma macropomum 42526 | GAG\|GTCAGTATTT...TTTTTTTTATTT/TTTTTTTTTATT...GTCAG\|CCC | 1 | 1 | 30.859 |
| 48953502 | GT-AG | 0 | 1.000000099473604e-05 | 1214 | rna-XM_036595285.1 9059387 | 16 | 21767411 | 21768624 | Colossoma macropomum 42526 | CAG\|GTAAAAACAC...GTGTTTTTATGT/TGTGTTTTTATG...GTTAG\|GTA | 2 | 1 | 33.438 |
| 48953503 | GT-AG | 0 | 0.0031694963186208 | 5084 | rna-XM_036595285.1 9059387 | 17 | 21762212 | 21767295 | Colossoma macropomum 42526 | CTG\|GTATAGTCTG...CACACTTTGACT/CACACTTTGACT...CTCAG\|GAG | 0 | 1 | 35.442 |
| 48953504 | GT-AG | 0 | 1.000000099473604e-05 | 874 | rna-XM_036595285.1 9059387 | 18 | 21761220 | 21762093 | Colossoma macropomum 42526 | GAG\|GTCAGTGCAG...CTCTCTTTGTCT/GATGTACTGACC...TGTAG\|TGG | 1 | 1 | 37.498 |
| 48953505 | GT-AG | 0 | 0.0009106912922665 | 2923 | rna-XM_036595285.1 9059387 | 19 | 21758225 | 21761147 | Colossoma macropomum 42526 | AAG\|GTATGTTCTA...ACATTCTTATGT/AACATTCTTATG...TCCAG\|GGG | 1 | 1 | 38.752 |
| 48953506 | GT-AG | 0 | 6.8048660793739e-05 | 2288 | rna-XM_036595285.1 9059387 | 20 | 21755795 | 21758082 | Colossoma macropomum 42526 | TGG\|GTAGGTTTGA...TTCCCCTTGTTC/AATGTTATTATT...TACAG\|GGA | 2 | 1 | 41.227 |
| 48953507 | GT-AG | 0 | 1.000000099473604e-05 | 5155 | rna-XM_036595285.1 9059387 | 21 | 21750567 | 21755721 | Colossoma macropomum 42526 | CAG\|GTAAGGAGGT...ATGATGTTGATG/ATGATGTTGATG...TGTAG\|TTG | 0 | 1 | 42.499 |
| 48953508 | GT-AG | 0 | 1.000000099473604e-05 | 3501 | rna-XM_036595285.1 9059387 | 22 | 21746913 | 21750413 | Colossoma macropomum 42526 | CAG\|GTACAGAGAG...AGAGACTTAAAA/AAAAGGCTAACT...CGTAG\|AAG | 0 | 1 | 45.165 |
| 48953509 | GT-AG | 0 | 1.000000099473604e-05 | 1089 | rna-XM_036595285.1 9059387 | 23 | 21745623 | 21746711 | Colossoma macropomum 42526 | AAG\|GTGAGCTGCA...TTAGTGTTATTG/TTTAGTGTTATT...TGCAG\|ATG | 0 | 1 | 48.667 |
| 48953510 | GT-AG | 0 | 0.000353571847937 | 1283 | rna-XM_036595285.1 9059387 | 24 | 21744264 | 21745546 | Colossoma macropomum 42526 | AAT\|GTAAGCACTG...CTCTCCTTCTCC/TCCTTCTCCATC...AGCAG\|CCA | 1 | 1 | 49.991 |
| 48953511 | GT-AG | 0 | 0.0066530499860586 | 3522 | rna-XM_036595285.1 9059387 | 25 | 21740566 | 21744087 | Colossoma macropomum 42526 | AAG\|GTATGCTGTT...ACAGTGTTATCT/GTATGTGTCACA...TGCAG\|CTT | 0 | 1 | 53.058 |
| 48953512 | GT-AG | 0 | 1.000000099473604e-05 | 4112 | rna-XM_036595285.1 9059387 | 26 | 21736373 | 21740484 | Colossoma macropomum 42526 | CGG\|GTGAGTCTCA...TTATTTTAAACA/ATCTCTCTCACT...TCTAG\|AAC | 0 | 1 | 54.469 |
| 48953513 | GT-AG | 0 | 1.000000099473604e-05 | 12160 | rna-XM_036595285.1 9059387 | 27 | 21724007 | 21736166 | Colossoma macropomum 42526 | CAG\|GTAATGTGCT...CTCTTTTTAATG/CTCTTTTTAATG...TGTAG\|ATA | 2 | 1 | 58.059 |
| 48953514 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_036595285.1 9059387 | 28 | 21723814 | 21723903 | Colossoma macropomum 42526 | CAG\|GTAAGTACAC...ATACCCATACAA/AAAATACCCATA...ATCAG\|ATG | 0 | 1 | 59.854 |
| 48953515 | GT-AG | 0 | 1.000000099473604e-05 | 96 | rna-XM_036595285.1 9059387 | 29 | 21723621 | 21723716 | Colossoma macropomum 42526 | AAG\|GTAATACCAG...TGCATCTTTGTG/GTAGTGTTCACT...GGCAG\|TGG | 1 | 1 | 61.544 |
| 48953516 | GT-AG | 0 | 1.000000099473604e-05 | 3573 | rna-XM_036595285.1 9059387 | 30 | 21719804 | 21723376 | Colossoma macropomum 42526 | GAA\|GTGAGCGGTC...AGTAATTTGATT/ATTTGATTAATT...CTCAG\|GCC | 2 | 1 | 65.795 |
| 48953517 | GT-AG | 0 | 0.001167499916907 | 874 | rna-XM_036595285.1 9059387 | 31 | 21718559 | 21719432 | Colossoma macropomum 42526 | ACC\|GTACGTATAA...TGATCCATATTA/TCCATATTAACA...TGCAG\|CTC | 1 | 1 | 72.26 |
| 48953518 | GT-AG | 0 | 1.000000099473604e-05 | 3917 | rna-XM_036595285.1 9059387 | 32 | 21713469 | 21717385 | Colossoma macropomum 42526 | GCG\|GTGAATCTAA...GTTTCCTTGATT/GTTTCCTTGATT...TGCAG\|GTT | 1 | 1 | 92.699 |
| 48953519 | GT-AG | 0 | 0.0002097744585591 | 1865 | rna-XM_036595285.1 9059387 | 33 | 21711451 | 21713315 | Colossoma macropomum 42526 | ACG\|GTAACTGTAA...CATTTCTCACTG/ACATTTCTCACT...AACAG\|GGA | 1 | 1 | 95.365 |
| 48953520 | GT-AG | 0 | 1.000000099473604e-05 | 15641 | rna-XM_036595285.1 9059387 | 34 | 21695724 | 21711364 | Colossoma macropomum 42526 | CAG\|GTAAAGTATA...CTCTCCCTGTCC/TGAAGGCTAAGT...CTCAG\|GTC | 0 | 1 | 96.864 |
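The listing above is the result of filtering `introns` on `transcript_id = 9059387`. In these rows `start` decreases as `ordinal_index` increases, which would be consistent with a transcript annotated on the reverse strand; that reading is an inference from the data, not a documented property. A minimal sketch of the filter and the ordering check, using Python's `sqlite3` with a reduced in-memory table (three rows copied from the listing plus one hypothetical row for another transcript):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced, illustrative table: only the columns needed for this check.
conn.execute(
    'CREATE TABLE introns ('
    '"id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER)'
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (48953487, 9059387, 1, 21840859, 21861574),
        (48953488, 9059387, 2, 21818005, 21840597),
        (48953489, 9059387, 3, 21816300, 21817933),
        (1, 1, 1, 100, 200),  # hypothetical row for an unrelated transcript
    ],
)

# Same filter as the page, ordered by position within the transcript.
result = conn.execute(
    'SELECT "ordinal_index", "start", "end" FROM introns '
    'WHERE "transcript_id" = ? ORDER BY "ordinal_index"',
    (9059387,),
).fetchall()

starts = [start for _, start, _ in result]
# True when genomic start falls as the intron ordinal rises.
descending = starts == sorted(starts, reverse=True)
print(len(result), descending)
```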
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
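The index on `transcript_id` is what lets a per-transcript filter avoid scanning the whole table. A minimal sketch, assuming a cut-down copy of the schema in an in-memory database (three columns only, foreign keys omitted), asks SQLite for its query plan with `EXPLAIN QUERY PLAN`. The exact wording of the plan text differs between SQLite versions, so the sketch only looks for the index name:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Cut-down, illustrative copy of the schema: one indexed column is enough here.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "transcript_id" INTEGER,
      "phase" INTEGER,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
      ON [introns] ([transcript_id]);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (9059387,),
).fetchall()

# The human-readable description is the last (fourth) column of each plan row.
plan_text = " ".join(row[3] for row in plan)
uses_index = "idx_introns_transcript_id" in plan_text
print(plan_text)
```

The same approach can be applied to the other indexed columns (`taxonomy_id`, `phase`, `is_minor`, `dinucleotide_pair`, `score`, `in_cds`) to see which filters the planner can serve from an index.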