introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
32 rows where transcript_id = 6092548
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 31463456 | GT-AG | 0 | 1.000000099473604e-05 | 66357 | rna-XM_030957750.1 6092548 | 1 | 4179782 | 4246138 | Camarhynchus parvulus 87175 | ATG|GTGAGCCCCG...TTCTCCTTGTCT/CAGGTTTTCAAG...TGCAG|TTT | 1 | 1 | 0.713 |
| 31463457 | GT-AG | 0 | 1.000000099473604e-05 | 5584 | rna-XM_030957750.1 6092548 | 2 | 4173880 | 4179463 | Camarhynchus parvulus 87175 | CTG|GTAAGTTGAA...ATTTCTGTAACA/ATTTCTGTAACA...TGCAG|TTT | 1 | 1 | 5.987 |
| 31463458 | GT-AG | 0 | 0.0001124483454372 | 197997 | rna-XM_030957750.1 6092548 | 3 | 3975736 | 4173732 | Camarhynchus parvulus 87175 | CAG|GTAGGCTCTT...GATATTTTAAAC/GATATTTTAAAC...TTCAG|GAC | 1 | 1 | 8.425 |
| 31463459 | GT-AG | 0 | 2.3396343107810705e-05 | 6318 | rna-XM_030957750.1 6092548 | 4 | 3969271 | 3975588 | Camarhynchus parvulus 87175 | CAG|GTGCTCCTTT...CGTTGCTTAGCG/TTAATTTTAAGT...TTCAG|ACC | 1 | 1 | 10.862 |
| 31463460 | GT-AG | 0 | 0.0002519867132224 | 6536 | rna-XM_030957750.1 6092548 | 5 | 3962456 | 3968991 | Camarhynchus parvulus 87175 | CAC|GTAGGTCCTG...TCTCCCTTATTT/CCTTATTTCAAC...TCCAG|AAC | 1 | 1 | 15.489 |
| 31463461 | GT-AG | 0 | 1.000000099473604e-05 | 3723 | rna-XM_030957750.1 6092548 | 6 | 3958457 | 3962179 | Camarhynchus parvulus 87175 | AAG|GTCAGTCTCC...CAAGCTTTAAAC/CAAGCTTTAAAC...TGCAG|ACG | 1 | 1 | 20.066 |
| 31463462 | GT-AG | 0 | 1.000000099473604e-05 | 818 | rna-XM_030957750.1 6092548 | 7 | 3957342 | 3958159 | Camarhynchus parvulus 87175 | GAG|GTGCTTGTCA...ATGCCTTTAAAT/ATTGTCCTCAAG...GGCAG|GGC | 1 | 1 | 24.992 |
| 31463463 | GT-AG | 0 | 1.000000099473604e-05 | 13094 | rna-XM_030957750.1 6092548 | 8 | 3943972 | 3957065 | Camarhynchus parvulus 87175 | AAG|GTGAGCACAG...GTGGCTTCAACA/CGTGGCTTCAAC...TCCAG|TCC | 1 | 1 | 29.569 |
| 31463464 | GT-AG | 0 | 1.000000099473604e-05 | 11393 | rna-XM_030957750.1 6092548 | 9 | 3932300 | 3943692 | Camarhynchus parvulus 87175 | GAG|GTGAGGAGGT...TCTCCCTTTGCT/GGAATTCTGATG...CACAG|TGC | 1 | 1 | 34.196 |
| 31463465 | GT-AG | 0 | 1.000000099473604e-05 | 4689 | rna-XM_030957750.1 6092548 | 10 | 3927491 | 3932179 | Camarhynchus parvulus 87175 | AAG|GTAGGACTCC...TTCTTTTTAACC/TTCTTTTTAACC...TGCAG|GTG | 1 | 1 | 36.186 |
| 31463466 | GT-AG | 0 | 1.000000099473604e-05 | 32002 | rna-XM_030957750.1 6092548 | 11 | 3895315 | 3927316 | Camarhynchus parvulus 87175 | AAA|GTAAGAACTG...ACAACCTTGTTT/CTTCAGCTCACT...TCTAG|TTC | 1 | 1 | 39.071 |
| 31463467 | GT-AG | 0 | 1.000000099473604e-05 | 2510 | rna-XM_030957750.1 6092548 | 12 | 3892608 | 3895117 | Camarhynchus parvulus 87175 | CAG|GTAAGAAAAC...TTTATTTTCTCT/GTGCTGTTCAAT...TATAG|ATT | 0 | 1 | 42.338 |
| 31463468 | GT-AG | 0 | 1.000000099473604e-05 | 1355 | rna-XM_030957750.1 6092548 | 13 | 3891156 | 3892510 | Camarhynchus parvulus 87175 | AAG|GTAGGCAGAG...GTGTTCTGAGCA/TGTGTTCTGAGC...CCCAG|AGC | 1 | 1 | 43.947 |
| 31463469 | GT-AG | 0 | 1.000000099473604e-05 | 15334 | rna-XM_030957750.1 6092548 | 14 | 3875693 | 3891026 | Camarhynchus parvulus 87175 | CAG|GTAGGCAACA...TTTCTATTGATA/TTTCTATTGATA...ATCAG|ACT | 1 | 1 | 46.086 |
| 31463470 | GT-AG | 0 | 4.61979305738667e-05 | 3603 | rna-XM_030957750.1 6092548 | 15 | 3871922 | 3875524 | Camarhynchus parvulus 87175 | CAG|GTATGACCTG...TTTTCCTTCTCT/CCACTTTTGATG...TCCAG|CCC | 1 | 1 | 48.872 |
| 31463471 | GT-AG | 0 | 1.000000099473604e-05 | 5994 | rna-XM_030957750.1 6092548 | 16 | 3865857 | 3871850 | Camarhynchus parvulus 87175 | AAG|GTAAATACTC...TTTCCATTTTCT/TCATGTGTAACT...CTCAG|GCT | 0 | 1 | 50.05 |
| 31463472 | GT-AG | 0 | 1.000000099473604e-05 | 1669 | rna-XM_030957750.1 6092548 | 17 | 3863947 | 3865615 | Camarhynchus parvulus 87175 | ACG|GTGGGTCCTT...GTTGTTTTCATT/GTTGTTTTCATT...TCCAG|TGC | 1 | 1 | 54.046 |
| 31463473 | GT-AG | 0 | 1.000000099473604e-05 | 1746 | rna-XM_030957750.1 6092548 | 18 | 3862054 | 3863799 | Camarhynchus parvulus 87175 | GAG|GTGAGAAACA...TTCTTTTTATTT/TTTTATTTCATT...TATAG|AGC | 1 | 1 | 56.484 |
| 31463474 | GT-AG | 0 | 1.000000099473604e-05 | 8638 | rna-XM_030957750.1 6092548 | 19 | 3853260 | 3861897 | Camarhynchus parvulus 87175 | ATG|GTAAGGAGTG...GCCACTCTAACA/GCCACTCTAACA...TCCAG|TTC | 1 | 1 | 59.071 |
| 31463475 | GT-AG | 0 | 1.000000099473604e-05 | 13408 | rna-XM_030957750.1 6092548 | 20 | 3839718 | 3853125 | Camarhynchus parvulus 87175 | ACA|GTAAGTGCCA...CTTATTTTAGCC/ATACTGCTTATT...CACAG|GTG | 0 | 1 | 61.294 |
| 31463476 | GT-AG | 0 | 1.000000099473604e-05 | 7532 | rna-XM_030957750.1 6092548 | 21 | 3832032 | 3839563 | Camarhynchus parvulus 87175 | AAG|GTACAGGGAC...ATGTCCGTGACA/GGACACCTCACT...CCCAG|CTC | 1 | 1 | 63.847 |
| 31463477 | GT-AG | 0 | 0.0292908330918955 | 809 | rna-XM_030957750.1 6092548 | 22 | 3831105 | 3831913 | Camarhynchus parvulus 87175 | TAG|GTAACCAGCA...ACTTCTTTAATG/TTAATTTTGATT...GGCAG|TAA | 2 | 1 | 65.804 |
| 31463478 | GT-AG | 0 | 1.000000099473604e-05 | 2802 | rna-XM_030957750.1 6092548 | 23 | 3828139 | 3830940 | Camarhynchus parvulus 87175 | AAG|GTAAGGCTGT...CTATTTTTAACT/CTATTTTTAACT...TCTAG|TTC | 1 | 1 | 68.524 |
| 31463479 | GT-AG | 0 | 1.000000099473604e-05 | 552 | rna-XM_030957750.1 6092548 | 24 | 3827488 | 3828039 | Camarhynchus parvulus 87175 | GAG|GTAATGTAAT...GGATTTGTGATA/GGATTTGTGATA...TCCAG|GTT | 1 | 1 | 70.166 |
| 31463480 | GT-AG | 0 | 1.000000099473604e-05 | 1373 | rna-XM_030957750.1 6092548 | 25 | 3825926 | 3827298 | Camarhynchus parvulus 87175 | AAG|GTGCGTTGGT...TTTGTCTGGAAC/CAATATTTAAAG...TCTAG|AGC | 1 | 1 | 73.3 |
| 31463481 | GT-AG | 0 | 0.0001348202875985 | 4872 | rna-XM_030957750.1 6092548 | 26 | 3820763 | 3825634 | Camarhynchus parvulus 87175 | GCA|GTAAGTATTT...AATGCCATGACC/AGGCCTCTAATG...TGCAG|GTA | 1 | 1 | 78.126 |
| 31463482 | GT-AG | 0 | 1.000000099473604e-05 | 7281 | rna-XM_030957750.1 6092548 | 27 | 3813305 | 3820585 | Camarhynchus parvulus 87175 | GAG|GTAAGGGGGT...TAGATTTTGATT/TAGATTTTGATT...TCTAG|ATG | 1 | 1 | 81.061 |
| 31463483 | GT-AG | 0 | 1.000000099473604e-05 | 6627 | rna-XM_030957750.1 6092548 | 28 | 3806647 | 3813273 | Camarhynchus parvulus 87175 | GAG|GTAAGAACAA...AATGTCTTCATT/TATCTTTTCATT...TTCAG|CAA | 2 | 1 | 81.575 |
| 31463484 | GT-AG | 0 | 7.724640207015087e-05 | 2886 | rna-XM_030957750.1 6092548 | 29 | 3803645 | 3806530 | Camarhynchus parvulus 87175 | TTG|GTAAATTTGA...CATTCTCTGATT/TCTGTGCTCATT...TACAG|ATG | 1 | 1 | 83.499 |
| 31463485 | GT-AG | 0 | 2.791599034082247e-05 | 4448 | rna-XM_030957750.1 6092548 | 30 | 3799047 | 3803494 | Camarhynchus parvulus 87175 | CAA|GTAAGCACAT...GTTTCCTTCCCA/GTGATTCTGACA...CACAG|ATC | 1 | 1 | 85.987 |
| 31463486 | GT-AG | 0 | 1.000000099473604e-05 | 3633 | rna-XM_030957750.1 6092548 | 31 | 3795216 | 3798848 | Camarhynchus parvulus 87175 | CAG|GTACGGACAG...CTCCCGTTATCC/TCTCCCGTTATC...CTCAG|ACC | 1 | 1 | 89.27 |
| 31463487 | GT-AG | 0 | 1.000000099473604e-05 | 19336 | rna-XM_030957750.1 6092548 | 32 | 3775577 | 3794912 | Camarhynchus parvulus 87175 | CAG|GTGAGGCAGC...TCTCTTTTGCCT/TAAAATCTCACT...CCCAG|GTG | 1 | 1 | 94.295 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);