introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
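As a reading aid, the column list above can be mirrored by a small typed record. This is only a sketch: the class name `Intron` and the `minor` property are hypothetical, and the example values are copied from the first row of the table on this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """One row of the introns table; field names follow the column list."""
    id: int
    dinucleotide_pair: str
    is_minor: int              # 1 = minor intron, 0 = not
    score: float               # probability of being minor, on a 0-100 scale
    length: int                # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]       # None (SQL NULL) outside the coding sequence
    in_cds: int                # 1 = within the coding sequence, 0 = not
    relative_position: float   # percentage of coding length

    @property
    def minor(self) -> bool:
        # Hypothetical convenience accessor for the 0/1 flag.
        return self.is_minor == 1


# Values taken from the first row shown for transcript 1668849.
example = Intron(
    id=9479253, dinucleotide_pair="GT-AG", is_minor=0,
    score=1.000000099473604e-05, length=5259, transcript_id=1668849,
    ordinal_index=1, start=1474369, end=1479627, taxonomy_id=56067,
    scored_motifs="CAG|GTAAAAATAC...TTTTCCTTCTCT/TTGCACTTCATA...GCTAG|GCC",
    phase=2, in_cds=1, relative_position=3.269,
)
print(example.minor, example.phase)
```

Typing `phase` as `Optional[int]` keeps the null case for introns outside the coding sequence distinct from phase 0.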
29 rows where transcript_id = 1668849
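The filter behind this view can be reproduced against a local SQLite copy of the table. The sketch below is an assumption-laden illustration: it builds a reduced in-memory table (only four of the columns) seeded with three rows from this page plus one made-up row, and `introns_for_transcript` is a hypothetical helper name.

```python
import sqlite3

# Reduced stand-in for the real table: only four of its columns are created.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "length" INTEGER)'
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (9479253, 1668849, 1, 5259),
        (9479254, 1668849, 2, 3243),
        (9479255, 1668849, 3, 3520),
        (1, 42, 1, 100),  # made-up row belonging to another transcript
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, length) for one transcript, ordered by id."""
    return conn.execute(
        "SELECT id, ordinal_index, length FROM introns "
        "WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),  # bound parameter instead of string formatting
    ).fetchall()


rows = introns_for_transcript(conn, 1668849)
print(len(rows))
```

Against the full database the same query would be expected to return the 29 rows listed here.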
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9479253 | GT-AG | 0 | 1.000000099473604e-05 | 5259 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 1 | 1474369 | 1479627 | Anhinga anhinga 56067 | CAG\|GTAAAAATAC...TTTTCCTTCTCT/TTGCACTTCATA...GCTAG\|GCC | 2 | 1 | 3.269 |
| 9479254 | GT-AG | 0 | 2.606560289788741e-05 | 3243 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 2 | 1470982 | 1474224 | Anhinga anhinga 56067 | TCA\|GTAAGTATGA...AGGACTTTGTCC/TGGCCTCTAAGA...GACAG\|CAC | 2 | 1 | 5.9 |
| 9479255 | GT-AG | 0 | 1.000000099473604e-05 | 3520 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 3 | 1467278 | 1470797 | Anhinga anhinga 56067 | CAG\|GTGAGCTACT...AGTTTCTTAATA/AGTTTCTTAATA...TTTAG\|GTT | 0 | 1 | 9.26 |
| 9479256 | GT-AG | 0 | 2.2926584635687603e-05 | 3687 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 4 | 1463331 | 1467017 | Anhinga anhinga 56067 | AAG\|GTATAAACAG...AATATCTTATAG/AAATATCTTATA...TTTAG\|ATT | 2 | 1 | 14.009 |
| 9479257 | GT-AG | 0 | 4.114512674840223e-05 | 6140 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 5 | 1456878 | 1463017 | Anhinga anhinga 56067 | CAG\|GTACTAACCT...TGTTTCTTAATT/CTGTTTCTTAAT...CCTAG\|ATA | 0 | 1 | 19.726 |
| 9479258 | GT-AG | 0 | 1.000000099473604e-05 | 739 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 6 | 1455954 | 1456692 | Anhinga anhinga 56067 | ACG\|GTAAATGAAT...GTTACTTTGATA/TTTTAAGTGACT...TGAAG\|GAC | 2 | 1 | 23.105 |
| 9479259 | GT-AG | 0 | 0.0004230984504621 | 727 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 7 | 1455101 | 1455827 | Anhinga anhinga 56067 | GAA\|GTACGTTAGA...TATTTTTTTTCT/GTTTATTTGAAA...CCTAG\|GGA | 2 | 1 | 25.406 |
| 9479260 | GT-AG | 0 | 1.000000099473604e-05 | 2926 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 8 | 1452042 | 1454967 | Anhinga anhinga 56067 | AGA\|GTAAGTAAAA...CACTTGTTAATA/CACTTGTTAATA...ATTAG\|GAT | 0 | 1 | 27.836 |
| 9479261 | GT-AG | 0 | 1.000000099473604e-05 | 672 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 9 | 1451242 | 1451913 | Anhinga anhinga 56067 | AAG\|GTAGGGTTTG...TCCTGCTTATTT/GTCCTGCTTATT...GCCAG\|AGT | 2 | 1 | 30.174 |
| 9479262 | GT-AG | 0 | 1.000000099473604e-05 | 515 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 10 | 1450551 | 1451065 | Anhinga anhinga 56067 | AAG\|GTAATGGCGT...TCATGTTTATTT/TTCATGTTTATT...TTCAG\|ATA | 1 | 1 | 33.388 |
| 9479263 | GT-AG | 0 | 0.0025045192181798 | 731 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 11 | 1449716 | 1450446 | Anhinga anhinga 56067 | GCT\|GTAAGTCTTG...GTTTTTTTAACA/GTTTTTTTAACA...TTTAG\|ATC | 0 | 1 | 35.288 |
| 9479264 | GT-AG | 0 | 1.000000099473604e-05 | 993 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 12 | 1448630 | 1449622 | Anhinga anhinga 56067 | AAG\|GTGAGTAGGA...TCAGCCATAATA/AAGGTGATCACT...GTTAG\|GTT | 0 | 1 | 36.986 |
| 9479265 | GT-AG | 0 | 1.000000099473604e-05 | 3099 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 13 | 1444517 | 1447615 | Anhinga anhinga 56067 | CAG\|GTAGAAACTT...TTCTTCTGAATC/TTTCTTCTGAAT...TGTAG\|GTG | 0 | 1 | 55.507 |
| 9479266 | GT-AG | 0 | 1.000000099473604e-05 | 2510 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 14 | 1441868 | 1444377 | Anhinga anhinga 56067 | CAG\|GTGAGTGACA...CTGTTTTTATCA/CCTGTTTTTATC...TTAAG\|CAG | 1 | 1 | 58.046 |
| 9479267 | GT-AG | 0 | 0.0017834375199836 | 508 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 15 | 1441212 | 1441719 | Anhinga anhinga 56067 | CAG\|GTGTCTTGAA...GGTGCTTTGCCC/CCAGTTCTAAGT...TGTAG\|TTC | 2 | 1 | 60.749 |
| 9479268 | GT-AG | 0 | 1.000000099473604e-05 | 1058 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 16 | 1440032 | 1441089 | Anhinga anhinga 56067 | TTG\|GTAAGTCACT...TTGCTTTTACTT/CTTTTACTTATG...AATAG\|GTT | 1 | 1 | 62.977 |
| 9479269 | GT-AG | 0 | 1.000000099473604e-05 | 1260 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 17 | 1438682 | 1439941 | Anhinga anhinga 56067 | GAG\|GTGAATTATT...CAGCTGTTAATT/CAGCTGTTAATT...CGTAG\|GCT | 1 | 1 | 64.621 |
| 9479270 | GT-AG | 0 | 1.000000099473604e-05 | 3931 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 18 | 1434582 | 1438512 | Anhinga anhinga 56067 | TAG\|GTAAGGTCCC...TCAGTTTTAACT/TCAGTTTTAACT...AAAAG\|GCC | 2 | 1 | 67.708 |
| 9479271 | GT-AG | 0 | 1.000000099473604e-05 | 915 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 19 | 1433504 | 1434418 | Anhinga anhinga 56067 | GAG\|GTAATTGTTT...GAAAACTTGACA/GAAAACTTGACA...CATAG\|TTT | 0 | 1 | 70.685 |
| 9479272 | GT-AG | 0 | 1.000000099473604e-05 | 593 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 20 | 1432777 | 1433369 | Anhinga anhinga 56067 | AAG\|GTAAGGCTGG...TTCTTTTTATTT/TTTCTTTTTATT...TTTAG\|TTC | 2 | 1 | 73.132 |
| 9479273 | GT-AG | 0 | 1.000000099473604e-05 | 866 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 21 | 1431746 | 1432611 | Anhinga anhinga 56067 | AAG\|GTAGAGAGTA...CGAGTTTTAAAA/CGAGTTTTAAAA...TAAAG\|GGG | 2 | 1 | 76.146 |
| 9479274 | GT-AG | 0 | 1.000000099473604e-05 | 974 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 22 | 1430588 | 1431561 | Anhinga anhinga 56067 | GAG\|GTAAGCTATT...CTTTCTGTGTTT/TACAATTTCACA...TGCAG\|TAT | 0 | 1 | 79.507 |
| 9479275 | GT-AG | 0 | 4.17928055965549e-05 | 298 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 23 | 1430141 | 1430438 | Anhinga anhinga 56067 | CAG\|GTATTTGAGC...TTCCCTTGAGTT/CTTGAGTTGATG...TCCAG\|AGC | 2 | 1 | 82.228 |
| 9479276 | GT-AG | 0 | 1.000000099473604e-05 | 1143 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 24 | 1428827 | 1429969 | Anhinga anhinga 56067 | CAT\|GTGAGTGACC...TTAAGTTTAAAT/TTAAGTTTAAAT...TATAG\|GAT | 2 | 1 | 85.352 |
| 9479277 | GT-AG | 0 | 1.000000099473604e-05 | 714 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 25 | 1427904 | 1428617 | Anhinga anhinga 56067 | TTT\|GTTATTCTTG...ATACTCTCATAA/AATACTCTCATA...TGTAG\|ATT | 1 | 1 | 89.169 |
| 9479278 | GT-AG | 0 | 1.000000099473604e-05 | 182 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 26 | 1427578 | 1427759 | Anhinga anhinga 56067 | TTG\|GTAAGTATGC...TGTGCCTAAGGA/CTGTGCCTAAGG...AACAG\|AGG | 1 | 1 | 91.799 |
| 9479279 | GT-AG | 0 | 1.000000099473604e-05 | 790 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 27 | 1426660 | 1427449 | Anhinga anhinga 56067 | AAG\|GTGAGTCACC...TGCTCTTTAACT/CTTTAACTGAAT...CTTAG\|GAT | 0 | 1 | 94.137 |
| 9479280 | GT-AG | 0 | 1.2912094040305653e-05 | 1784 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 28 | 1424766 | 1426549 | Anhinga anhinga 56067 | CAA\|GTAAGTCTGT...TTTTTCTTTTTT/AAATTACTCATA...TGCAG\|AGA | 2 | 1 | 96.146 |
| 9479281 | GT-AG | 0 | 1.000000099473604e-05 | 2388 | rna-gnl\|WGS:WBMU\|ANHANH_R12137_mrna 1668849 | 29 | 1422268 | 1424655 | Anhinga anhinga 56067 | CAG\|GTTAGTATTT...GGGTTCTTTTTC/TCCAGTCTCATT...TCCAG\|GCA | 1 | 1 | 98.155 |
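Two regularities in the rows above can be checked mechanically. The sketch below uses three rows copied from the table and two hypothetical helpers. It assumes that `start` and `end` are inclusive coordinates, and that the text between the first and last `|` of `scored_motifs` is the intronic portion, so that its first and last two bases give `dinucleotide_pair`. Both assumptions hold for the rows shown but are not stated by the source.

```python
rows = [
    # (ordinal_index, start, end, length, dinucleotide_pair, scored_motifs)
    (1, 1474369, 1479627, 5259, "GT-AG",
     "CAG|GTAAAAATAC...TTTTCCTTCTCT/TTGCACTTCATA...GCTAG|GCC"),
    (2, 1470982, 1474224, 3243, "GT-AG",
     "TCA|GTAAGTATGA...AGGACTTTGTCC/TGGCCTCTAAGA...GACAG|CAC"),
    (29, 1422268, 1424655, 2388, "GT-AG",
     "CAG|GTTAGTATTT...GGGTTCTTTTTC/TCCAGTCTCATT...TCCAG|GCA"),
]


def inclusive_length(start, end):
    """Length of an interval whose start and end are both counted."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs):
    """First and last two bases of the segment between the outer '|' marks."""
    _, intronic, _ = scored_motifs.split("|")
    return f"{intronic[:2]}-{intronic[-2:]}"


for ordinal, start, end, length, pair, motifs in rows:
    assert inclusive_length(start, end) == length
    assert terminal_dinucleotides(motifs) == pair

# start falls as ordinal_index rises in these rows.
descending = all(a[1] > b[1] for a, b in zip(rows, rows[1:]))
print(descending)
```

The falling `start` values would be consistent with a transcript annotated on the reverse strand, though the table itself carries no strand column to confirm that.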
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
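The foreign keys declared above are only enforced by SQLite when the `foreign_keys` pragma is switched on for the connection, and in most builds it is off by default. The sketch below illustrates this with a cut-down schema. The `transcripts` and `genomes` definitions are hypothetical single-column stubs, since their real schemas are not shown here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # enforcement is per connection
conn.executescript("""
CREATE TABLE transcripts (id INTEGER PRIMARY KEY);       -- hypothetical stub
CREATE TABLE genomes (taxonomy_id INTEGER PRIMARY KEY);  -- hypothetical stub
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    transcript_id INTEGER REFERENCES transcripts(id),
    taxonomy_id INTEGER REFERENCES genomes(taxonomy_id)
);
INSERT INTO transcripts VALUES (1668849);
INSERT INTO genomes VALUES (56067);
""")

# Parent rows exist, so this insert is accepted.
conn.execute("INSERT INTO introns VALUES (9479253, 1668849, 56067)")

# No transcript 999 exists, so this insert should be rejected.
try:
    conn.execute("INSERT INTO introns VALUES (1, 999, 56067)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

print(rejected)
```

Without the pragma, the second insert would be accepted and leave an intron pointing at a transcript that does not exist.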
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
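Whether one of these indexes is used for a given filter can be checked with `EXPLAIN QUERY PLAN`. The sketch below recreates only `idx_introns_transcript_id` on a reduced, empty in-memory table, so it shows the planner's choice for an equality filter rather than real timings.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE introns (id INTEGER PRIMARY KEY, transcript_id INTEGER, score REAL);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id, score FROM introns WHERE transcript_id = ?",
    (1668849,),
).fetchall()

# The last column of each plan row is the human-readable detail string.
details = [row[-1] for row in plan]
print(details)
```

The exact wording of the detail string differs between SQLite versions, so it is safer to look for the index name than to match the whole line.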