introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
31 rows where transcript_id = 20429258
This data as json, CSV (advanced)
Suggested facets: is_minor, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 109622325 | GT-AG | 0 | 0.0931871289139592 | 99 | rna-XM_042646780.1 20429258 | 1 | 12498154 | 12498252 | Macadamia integrifolia 60698 | ATG|GTATGCTCTT...TGGGTTTTGATT/TGGGTTTTGATT...GCTAG|GTA | 0 | 1 | 2.372 |
| 109622326 | GT-AG | 0 | 3.833296157750493e-05 | 2012 | rna-XM_042646780.1 20429258 | 2 | 12496016 | 12498027 | Macadamia integrifolia 60698 | GAT|GTAAGTACTT...TATTTTTTACTT/TTTTTACTTATC...TTCAG|ATT | 0 | 1 | 4.586 |
| 109622327 | GT-AG | 0 | 0.0011680888125918 | 13250 | rna-XM_042646780.1 20429258 | 3 | 12482724 | 12495973 | Macadamia integrifolia 60698 | ATG|GTAGATTTTT...TGGTTCTTATTT/TATTTTTTTATT...TTTAG|GCG | 0 | 1 | 5.324 |
| 109622328 | GT-AG | 0 | 1.000000099473604e-05 | 108 | rna-XM_042646780.1 20429258 | 4 | 12482556 | 12482663 | Macadamia integrifolia 60698 | GAG|GTGGGTTTAG...TAGTTTTTATTT/GTAGTTTTTATT...TGTAG|GAA | 0 | 1 | 6.378 |
| 109622329 | GT-AG | 0 | 1.000000099473604e-05 | 916 | rna-XM_042646780.1 20429258 | 5 | 12481577 | 12482492 | Macadamia integrifolia 60698 | GAG|GTAAGCATAT...TATTTTTTTCCA/TTTTTTTCCATT...TGCAG|GTG | 0 | 1 | 7.486 |
| 109622330 | GT-AG | 0 | 0.0001194211300674 | 241 | rna-XM_042646780.1 20429258 | 6 | 12481255 | 12481495 | Macadamia integrifolia 60698 | CTG|GTATGGTTAT...CTTTCCTTCCCG/CCGCTATTCACT...TGTAG|GTC | 0 | 1 | 8.909 |
| 109622331 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_042646780.1 20429258 | 7 | 12481075 | 12481179 | Macadamia integrifolia 60698 | AAG|GTCTGACCTC...TTTTTTTTAATG/TTTTTTTTAATG...TTCAG|TCA | 0 | 1 | 10.227 |
| 109622332 | GT-AG | 0 | 1.501272384192026e-05 | 178 | rna-XM_042646780.1 20429258 | 8 | 12480849 | 12481026 | Macadamia integrifolia 60698 | ATT|GTAAGTGGTT...GATATTTTATTT/TAAATTCTCATG...ATCAG|GTT | 0 | 1 | 11.07 |
| 109622333 | GT-AG | 0 | 1.000000099473604e-05 | 100 | rna-XM_042646780.1 20429258 | 9 | 12480680 | 12480779 | Macadamia integrifolia 60698 | AAG|GTAGGGAGGT...TCCTCATTAATT/CGTTTCCTCATT...TACAG|TCC | 0 | 1 | 12.283 |
| 109622334 | GT-AG | 0 | 3.864789175762372e-05 | 10083 | rna-XM_042646780.1 20429258 | 10 | 12470463 | 12480545 | Macadamia integrifolia 60698 | CAT|GTAAGTATTG...CATTCTGTAATA/AATGTTTTCATT...TTCAG|ATA | 2 | 1 | 14.637 |
| 109622335 | GT-AG | 0 | 2.352519966041517e-05 | 162 | rna-XM_042646780.1 20429258 | 11 | 12470225 | 12470386 | Macadamia integrifolia 60698 | GAG|GTATATGAAT...TGGTGCTTATAC/TTGGTGCTTATA...TGCAG|GCT | 0 | 1 | 15.973 |
| 109622336 | GT-AG | 0 | 1.000000099473604e-05 | 685 | rna-XM_042646780.1 20429258 | 12 | 12469346 | 12470030 | Macadamia integrifolia 60698 | CTG|GTAAGTAGTT...ACCTTTTTACTT/TACCTTTTTACT...TTCAG|GTA | 2 | 1 | 19.381 |
| 109622337 | GT-AG | 0 | 0.0009281511738304 | 382 | rna-XM_042646780.1 20429258 | 13 | 12468883 | 12469264 | Macadamia integrifolia 60698 | TAG|GTAATCTTCA...AGTTTTTTATTA/TAGTTTTTTATT...TCTAG|GAT | 2 | 1 | 20.805 |
| 109622338 | GT-AG | 0 | 0.0122335750788269 | 113 | rna-XM_042646780.1 20429258 | 14 | 12468559 | 12468671 | Macadamia integrifolia 60698 | AAG|GTATACCGAA...TTTTCTTTATTC/GTTTTCTTTATT...GGAAG|GTC | 0 | 1 | 24.512 |
| 109622339 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_042646780.1 20429258 | 15 | 12468239 | 12468324 | Macadamia integrifolia 60698 | GAG|GTCTGGATTA...ATTTGCTTGATG/TTGATATTTATA...CTTAG|GTG | 0 | 1 | 28.624 |
| 109622340 | GT-AG | 0 | 1.073028838791684e-05 | 92 | rna-XM_042646780.1 20429258 | 16 | 12468066 | 12468157 | Macadamia integrifolia 60698 | AAG|GTGCTTTAAT...TGTACCTTATCT/GTGTACCTTATC...TACAG|TTG | 0 | 1 | 30.047 |
| 109622341 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_042646780.1 20429258 | 17 | 12467856 | 12467945 | Macadamia integrifolia 60698 | CAG|GTACATACTT...AATGACTTTGTG/AGTCTACTAAAG...TACAG|ATA | 0 | 1 | 32.156 |
| 109622342 | GT-AG | 0 | 1.000000099473604e-05 | 3924 | rna-XM_042646780.1 20429258 | 18 | 12463872 | 12467795 | Macadamia integrifolia 60698 | CAG|GTGTGTGATT...TCATTCTTATTG/ATCATTCTTATT...TGCAG|CTT | 0 | 1 | 33.21 |
| 109622343 | GT-AG | 0 | 0.0386726388329896 | 1453 | rna-XM_042646780.1 20429258 | 19 | 12462291 | 12463743 | Macadamia integrifolia 60698 | CTT|GTACGTTTCA...GTTCTTTTGACC/GTTCTTTTGACC...TGCAG|GTG | 2 | 1 | 35.459 |
| 109622344 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_042646780.1 20429258 | 20 | 12462111 | 12462193 | Macadamia integrifolia 60698 | CAG|GTGTTTAAAA...GTGCCATTATCA/TGCATTCTAATT...TACAG|GAG | 0 | 1 | 37.164 |
| 109622345 | GT-AG | 0 | 1.000000099473604e-05 | 2937 | rna-XM_042646780.1 20429258 | 21 | 12459045 | 12461981 | Macadamia integrifolia 60698 | AAG|GTGCTTGCTG...AAGGCTTTAAAA/TAAAATTTAATT...TGCAG|GCT | 0 | 1 | 39.431 |
| 109622346 | GT-AG | 0 | 1.000000099473604e-05 | 98 | rna-XM_042646780.1 20429258 | 22 | 12458843 | 12458940 | Macadamia integrifolia 60698 | TGT|GTGAGTTAGA...AATGATTTAATG/ATGTAGCTAATT...TTCAG|GGT | 2 | 1 | 41.258 |
| 109622347 | GT-AG | 0 | 0.0001112571029989 | 677 | rna-XM_042646780.1 20429258 | 23 | 12457985 | 12458661 | Macadamia integrifolia 60698 | GAG|GTATGGTGTT...GTTATTTTACTG/CGTTATTTTACT...TCCAG|GTT | 0 | 1 | 44.439 |
| 109622348 | GT-AG | 0 | 1.000000099473604e-05 | 204 | rna-XM_042646780.1 20429258 | 24 | 12457602 | 12457805 | Macadamia integrifolia 60698 | CAA|GTAATACTCT...TATATTTTATTA/TATTATTTAATT...TACAG|GTG | 2 | 1 | 47.584 |
| 109622349 | GT-AG | 0 | 0.0142476638781556 | 135 | rna-XM_042646780.1 20429258 | 25 | 12456860 | 12456994 | Macadamia integrifolia 60698 | AAG|GTACTCTTCA...ACCTTTTTATTA/TTGTACTTCACT...TTCAG|AGT | 0 | 1 | 58.25 |
| 109622350 | GT-AG | 0 | 1.000000099473604e-05 | 3676 | rna-XM_042646780.1 20429258 | 26 | 12453079 | 12456754 | Macadamia integrifolia 60698 | AAG|GTGAGTTTTT...CTGGCTTTGATA/TTTTCTTTTACT...GAAAG|GTA | 0 | 1 | 60.095 |
| 109622351 | GT-AG | 0 | 1.000000099473604e-05 | 351 | rna-XM_042646780.1 20429258 | 27 | 12452572 | 12452922 | Macadamia integrifolia 60698 | CGG|GTAAGAGATG...CTTGTCTTATAT/ACTTGTCTTATA...TATAG|GTA | 0 | 1 | 62.836 |
| 109622352 | GT-AG | 0 | 1.807356775599748e-05 | 137 | rna-XM_042646780.1 20429258 | 28 | 12452354 | 12452490 | Macadamia integrifolia 60698 | AAG|GTAATTTTAA...CATCCCTTCAAA/ATCAATTTTACA...TTCAG|AGT | 0 | 1 | 64.259 |
| 109622353 | GT-AG | 0 | 1.646590861668608e-05 | 2598 | rna-XM_042646780.1 20429258 | 29 | 12449291 | 12451888 | Macadamia integrifolia 60698 | GAG|GTAGGTCATC...AATTTCTTAGTT/ATGTGATTAATT...GGCAG|TTG | 0 | 1 | 72.43 |
| 109622354 | GT-AG | 0 | 0.0232256527131332 | 3855 | rna-XM_042646780.1 20429258 | 30 | 12444233 | 12448087 | Macadamia integrifolia 60698 | AAG|GTATGCTATG...TTGGTTTTAATC/TTGGTTTTAATC...TGTAG|GTG | 0 | 1 | 93.569 |
| 109622355 | GT-AG | 1 | 99.18739637441185 | 133 | rna-XM_042646780.1 20429258 | 31 | 12443804 | 12443936 | Macadamia integrifolia 60698 | TCA|GTATCCTTAA...TTATCCTTGACC/TTGCAGCTAATT...TTTAG|GAT | 2 | 1 | 98.77 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);