introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, 1 if the intron is a minor intron, 0 otherwise
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, 1 if the intron is within the coding sequence, 0 otherwise (e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
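The sample rows on this page suggest that `start` and `end` are inclusive coordinates, so that `length` equals `end - start + 1`. This is an observation from the rows shown here, not a documented guarantee. A minimal Python sketch of that check, using three rows copied from the table on this page (the helper name `inclusive_length` is hypothetical):

```python
# (id, start, end, length) copied from three rows of this page's table.
ROWS = [
    (7217469, 3004372, 3009354, 4983),
    (7217471, 3001721, 3001834, 114),
    (7218059, 3009406, 3010386, 981),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a span whose start and end positions are both included."""
    return end - start + 1


# Collect the ids of any rows whose stored length disagrees with the coordinates.
mismatches = [
    intron_id
    for intron_id, start, end, length in ROWS
    if inclusive_length(start, end) != length
]
print(mismatches)
```

An empty list means the stored `length` agrees with the coordinates for every sampled row.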
24 rows where transcript_id = 1341749
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7217469 | GT-AG | 0 | 0.0090726877422832 | 4983 | rna-XM_011630541.2 1341749 | 2 | 3004372 | 3009354 | Amborella trichopoda 13333 | GAG|GTACCATTAG...TTCCTATTGATT/TTCCTATTGATT...ACTAG|AAT | 0 | 1 | 6.656 |
| 7217470 | GT-AG | 0 | 0.0009958575499306 | 2137 | rna-XM_011630541.2 1341749 | 3 | 3002048 | 3004184 | Amborella trichopoda 13333 | ATG|GTACGTTTTT...TATTCTGTATCC/GTGATATTTATT...TGCAG|AAC | 1 | 1 | 12.446 |
| 7217471 | GT-AG | 0 | 0.0388822968046383 | 114 | rna-XM_011630541.2 1341749 | 4 | 3001721 | 3001834 | Amborella trichopoda 13333 | AAG|GTTTCTAGGA...CTTTCTTTGATT/CTTTGATTTATT...CCCAG|CTA | 1 | 1 | 19.04 |
| 7217472 | GT-AG | 0 | 3.901704852748791e-05 | 165 | rna-XM_011630541.2 1341749 | 5 | 3001416 | 3001580 | Amborella trichopoda 13333 | CAG|GTACTATACA...CATTTTTTATTT/GCATTTTTTATT...CACAG|ATG | 0 | 1 | 23.375 |
| 7217473 | GT-AG | 0 | 4.121192582528524e-05 | 2064 | rna-XM_011630541.2 1341749 | 6 | 2999256 | 3001319 | Amborella trichopoda 13333 | GAT|GTAAGTTTGC...AGATTCTAAGTG/TAAGTGTTTATT...TGTAG|GTT | 0 | 1 | 26.347 |
| 7217474 | GC-AG | 0 | 1.000000099473604e-05 | 406 | rna-XM_011630541.2 1341749 | 7 | 2998782 | 2999187 | Amborella trichopoda 13333 | CAG|GCAAGAGCTT...ATTTTTTCAATC/AATTTTTTCAAT...TGCAG|CGA | 2 | 1 | 28.452 |
| 7217475 | GT-AG | 0 | 4.566098423729214e-05 | 282 | rna-XM_011630541.2 1341749 | 8 | 2998398 | 2998679 | Amborella trichopoda 13333 | CAG|GTAATCTCTT...GCTTCCCTCATG/GCTTCCCTCATG...TCAAG|CAA | 2 | 1 | 31.61 |
| 7217476 | GT-AG | 0 | 1.978912580169535e-05 | 13460 | rna-XM_011630541.2 1341749 | 9 | 2984799 | 2998258 | Amborella trichopoda 13333 | AAG|GTATGGATGT...GTCTCTATGATA/TCTATGATAACC...TTCAG|GTT | 0 | 1 | 35.913 |
| 7217477 | GT-AG | 0 | 1.8590305387177965e-05 | 108 | rna-XM_011630541.2 1341749 | 10 | 2984547 | 2984654 | Amborella trichopoda 13333 | GAG|GTTTACGAGA...CGATTTTTAAGT/CTTCTGTTTATT...TGCAG|GTT | 0 | 1 | 40.372 |
| 7217478 | GT-AG | 0 | 0.0015908928632967 | 929 | rna-XM_011630541.2 1341749 | 11 | 2983498 | 2984426 | Amborella trichopoda 13333 | CAG|GTAACTGCTT...TTCCTCTTAACA/ATGCATTTGATT...TACAG|GCT | 0 | 1 | 44.087 |
| 7217479 | GT-AG | 0 | 1.000000099473604e-05 | 1581 | rna-XM_011630541.2 1341749 | 12 | 2981860 | 2983440 | Amborella trichopoda 13333 | AAG|GTAAGGTGTT...CATACTTTATTA/TTTTCTTTCATA...TGCAG|GTG | 0 | 1 | 45.851 |
| 7217480 | GT-AG | 0 | 1.000000099473604e-05 | 148 | rna-XM_011630541.2 1341749 | 13 | 2981652 | 2981799 | Amborella trichopoda 13333 | CAG|GTGAATATAA...GTTTTCTCATTG/CGTTTTCTCATT...TATAG|GAT | 0 | 1 | 47.709 |
| 7217481 | GT-AG | 0 | 1.000000099473604e-05 | 2721 | rna-XM_011630541.2 1341749 | 14 | 2978823 | 2981543 | Amborella trichopoda 13333 | AAG|GTTGGGCCTC...CATTTTGTAATT/CATTTTGTAATT...TTTAG|ATT | 0 | 1 | 51.053 |
| 7217482 | GT-AG | 0 | 0.0002871858397671 | 620 | rna-XM_011630541.2 1341749 | 15 | 2978042 | 2978661 | Amborella trichopoda 13333 | AAG|GTAATCTTAC...GACATTTTGAAG/GATCTTTTAAGA...TGCAG|TTA | 2 | 1 | 56.037 |
| 7217483 | GT-AG | 0 | 9.964800218139716e-05 | 812 | rna-XM_011630541.2 1341749 | 16 | 2977151 | 2977962 | Amborella trichopoda 13333 | CAG|GTATGATACC...ACTTTCTTGTTT/CATTTTGTCAAT...ACTAG|CAA | 0 | 1 | 58.483 |
| 7217484 | GT-AG | 0 | 1.000000099473604e-05 | 181 | rna-XM_011630541.2 1341749 | 17 | 2976907 | 2977087 | Amborella trichopoda 13333 | CAG|GTGTGTGATT...ATTACTTTGATC/CATTTATTGATA...TGCAG|TAC | 0 | 1 | 60.433 |
| 7217485 | GT-AG | 0 | 0.7281635448571669 | 251 | rna-XM_011630541.2 1341749 | 18 | 2976525 | 2976775 | Amborella trichopoda 13333 | GAA|GTAACTTTTT...GTAACTTTGATC/GATCGTTTCACT...TTCAG|GAA | 2 | 1 | 64.489 |
| 7217486 | GT-AG | 0 | 1.000000099473604e-05 | 525 | rna-XM_011630541.2 1341749 | 19 | 2975894 | 2976418 | Amborella trichopoda 13333 | CAG|GTGGGTTGGA...TGTACTTTGAGA/TAGAGGCTAATG...GACAG|GAG | 0 | 1 | 67.771 |
| 7217487 | GT-AG | 0 | 0.0003305294586747 | 142 | rna-XM_011630541.2 1341749 | 21 | 2974410 | 2974551 | Amborella trichopoda 13333 | AAA|GTTTGTTCCT...GAAACCTTCACA/ACATTTCTGATT...CCTAG|GTA | 0 | 1 | 73.808 |
| 7217488 | GT-AG | 0 | 1.000000099473604e-05 | 986 | rna-XM_011630541.2 1341749 | 22 | 2973196 | 2974181 | Amborella trichopoda 13333 | CGG|GTGAGCTTTT...TTTTTTGTAAAG/TATGCATTCATT...TGTAG|GAA | 0 | 1 | 80.867 |
| 7217489 | GT-AG | 0 | 1.7480884609210057e-05 | 666 | rna-XM_011630541.2 1341749 | 23 | 2972374 | 2973039 | Amborella trichopoda 13333 | GGG|GTATTGGGCA...TTGTTTCTAACA/TTGTTTCTAACA...TGAAG|GGA | 0 | 1 | 85.697 |
| 7217490 | GT-AG | 0 | 1.000000099473604e-05 | 461 | rna-XM_011630541.2 1341749 | 24 | 2971754 | 2972214 | Amborella trichopoda 13333 | CAG|GTAATTGCCA...ATTTCTTTTGTT/CTAAAACTAAAC...GGCAG|GCA | 0 | 1 | 90.619 |
| 7217491 | GT-AG | 0 | 1.000000099473604e-05 | 1147 | rna-XM_011630541.2 1341749 | 25 | 2970475 | 2971621 | Amborella trichopoda 13333 | AAG|GTTCATAACT...AATATCATGATT/CTCTCTCTCACA...TGTAG|CTC | 0 | 1 | 94.706 |
| 7218059 | GT-AG | 0 | 0.0001537876211421 | 981 | rna-XM_011630541.2 1341749 | 1 | 3009406 | 3010386 | Amborella trichopoda 13333 | GAA|GTAAGTTTCG...CTACTCTTGTTT/AAATATTTAAGC...TGCAG|AAC |  | 0 | 6.471 |
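The view reports 24 rows, yet `ordinal_index` runs from 1 to 25, so one ordinal is absent from the table. The page does not say why. A small Python sketch for spotting such gaps, with the ordinals copied from the rows above in display order (the function name `missing_ordinals` is hypothetical):

```python
# ordinal_index values copied from the table rows, in the order displayed.
ordinals = [
    2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13,
    14, 15, 16, 17, 18, 19, 21, 22, 23, 24, 25, 1,
]


def missing_ordinals(observed: list[int]) -> list[int]:
    """Return the ordinals between 1 and the largest observed value that never occur."""
    present = set(observed)
    return [i for i in range(1, max(present) + 1) if i not in present]


print(len(ordinals))               # → 24
print(missing_ordinals(ordinals))  # → [20]
```

The sketch only reports which ordinal is absent. Whether intron 20 was filtered out or never annotated cannot be determined from this page.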
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
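To try the schema locally, the following Python sketch loads it into an in-memory SQLite database with the standard-library `sqlite3` module and repeats this page's filter, `transcript_id = 1341749`. The `transcripts` and `genomes` tables are not shown on this page, so the single-column stubs below are assumptions that exist only to satisfy the foreign keys. Only one of the indexes is created, and the two inserted rows are copied from the table above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves FK enforcement off by default
conn.executescript("""
-- Hypothetical stubs: the real parent tables are not shown on this page.
CREATE TABLE transcripts (id INTEGER PRIMARY KEY);
CREATE TABLE genomes (taxonomy_id INTEGER PRIMARY KEY);

CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
  "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
  "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT,
  "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
""")

# Parent rows must exist before the introns that reference them.
conn.execute("INSERT INTO transcripts (id) VALUES (1341749)")
conn.execute("INSERT INTO genomes (taxonomy_id) VALUES (13333)")

# Two rows copied from the table on this page.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    [
        (7217469, "GT-AG", 0, 0.0090726877422832, 4983, 1341749, 2,
         3004372, 3009354, 13333,
         "GAG|GTACCATTAG...TTCCTATTGATT/TTCCTATTGATT...ACTAG|AAT", 0, 1, 6.656),
        (7217470, "GT-AG", 0, 0.0009958575499306, 2137, 1341749, 3,
         3002048, 3004184, 13333,
         "ATG|GTACGTTTTT...TATTCTGTATCC/GTGATATTTATT...TGCAG|AAC", 1, 1, 12.446),
    ],
)

query = """
SELECT ordinal_index, length FROM introns
WHERE transcript_id = ? ORDER BY ordinal_index
"""
rows = conn.execute(query, (1341749,)).fetchall()
print(rows)

# Ask the planner whether the transcript_id index serves the filter.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (1341749,)).fetchall()
uses_index = any("idx_introns_transcript_id" in step[-1] for step in plan)
print(uses_index)
```

The exact wording of `EXPLAIN QUERY PLAN` output varies between SQLite versions, which is why the sketch only looks for the index name in the plan text.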