introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
36 rows where transcript_id = 10378392
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 57022080 | GT-AG | 0 | 1.000000099473604e-05 | 1610 | rna-XM_025134491.1 10378392 | 1 | 3229296 | 3230905 | Cynara cardunculus 4265 | TCG|GTAAATCAAC...TTTTCTTTGTTT/TGTTTACTTACA...AACAG|GAT | 2 | 1 | 4.263 |
| 57022081 | GT-AG | 0 | 1.8234876936133728 | 142 | rna-XM_025134491.1 10378392 | 2 | 3229090 | 3229231 | Cynara cardunculus 4265 | ATG|GTATCTTCTT...TATGTCTTTATG/TGCTTGTTGATT...TTTAG|GCC | 0 | 1 | 5.434 |
| 57022082 | GT-AG | 0 | 1.000000099473604e-05 | 1529 | rna-XM_025134491.1 10378392 | 3 | 3227476 | 3229004 | Cynara cardunculus 4265 | CAG|GTGCGTAGAA...TGAGTTTTAGTT/TTTACTCTGACT...GGCAG|GCG | 1 | 1 | 6.989 |
| 57022083 | GT-AG | 0 | 1.000000099473604e-05 | 156 | rna-XM_025134491.1 10378392 | 4 | 3227033 | 3227188 | Cynara cardunculus 4265 | CAG|GTGTGTATAA...TTCCTGTTAATT/TTCCTGTTAATT...GCTAG|CCA | 0 | 1 | 12.239 |
| 57022084 | GT-AG | 0 | 6.260649472649597e-05 | 106 | rna-XM_025134491.1 10378392 | 5 | 3226821 | 3226926 | Cynara cardunculus 4265 | AAG|GTTTTCACTT...TTGCTTTTCATT/TTGCTTTTCATT...TTCAG|GAA | 1 | 1 | 14.179 |
| 57022085 | GT-AG | 0 | 1.000000099473604e-05 | 1828 | rna-XM_025134491.1 10378392 | 6 | 3224844 | 3226671 | Cynara cardunculus 4265 | GAG|GTGACACATT...GTTTCTTTGAAA/TTTGAAATGATT...TGCAG|GTC | 0 | 1 | 16.905 |
| 57022086 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_025134491.1 10378392 | 7 | 3224551 | 3224655 | Cynara cardunculus 4265 | AAG|GTAGGGGTTG...TCGTTGTTATTT/CATTGCTTCACT...TCCAG|AGT | 2 | 1 | 20.344 |
| 57022087 | GT-AG | 0 | 0.0001609719897623 | 1666 | rna-XM_025134491.1 10378392 | 8 | 3222815 | 3224480 | Cynara cardunculus 4265 | TAT|GTAAGTTGGT...AATATTTTATTT/AAATATTTTATT...ATCAG|TCA | 0 | 1 | 21.625 |
| 57022088 | GT-AG | 0 | 1.000000099473604e-05 | 114 | rna-XM_025134491.1 10378392 | 9 | 3222530 | 3222643 | Cynara cardunculus 4265 | CAG|GTTTGATATC...TCATTTTTATTT/TTACTTCTCATT...TACAG|ATG | 0 | 1 | 24.753 |
| 57022089 | GT-AG | 0 | 0.0123527080013171 | 2518 | rna-XM_025134491.1 10378392 | 10 | 3219524 | 3222041 | Cynara cardunculus 4265 | AAG|GTAACCTCCC...GGAATTTTAGAT/TTAGATCTAATG...TGCAG|GGA | 2 | 1 | 33.681 |
| 57022090 | GT-AG | 0 | 1.000000099473604e-05 | 221 | rna-XM_025134491.1 10378392 | 11 | 3218843 | 3219063 | Cynara cardunculus 4265 | CAT|GTAAGTAATG...ATGTTGTTATCT/TCTATATTTATC...TATAG|GTA | 0 | 1 | 42.097 |
| 57022091 | GT-AG | 0 | 0.0268891414265005 | 315 | rna-XM_025134491.1 10378392 | 12 | 3218189 | 3218503 | Cynara cardunculus 4265 | GAG|GTATTTTTAG...GCTTCCTTCACT/TTTGAACTTACA...TGCAG|GTT | 0 | 1 | 48.299 |
| 57022092 | GT-AG | 0 | 6.489475890229895e-05 | 766 | rna-XM_025134491.1 10378392 | 13 | 3217339 | 3218104 | Cynara cardunculus 4265 | CAG|GTAATCTATT...TTTGTCATAAAA/CGCTTTGTCATA...TGCAG|GCT | 0 | 1 | 49.835 |
| 57022093 | GT-AG | 0 | 1.000000099473604e-05 | 170 | rna-XM_025134491.1 10378392 | 14 | 3217076 | 3217245 | Cynara cardunculus 4265 | GAG|GTGACACCCT...CTACTTTTAAAT/CTACTTTTAAAT...TGCAG|GCA | 0 | 1 | 51.537 |
| 57022094 | GT-AG | 0 | 0.0012847748418309 | 94 | rna-XM_025134491.1 10378392 | 15 | 3216922 | 3217015 | Cynara cardunculus 4265 | ACT|GTAAGCTCGT...TTTTATTTAATT/TTTTATTTAATT...TGTAG|GTT | 0 | 1 | 52.634 |
| 57022095 | GT-AG | 0 | 1.000000099473604e-05 | 764 | rna-XM_025134491.1 10378392 | 16 | 3216059 | 3216822 | Cynara cardunculus 4265 | CAG|GTTAATTGTT...CTAGTCTTACAT/CCTAGTCTTACA...TTCAG|GGA | 0 | 1 | 54.446 |
| 57022096 | GT-AG | 0 | 1.000000099473604e-05 | 926 | rna-XM_025134491.1 10378392 | 17 | 3214992 | 3215917 | Cynara cardunculus 4265 | CAG|GTATGGTGTT...ATATTCTCGGCA/GCAAATCTAATG...TTTAG|GAA | 0 | 1 | 57.025 |
| 57022097 | GT-AG | 0 | 1.000000099473604e-05 | 546 | rna-XM_025134491.1 10378392 | 18 | 3214365 | 3214910 | Cynara cardunculus 4265 | AAG|GTTAGTATCT...TGATGCTTAACA/TTGTTGTTGATG...TGCAG|TCT | 0 | 1 | 58.507 |
| 57022098 | GT-AG | 0 | 1.000000099473604e-05 | 471 | rna-XM_025134491.1 10378392 | 19 | 3213822 | 3214292 | Cynara cardunculus 4265 | CAG|GTAATGCATA...TAATTTTTATTT/TTTTATTTCACC...TTCAG|TCT | 0 | 1 | 59.824 |
| 57022099 | GT-AG | 0 | 1.000000099473604e-05 | 779 | rna-XM_025134491.1 10378392 | 20 | 3212971 | 3213749 | Cynara cardunculus 4265 | CTG|GTGAGATGTT...TCATCTTTACCA/CACTTTCTAAAT...TTCAG|GGT | 0 | 1 | 61.142 |
| 57022100 | GT-AG | 0 | 0.0035717363700198 | 70 | rna-XM_025134491.1 10378392 | 21 | 3212807 | 3212876 | Cynara cardunculus 4265 | TGG|GTATGTTGTT...TGTTGCTTACCA/TTGTTGCTTACC...TGAAG|GTC | 1 | 1 | 62.861 |
| 57022101 | GT-AG | 0 | 0.0023522971943865 | 123 | rna-XM_025134491.1 10378392 | 22 | 3212601 | 3212723 | Cynara cardunculus 4265 | AAT|GTAACTGCTG...GTTGCTTTGGCT/TAGGAGTTCATG...CTCAG|GAA | 0 | 1 | 64.38 |
| 57022102 | GT-AG | 0 | 1.000000099473604e-05 | 109 | rna-XM_025134491.1 10378392 | 23 | 3212435 | 3212543 | Cynara cardunculus 4265 | CAG|GTTGTGCATT...TTTGCTTTTGTG/AGGTTACTCATT...TGCAG|ATG | 0 | 1 | 65.423 |
| 57022103 | GT-AG | 0 | 0.0011917565531687 | 136 | rna-XM_025134491.1 10378392 | 24 | 3212221 | 3212356 | Cynara cardunculus 4265 | AAG|GTAACTCTTT...TTTGCATTATTG/AAAATTCTCATT...TGCAG|ACT | 0 | 1 | 66.85 |
| 57022104 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-XM_025134491.1 10378392 | 25 | 3212023 | 3212098 | Cynara cardunculus 4265 | CAG|GTAGTAGTAT...TCTACTTTAACA/TCTACTTTAACA...TTCAG|GAT | 2 | 1 | 69.082 |
| 57022105 | GT-AG | 0 | 9.455212250357738e-05 | 332 | rna-XM_025134491.1 10378392 | 26 | 3211530 | 3211861 | Cynara cardunculus 4265 | TTG|GTATGTGATC...TATCCATTATAT/ATTTCAGTTATC...TCCAG|GGT | 1 | 1 | 72.027 |
| 57022106 | GT-AG | 0 | 0.0005672029114831 | 76 | rna-XM_025134491.1 10378392 | 27 | 3211377 | 3211452 | Cynara cardunculus 4265 | GAG|GTATGACTCT...TTCTTCTTATTT/TCTGTATTCATT...CGCAG|ATA | 0 | 1 | 73.436 |
| 57022107 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-XM_025134491.1 10378392 | 28 | 3211170 | 3211262 | Cynara cardunculus 4265 | GTG|GTAAGTGCAA...TTTTTCTTGTCA/TTGTCACTGATT...ATCAG|GTT | 0 | 1 | 75.521 |
| 57022108 | GT-AG | 0 | 0.0021701716169401 | 169 | rna-XM_025134491.1 10378392 | 29 | 3210899 | 3211067 | Cynara cardunculus 4265 | GCT|GTAAGTTCTC...TATATCTTAATC/TATATCTTAATC...TTCAG|GTG | 0 | 1 | 77.387 |
| 57022109 | GT-AG | 0 | 0.0004347712978094 | 1164 | rna-XM_025134491.1 10378392 | 30 | 3209594 | 3210757 | Cynara cardunculus 4265 | CAG|GTTCTCTCTC...TTCTCCATAACC/GTATTTTCCATC...TACAG|GCA | 0 | 1 | 79.967 |
| 57022110 | GT-AG | 0 | 0.0001215160024165 | 6088 | rna-XM_025134491.1 10378392 | 31 | 3203332 | 3209419 | Cynara cardunculus 4265 | CTT|GTAAGTTGCA...ATTTTCCTGATA/ATTTTCCTGATA...TGCAG|TTA | 0 | 1 | 83.15 |
| 57022111 | GT-AG | 0 | 1.000000099473604e-05 | 74 | rna-XM_025134491.1 10378392 | 32 | 3203194 | 3203267 | Cynara cardunculus 4265 | AAG|GTTGGTACAC...CATGTTTTGAAT/CATGTTTTGAAT...TACAG|AGA | 1 | 1 | 84.321 |
| 57022112 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_025134491.1 10378392 | 33 | 3202971 | 3203053 | Cynara cardunculus 4265 | AAG|GTCTGAGCTT...GTCTCTTTCACA/ACTGAATTCACA...TATAG|GTG | 0 | 1 | 86.883 |
| 57022113 | GT-AG | 0 | 0.0033214881635055 | 735 | rna-XM_025134491.1 10378392 | 34 | 3202084 | 3202818 | Cynara cardunculus 4265 | CAA|GTATGTATGC...TGATTATTAGCA/CAAGTGATTATT...TGCAG|ATC | 2 | 1 | 89.663 |
| 57022114 | GT-AG | 0 | 3.4694804343047743e-05 | 187 | rna-XM_025134491.1 10378392 | 35 | 3201818 | 3202004 | Cynara cardunculus 4265 | CAG|GTATATGGTT...TGTTCAGTGATA/TGCATGTTCAGT...TGCAG|GTC | 0 | 1 | 91.109 |
| 57022115 | GT-AG | 0 | 1.000000099473604e-05 | 689 | rna-XM_025134491.1 10378392 | 36 | 3201030 | 3201718 | Cynara cardunculus 4265 | AAG|GTAATAGTTA...TTTTTCTTCCTG/TATTTGGTGACA...TGCAG|CTT | 0 | 1 | 92.92 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);