introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
37 rows where transcript_id = 14424048
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 77101736 | GT-AG | 0 | 8.253200734223339e-05 | 104 | rna-XM_006412319.2 14424048 | 1 | 2844259 | 2844362 | Eutrema salsugineum 72664 | GAG|GTTTGATTAA...TATTTCTTAGCC/CTATTTCTTAGC...TTAAG|GTT | 0 | 1 | 3.145 |
| 77101737 | GT-AG | 0 | 1.000000099473604e-05 | 1114 | rna-XM_006412319.2 14424048 | 2 | 2844510 | 2845623 | Eutrema salsugineum 72664 | TAC|GTGAGTTCTT...TCGTTCCTGACT/TCGTTCCTGACT...TGTAG|ACA | 0 | 1 | 6.356 |
| 77101738 | GT-AG | 0 | 1.000000099473604e-05 | 92 | rna-XM_006412319.2 14424048 | 3 | 2845770 | 2845861 | Eutrema salsugineum 72664 | CAG|GTGGTCCACT...TATTCTTTATAA/TTATTCTTTATA...TCCAG|AGC | 2 | 1 | 9.546 |
| 77101739 | GT-AG | 0 | 3.732543809442299e-05 | 175 | rna-XM_006412319.2 14424048 | 4 | 2846019 | 2846193 | Eutrema salsugineum 72664 | GAA|GTAAGATTTT...GTGATCTTATAT/AGTGATCTTATA...TGCAG|TCA | 0 | 1 | 12.975 |
| 77101740 | GT-AG | 0 | 1.000000099473604e-05 | 653 | rna-XM_006412319.2 14424048 | 5 | 2846253 | 2846905 | Eutrema salsugineum 72664 | GAG|GTGGGTTATG...GATTTTTTATTT/TGATTTTTTATT...TTCAG|CCG | 2 | 1 | 14.264 |
| 77101741 | GT-AG | 0 | 5.216041214049808e-05 | 148 | rna-XM_006412319.2 14424048 | 6 | 2847063 | 2847210 | Eutrema salsugineum 72664 | AAT|GTAATTCTCT...ATGTTCTTGTTT/GAATTACTAATC...TATAG|GAC | 0 | 1 | 17.693 |
| 77101742 | GT-AG | 0 | 8.961837203503293e-05 | 90 | rna-XM_006412319.2 14424048 | 7 | 2847361 | 2847450 | Eutrema salsugineum 72664 | CAG|GTCTTTTGAC...CAGTTTTTGAAT/CAGTTTTTGAAT...ATCAG|GAA | 0 | 1 | 20.97 |
| 77101743 | GT-AG | 0 | 1.6209804850297124e-05 | 91 | rna-XM_006412319.2 14424048 | 8 | 2847588 | 2847678 | Eutrema salsugineum 72664 | CAA|GTACGTGGAA...GTCTGTTTATTT/TGTCTGTTTATT...ATCAG|ATG | 2 | 1 | 23.962 |
| 77101744 | GT-AG | 0 | 0.0024878725158278 | 91 | rna-XM_006412319.2 14424048 | 9 | 2847826 | 2847916 | Eutrema salsugineum 72664 | CTG|GTTTGCTTTA...TGTGTTTGAACT/TATTTATTTATG...GTCAG|GTT | 2 | 1 | 27.173 |
| 77101745 | GT-AG | 0 | 1.000000099473604e-05 | 214 | rna-XM_006412319.2 14424048 | 10 | 2848019 | 2848232 | Eutrema salsugineum 72664 | CAG|GTTGGCGATG...AATGTCTAAAAT/CAATGTCTAAAA...TGCAG|TTT | 2 | 1 | 29.401 |
| 77101746 | GT-AG | 0 | 1.000000099473604e-05 | 74 | rna-XM_006412319.2 14424048 | 11 | 2848291 | 2848364 | Eutrema salsugineum 72664 | GAG|GTAATATAGG...CTCTCCTTATGT/TCTCTCCTTATG...TTCAG|CAT | 0 | 1 | 30.668 |
| 77101747 | GT-AG | 0 | 1.000000099473604e-05 | 374 | rna-XM_006412319.2 14424048 | 12 | 2848467 | 2848840 | Eutrema salsugineum 72664 | AAG|GTGCGTTTCC...TTTATCTAAATT/ATTTATCTAAAT...TGCAG|AAA | 0 | 1 | 32.896 |
| 77101748 | GT-AG | 0 | 0.0006829730166915 | 86 | rna-XM_006412319.2 14424048 | 13 | 2848879 | 2848964 | Eutrema salsugineum 72664 | TTG|GTACAATATT...TTGTCTTTGATT/TTGTCTTTGATT...TGCAG|CAT | 2 | 1 | 33.727 |
| 77101749 | GC-AG | 0 | 1.000000099473604e-05 | 75 | rna-XM_006412319.2 14424048 | 14 | 2849092 | 2849166 | Eutrema salsugineum 72664 | AAG|GCAAGTCTTG...GTTTCCTTCAAA/TCAAATCTCAAT...TGCAG|GTT | 0 | 1 | 36.501 |
| 77101750 | GT-AG | 0 | 1.000000099473604e-05 | 129 | rna-XM_006412319.2 14424048 | 15 | 2849341 | 2849469 | Eutrema salsugineum 72664 | AAG|GTCAGCAAAC...TGATCATTAACC/ATTGGTCTGATC...TCTAG|CAA | 0 | 1 | 40.301 |
| 77101751 | GT-AG | 0 | 1.000000099473604e-05 | 574 | rna-XM_006412319.2 14424048 | 16 | 2849602 | 2850175 | Eutrema salsugineum 72664 | GGG|GTTAGTGTTT...GGTATTTTAACT/TTGTATCTCACA...GATAG|GGA | 0 | 1 | 43.185 |
| 77101752 | GT-AG | 0 | 1.000000099473604e-05 | 256 | rna-XM_006412319.2 14424048 | 17 | 2850286 | 2850541 | Eutrema salsugineum 72664 | GAG|GTGCGCATAT...TGAATTTTATCG/TTGTTATTAAAT...TGCAG|CAA | 2 | 1 | 45.588 |
| 77101753 | GT-AG | 0 | 0.0251494343440673 | 96 | rna-XM_006412319.2 14424048 | 18 | 2850603 | 2850698 | Eutrema salsugineum 72664 | CAG|GTATGCATCT...ATACTTTTAATT/TTTTAATTGATT...CACAG|CTA | 0 | 1 | 46.92 |
| 77101754 | GT-AG | 0 | 0.0004736512485417 | 265 | rna-XM_006412319.2 14424048 | 19 | 2850877 | 2851141 | Eutrema salsugineum 72664 | GAG|GTTCTTTTCT...TCTTTCTTATCT/TTCTTTCTTATC...GCAAG|GAT | 1 | 1 | 50.808 |
| 77101755 | GT-AG | 0 | 0.0035824315984317 | 181 | rna-XM_006412319.2 14424048 | 20 | 2851348 | 2851528 | Eutrema salsugineum 72664 | CAG|GTAACCTAGG...CAGTCTCTAATG/ATGCATTTCATT...CTTAG|GCT | 0 | 1 | 55.308 |
| 77101756 | GT-AG | 0 | 1.6210803736630726 | 96 | rna-XM_006412319.2 14424048 | 21 | 2851649 | 2851744 | Eutrema salsugineum 72664 | CAG|GTATCTTTTA...TGCACATTGATG/CTATCATTCAAA...TGCAG|GCT | 0 | 1 | 57.929 |
| 77101757 | GT-AG | 0 | 0.0119204958294771 | 79 | rna-XM_006412319.2 14424048 | 22 | 2851844 | 2851922 | Eutrema salsugineum 72664 | CGA|GTACGTTGCC...GATGTCTTGATA/GTTGTATTGATG...TACAG|ATA | 0 | 1 | 60.092 |
| 77101758 | GT-AG | 0 | 1.000000099473604e-05 | 536 | rna-XM_006412319.2 14424048 | 23 | 2852145 | 2852680 | Eutrema salsugineum 72664 | AAG|GTAATTTTCC...TTTCTGCTAACG/ATGTTGTTCATG...CACAG|AAT | 0 | 1 | 64.941 |
| 77101759 | GT-AG | 0 | 2.7672801158958804e-05 | 96 | rna-XM_006412319.2 14424048 | 24 | 2852821 | 2852916 | Eutrema salsugineum 72664 | AAG|GTTTGTAGTA...TTTTTCTTACAA/TTTTTTCTTACA...TTCAG|CCT | 2 | 1 | 67.999 |
| 77101760 | GT-AG | 0 | 0.000590088414448 | 181 | rna-XM_006412319.2 14424048 | 25 | 2853017 | 2853197 | Eutrema salsugineum 72664 | GAA|GTACGTAATC...TGGTCTTTACTT/TACTTTTTTACC...AACAG|AAA | 0 | 1 | 70.183 |
| 77101761 | GT-AG | 0 | 1.5309880098365725e-05 | 124 | rna-XM_006412319.2 14424048 | 26 | 2853249 | 2853372 | Eutrema salsugineum 72664 | TTT|GTAAGTAACT...AAGTCATTAATA/ACTTAAGTCATT...TACAG|GAG | 0 | 1 | 71.298 |
| 77101762 | GT-AG | 0 | 1.1264451952212736e-05 | 219 | rna-XM_006412319.2 14424048 | 27 | 2853463 | 2853681 | Eutrema salsugineum 72664 | CTG|GTACAATTGA...TGGATTGTGACA/TGGATTGTGACA...TGCAG|GAG | 0 | 1 | 73.263 |
| 77101763 | GT-AG | 0 | 1.000000099473604e-05 | 153 | rna-XM_006412319.2 14424048 | 28 | 2853856 | 2854008 | Eutrema salsugineum 72664 | AAG|GTGATTCTTA...TGTGCTATATAT/CAGGTAATAATT...TTCAG|GGT | 0 | 1 | 77.064 |
| 77101764 | GT-AG | 0 | 1.000000099473604e-05 | 173 | rna-XM_006412319.2 14424048 | 29 | 2854144 | 2854316 | Eutrema salsugineum 72664 | TAT|GTGAGTTCAA...TGTTTCTTGTGT/TGTAAACTAATA...ACCAG|GGA | 0 | 1 | 80.013 |
| 77101765 | GT-AG | 0 | 1.000000099473604e-05 | 390 | rna-XM_006412319.2 14424048 | 30 | 2854494 | 2854883 | Eutrema salsugineum 72664 | CAG|GTGAGAGTTC...GTTTCTTTAATG/TCTTTAATGATT...TCAAG|GCG | 0 | 1 | 83.879 |
| 77101766 | GT-AG | 0 | 0.0006657193667121 | 274 | rna-XM_006412319.2 14424048 | 31 | 2855022 | 2855295 | Eutrema salsugineum 72664 | CAT|GTATGTACAT...TTTTTCTAACCT/TTTTTTCTAACC...GGTAG|GTA | 0 | 1 | 86.894 |
| 77101767 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_006412319.2 14424048 | 32 | 2855367 | 2855452 | Eutrema salsugineum 72664 | CAG|GTAGATGTGA...GTGTTCTGCATA/GCATATATAATC...TTCAG|TCT | 2 | 1 | 88.445 |
| 77101768 | GT-AG | 0 | 0.0010543664404085 | 74 | rna-XM_006412319.2 14424048 | 33 | 2855553 | 2855626 | Eutrema salsugineum 72664 | GAG|GTACACCATG...GATGCTTTATAG/TGTTATCTAAAA...TACAG|TTT | 0 | 1 | 90.629 |
| 77101769 | GT-AG | 0 | 1.000000099473604e-05 | 108 | rna-XM_006412319.2 14424048 | 34 | 2855684 | 2855791 | Eutrema salsugineum 72664 | TTG|GTAAAATGCA...TTGCATTTAATT/TTGCATTTAATT...CCTAG|GTT | 0 | 1 | 91.874 |
| 77101770 | GT-AG | 0 | 0.0002627821845542 | 101 | rna-XM_006412319.2 14424048 | 35 | 2855849 | 2855949 | Eutrema salsugineum 72664 | CCG|GTACGTTATT...TCTATTTTATTC/CTCTATTTTATT...TATAG|GCA | 0 | 1 | 93.119 |
| 77101771 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_006412319.2 14424048 | 36 | 2856031 | 2856116 | Eutrema salsugineum 72664 | GAG|GTGAGTTAAT...ACAACCTTGATA/TGAATATTGAAA...TGTAG|GTG | 0 | 1 | 94.889 |
| 77101772 | GT-AG | 0 | 0.0001301799651878 | 362 | rna-XM_006412319.2 14424048 | 37 | 2856200 | 2856561 | Eutrema salsugineum 72664 | GAG|GTACTATTGT...GTTCTCTAAATT/TATAGACTAATG...TGCAG|TAT | 2 | 1 | 96.702 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);