introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
34 rows where transcript_id = 14424038
This data as json, CSV (advanced)
Suggested facets: score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 77101642 | GT-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_024149398.1 14424038 | 1 | 9985481 | 9985571 | Eutrema salsugineum 72664 | GAG|GTAATGGTCG...TTCTTCTTCTTC/TAGAAATTGAAA...AACAG|GAC | 0 | 1 | 2.867 |
| 77101643 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_024149398.1 14424038 | 2 | 9985666 | 9985755 | Eutrema salsugineum 72664 | CTG|GTGAGTTTCA...GTGTCTTTGAGA/AATTGATTAATT...TGCAG|ATA | 1 | 1 | 4.594 |
| 77101644 | GT-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_024149398.1 14424038 | 3 | 9985848 | 9985938 | Eutrema salsugineum 72664 | AAG|GTTAAATTTT...CGGTTTTTAATT/CTTTGTTTGATT...ACCAG|ATT | 0 | 1 | 6.284 |
| 77101645 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-XM_024149398.1 14424038 | 4 | 9986406 | 9986492 | Eutrema salsugineum 72664 | AAG|GTGAGGCTAA...TTTTTCTTATAT/GTTTTTCTTATA...TCCAG|GAA | 2 | 1 | 14.866 |
| 77101646 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_024149398.1 14424038 | 5 | 9986704 | 9986793 | Eutrema salsugineum 72664 | TTG|GTATGGCAAA...ATTTCCACAATA/AATAGGATCATC...TTCAG|GTA | 0 | 1 | 18.743 |
| 77101647 | GT-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_024149398.1 14424038 | 6 | 9987094 | 9987194 | Eutrema salsugineum 72664 | CAG|GTCAGGTTTT...TATTTTTTGATT/TATTTTTTGATT...TGCAG|ATT | 0 | 1 | 24.256 |
| 77101648 | GT-AG | 0 | 0.0001884545696808 | 220 | rna-XM_024149398.1 14424038 | 7 | 9987358 | 9987577 | Eutrema salsugineum 72664 | TAG|GTACTGTTTT...TCTATTTTAAAT/CTTTTGCTCATT...TGTAG|TCA | 1 | 1 | 27.251 |
| 77101649 | GT-AG | 0 | 1.000000099473604e-05 | 121 | rna-XM_024149398.1 14424038 | 8 | 9987716 | 9987836 | Eutrema salsugineum 72664 | TCG|GTAAGTGGTT...TTATTCTTGCTT/TTCTTGCTTATT...CCCAG|GGC | 1 | 1 | 29.787 |
| 77101650 | GT-AG | 0 | 9.519307566681774e-05 | 138 | rna-XM_024149398.1 14424038 | 9 | 9987919 | 9988056 | Eutrema salsugineum 72664 | AAG|GTAGGCTGTT...AAAGTTTTATCA/TGTTTACTCATT...CATAG|GAA | 2 | 1 | 31.294 |
| 77101651 | GT-AG | 0 | 0.0001420027208227 | 127 | rna-XM_024149398.1 14424038 | 10 | 9988341 | 9988467 | Eutrema salsugineum 72664 | GAG|GTAGCTGATC...TCATTCTTTGTA/CTAAAACTCATT...TTTAG|TTT | 1 | 1 | 36.512 |
| 77101652 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-XM_024149398.1 14424038 | 11 | 9988582 | 9988657 | Eutrema salsugineum 72664 | ATG|GTGAGCGGGA...TATGCTTTCTTC/CATCGTCTAACT...GTCAG|GTT | 1 | 1 | 38.607 |
| 77101653 | GT-AG | 0 | 1.000000099473604e-05 | 118 | rna-XM_024149398.1 14424038 | 12 | 9988903 | 9989020 | Eutrema salsugineum 72664 | GTG|GTTAGTGGTC...ATCCCCTTGATC/GCAAATTTGACA...TGCAG|TTT | 0 | 1 | 43.109 |
| 77101654 | GT-AG | 0 | 1.97187470599225e-05 | 113 | rna-XM_024149398.1 14424038 | 13 | 9989270 | 9989382 | Eutrema salsugineum 72664 | CTG|GTAAACAGAA...TCTTTCTTGGCT/ACTTGGCTTACT...TACAG|AAG | 0 | 1 | 47.685 |
| 77101655 | GT-AG | 0 | 0.0047190690890064 | 122 | rna-XM_024149398.1 14424038 | 14 | 9989539 | 9989660 | Eutrema salsugineum 72664 | GCT|GTATGATATC...TGTGCTTTATGA/CTGTGCTTTATG...AGCAG|GTA | 0 | 1 | 50.551 |
| 77101656 | GT-AG | 0 | 1.000000099473604e-05 | 122 | rna-XM_024149398.1 14424038 | 15 | 9989802 | 9989923 | Eutrema salsugineum 72664 | GAG|GTAATGCCAT...TGTATCTTCTCT/ACTGTACTAATT...ACCAG|GCT | 0 | 1 | 53.142 |
| 77101657 | GT-AG | 0 | 1.000000099473604e-05 | 158 | rna-XM_024149398.1 14424038 | 16 | 9990164 | 9990321 | Eutrema salsugineum 72664 | AAG|GTAATGCATA...AAATTCTGACCT/AAAATTCTGACC...TTCAG|TGG | 0 | 1 | 57.552 |
| 77101658 | GT-AG | 0 | 1.000000099473604e-05 | 252 | rna-XM_024149398.1 14424038 | 17 | 9990447 | 9990698 | Eutrema salsugineum 72664 | GAG|GTGGGATATG...TTTTTGTTGACC/TTTTTGTTGACC...GGCAG|CCT | 2 | 1 | 59.849 |
| 77101659 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_024149398.1 14424038 | 18 | 9990817 | 9990915 | Eutrema salsugineum 72664 | CAG|GTTGTGTAAT...TATCTTGTGATG/GAATAATTAATA...AGCAG|ACG | 0 | 1 | 62.018 |
| 77101660 | GT-AG | 0 | 0.078993044563372 | 83 | rna-XM_024149398.1 14424038 | 19 | 9991078 | 9991160 | Eutrema salsugineum 72664 | TGG|GTATGCATCT...GTGTCTTTATTG/TGTGTCTTTATT...CACAG|TTT | 0 | 1 | 64.994 |
| 77101661 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_024149398.1 14424038 | 20 | 9991309 | 9991394 | Eutrema salsugineum 72664 | CTG|GTCTGTGATT...AATTGCTTACCG/GAATTGCTTACC...TTTAG|TTG | 1 | 1 | 67.714 |
| 77101662 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_024149398.1 14424038 | 21 | 9991496 | 9991583 | Eutrema salsugineum 72664 | GAA|GTAAGATTCA...AATCATTTGACT/ATTTGACTCAGT...TTCAG|GTT | 0 | 1 | 69.57 |
| 77101663 | GT-AG | 0 | 0.0004276411697723 | 124 | rna-XM_024149398.1 14424038 | 22 | 9991746 | 9991869 | Eutrema salsugineum 72664 | GCG|GTACTTATTG...ATCATCTAAATA/TATGCACTGATC...TGCAG|GTG | 0 | 1 | 72.547 |
| 77101664 | GT-AG | 0 | 1.000000099473604e-05 | 123 | rna-XM_024149398.1 14424038 | 23 | 9991918 | 9992040 | Eutrema salsugineum 72664 | GCT|GTAAGAGTTG...CCACTATTGACT/CCACTATTGACT...AACAG|ATC | 0 | 1 | 73.429 |
| 77101665 | GT-AG | 0 | 0.0001695742717947 | 186 | rna-XM_024149398.1 14424038 | 24 | 9992149 | 9992334 | Eutrema salsugineum 72664 | TAT|GTAAGTTTAC...TTTTTTTTTTCC/GTAAAGCTGATT...TGCAG|AGA | 0 | 1 | 75.413 |
| 77101666 | GT-AG | 0 | 2.1858066915802817e-05 | 111 | rna-XM_024149398.1 14424038 | 25 | 9992410 | 9992520 | Eutrema salsugineum 72664 | TTG|GTAAATAACT...CCTGTCTTAATG/CCTGTCTTAATG...AACAG|GTT | 0 | 1 | 76.792 |
| 77101667 | GT-AG | 0 | 1.000000099473604e-05 | 103 | rna-XM_024149398.1 14424038 | 26 | 9992572 | 9992674 | Eutrema salsugineum 72664 | CAG|GTAAATCATG...CTCCTCTAAATA/AACATATTGATA...CGCAG|CAA | 0 | 1 | 77.729 |
| 77101668 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_024149398.1 14424038 | 27 | 9992822 | 9992911 | Eutrema salsugineum 72664 | CAG|GTTGGTCTTA...TTGGCCTTAAAT/TTGTATATCATT...GGCAG|TTA | 0 | 1 | 80.43 |
| 77101669 | GT-AG | 0 | 1.040254758410454e-05 | 134 | rna-XM_024149398.1 14424038 | 28 | 9993043 | 9993176 | Eutrema salsugineum 72664 | GAG|GTGTGTTGTT...TTTGGTTTATTA/TTTTGGTTTATT...TCTAG|ATT | 2 | 1 | 82.837 |
| 77101670 | GT-AG | 0 | 4.8727573705352935e-05 | 135 | rna-XM_024149398.1 14424038 | 29 | 9993265 | 9993399 | Eutrema salsugineum 72664 | GAG|GTAAACTATG...AATATTGTAATT/AATATTGTAATT...TGCAG|GAA | 0 | 1 | 84.454 |
| 77101671 | GT-AG | 0 | 0.0001224116861858 | 78 | rna-XM_024149398.1 14424038 | 30 | 9993460 | 9993537 | Eutrema salsugineum 72664 | AAT|GTAAGCAACA...TCTGTCTTATCT/CTCTGTCTTATC...TACAG|GAT | 0 | 1 | 85.557 |
| 77101672 | GT-AG | 0 | 2.714871567418595e-05 | 83 | rna-XM_024149398.1 14424038 | 31 | 9993779 | 9993861 | Eutrema salsugineum 72664 | TTG|GTATGAATAT...TGCGCCATACTA/GCCATACTAACA...TTTAG|AGA | 1 | 1 | 89.985 |
| 77101673 | GT-AG | 0 | 0.0004076879011325 | 128 | rna-XM_024149398.1 14424038 | 32 | 9994014 | 9994141 | Eutrema salsugineum 72664 | GAG|GTATAACTAT...ACAGTTTTAACA/ACAGTTTTAACA...TGCAG|ATA | 0 | 1 | 92.778 |
| 77101674 | GT-AG | 0 | 0.0001036156373618 | 257 | rna-XM_024149398.1 14424038 | 33 | 9994213 | 9994469 | Eutrema salsugineum 72664 | CAG|GTATGACTCA...TTTTTCTTGTCT/AGTCCATTTACA...TGCAG|TGT | 2 | 1 | 94.083 |
| 77101675 | GT-AG | 0 | 1.000000099473604e-05 | 104 | rna-XM_024149398.1 14424038 | 34 | 9994627 | 9994730 | Eutrema salsugineum 72664 | ACG|GTAATAAGTT...CTGACTTTACCT/ACGGAACTAACT...TTTAG|GTT | 0 | 1 | 96.968 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);