introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotides of the intron (e.g., GT-AG)
- is_minor: INTEGER, whether the intron is a minor intron (1) or not (0)
- score: REAL, probability that the intron is minor, expressed as a percentage (0-100)
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, whether the intron lies within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
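The column descriptions do not say how length relates to start and end. A minimal sketch, assuming inclusive coordinates (length = end - start + 1), checks that reading against three rows copied from the table on this page. The reduced in-memory table below is a hypothetical stand-in for the real database, not its actual schema.

```python
import sqlite3

# (id, length, ordinal_index, start, end) copied from the first three rows on this page.
ROWS = [
    (77101839, 83, 1, 13180753, 13180835),
    (77101840, 86, 2, 13180243, 13180328),
    (77101841, 72, 3, 13179919, 13179990),
]

conn = sqlite3.connect(":memory:")
# Hypothetical reduced copy of the introns table: only the columns used here.
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "length" INTEGER, '
    '"ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER)'
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", ROWS)

# "end" is an SQL keyword, so the column names are quoted.
mismatches = conn.execute(
    'SELECT COUNT(*) FROM introns WHERE "length" <> "end" - "start" + 1'
).fetchone()[0]

# Collect start coordinates in transcript order to see which way they run.
starts = [
    row[0]
    for row in conn.execute('SELECT "start" FROM introns ORDER BY "ordinal_index"')
]
descending = all(a > b for a, b in zip(starts, starts[1:]))

print(mismatches, descending)  # prints "0 True"
```

For these rows the inclusive-coordinate assumption holds, and start decreases as ordinal_index increases. That pattern would be expected for a transcript annotated on the reverse strand, although the table has no strand column to confirm it.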
30 rows where transcript_id = 14424054
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 77101839 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_024150087.1 14424054 | 1 | 13180753 | 13180835 | Eutrema salsugineum 72664 | TAG\|GTCAGTGTTT...TGTTTCATATCT/ATATATTTCACT...TGTAG\|AGT | 2 | 1 | 7.498 |
| 77101840 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_024150087.1 14424054 | 2 | 13180243 | 13180328 | Eutrema salsugineum 72664 | CAG\|GTGAGAGGTT...TGCATTTTACAT/ATGTTACTGATG...TGAAG\|GCG | 0 | 1 | 17.073 |
| 77101841 | GT-AG | 0 | 7.627333433761374e-05 | 72 | rna-XM_024150087.1 14424054 | 3 | 13179919 | 13179990 | Eutrema salsugineum 72664 | AAT\|GTAAGTTGGT...TTGTCGTTATCT/TCTGTCATCACT...TATAG\|ATA | 0 | 1 | 22.764 |
| 77101842 | GT-AG | 0 | 1.000000099473604e-05 | 91 | rna-XM_024150087.1 14424054 | 4 | 13179615 | 13179705 | Eutrema salsugineum 72664 | CAG\|GTGATCTTCA...GCAAACTTAACT/ACTCGGCTCACT...TCTAG\|GAT | 0 | 1 | 27.575 |
| 77101843 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_024150087.1 14424054 | 5 | 13179395 | 13179482 | Eutrema salsugineum 72664 | GAG\|GTAGGAATAT...TTTTTCTAAGTC/TTTTTTCTAAGT...CGCAG\|ATA | 0 | 1 | 30.556 |
| 77101844 | GT-AG | 0 | 0.0047964465765036 | 81 | rna-XM_024150087.1 14424054 | 6 | 13179184 | 13179264 | Eutrema salsugineum 72664 | ATG\|GTATTTCTGG...TTCTTTTTATAA/TAATTTCTGATT...TATAG\|GTA | 1 | 1 | 33.491 |
| 77101845 | GT-AG | 0 | 1.000000099473604e-05 | 74 | rna-XM_024150087.1 14424054 | 7 | 13179063 | 13179136 | Eutrema salsugineum 72664 | AAG\|GTGGGCATTA...GTATATTTATAT/AGTATATTTATA...TGCAG\|GTC | 0 | 1 | 34.553 |
| 77101846 | GT-AG | 0 | 1.000000099473604e-05 | 85 | rna-XM_024150087.1 14424054 | 8 | 13178876 | 13178960 | Eutrema salsugineum 72664 | CAG\|GTAATGTAAA...TCGGTTTTATAT/CTCGGTTTTATA...GACAG\|GTC | 0 | 1 | 36.856 |
| 77101847 | GT-AG | 0 | 4.37428235906996e-05 | 119 | rna-XM_024150087.1 14424054 | 9 | 13178649 | 13178767 | Eutrema salsugineum 72664 | CAG\|GTAACATCCA...CTACTCATATCC/TTTCTACTCATA...AGCAG\|GTT | 0 | 1 | 39.295 |
| 77101848 | GT-AG | 0 | 1.4262896266642309e-05 | 104 | rna-XM_024150087.1 14424054 | 10 | 13178402 | 13178505 | Eutrema salsugineum 72664 | CTG\|GTAAATATTT...ATCTTTTTAGTC/CTTATACTGATT...TTTAG\|GAT | 2 | 1 | 42.525 |
| 77101849 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_024150087.1 14424054 | 11 | 13178208 | 13178306 | Eutrema salsugineum 72664 | AAG\|GTGTAATACA...CTTTTCTTATCA/TCTTTTCTTATC...TTTAG\|GGG | 1 | 1 | 44.67 |
| 77101850 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_024150087.1 14424054 | 12 | 13178039 | 13178128 | Eutrema salsugineum 72664 | AGG\|GTTGGTTTTA...TTCTCTCTGACA/TTCTCTCTGACA...TGCAG\|GTC | 2 | 1 | 46.454 |
| 77101851 | GT-AG | 0 | 0.0014732395280158 | 81 | rna-XM_024150087.1 14424054 | 13 | 13177909 | 13177989 | Eutrema salsugineum 72664 | ACT\|GTAAGTTTTT...TTTTCTTTTGCT/CTTTTGCTTACA...TTCAG\|GTG | 0 | 1 | 47.561 |
| 77101852 | GT-AG | 0 | 1.000000099473604e-05 | 244 | rna-XM_024150087.1 14424054 | 14 | 13177584 | 13177827 | Eutrema salsugineum 72664 | AAG\|GTGAGAAGTT...TTTATCTTGTCA/TCTTTGCTGATT...GTAAG\|TAC | 0 | 1 | 49.39 |
| 77101853 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_024150087.1 14424054 | 15 | 13177400 | 13177487 | Eutrema salsugineum 72664 | CTC\|GTGAGTGACA...ATGTTTTTATAT/TATGTTTTTATA...TGCAG\|CTT | 0 | 1 | 51.558 |
| 77101854 | GT-AG | 0 | 0.0536931010265572 | 136 | rna-XM_024150087.1 14424054 | 16 | 13177160 | 13177295 | Eutrema salsugineum 72664 | AGT\|GTATGTTCAT...TTATTCATGACG/TTATTATTCATG...AGCAG\|ATT | 2 | 1 | 53.907 |
| 77101855 | GT-AG | 0 | 1.000000099473604e-05 | 228 | rna-XM_024150087.1 14424054 | 17 | 13176472 | 13176699 | Eutrema salsugineum 72664 | GAG\|GTCAGCCTAG...ATATTTTTAGCT/TAGCTTTTCACA...TATAG\|GTG | 0 | 1 | 64.295 |
| 77101856 | GT-AG | 0 | 0.0018473410563964 | 86 | rna-XM_024150087.1 14424054 | 18 | 13176344 | 13176429 | Eutrema salsugineum 72664 | GAG\|GTACCAAAAT...TGCATCTTAACA/TGCATCTTAACA...GCTAG\|CTA | 0 | 1 | 65.244 |
| 77101857 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_024150087.1 14424054 | 19 | 13176051 | 13176138 | Eutrema salsugineum 72664 | AAG\|GTAAGATAAA...ATTGATTTAGCT/CTTGTTCTAATA...AATAG\|GTG | 1 | 1 | 69.874 |
| 77101858 | GT-AG | 0 | 0.013466482918114 | 131 | rna-XM_024150087.1 14424054 | 20 | 13175786 | 13175916 | Eutrema salsugineum 72664 | AAG\|GTAACTTATA...TTCCTTTTAATC/TTTAATCTCATC...GACAG\|ATG | 0 | 1 | 72.9 |
| 77101859 | GT-AG | 0 | 2.878227045779775e-05 | 90 | rna-XM_024150087.1 14424054 | 21 | 13175515 | 13175604 | Eutrema salsugineum 72664 | GCG\|GTAAATATTT...TTTGTTCTAACT/TTTGTTCTAACT...TGCAG\|CTT | 1 | 1 | 76.987 |
| 77101860 | GT-AG | 0 | 0.0030399411803948 | 92 | rna-XM_024150087.1 14424054 | 22 | 13175187 | 13175278 | Eutrema salsugineum 72664 | AAA\|GTATGTCCAT...ATGTTTTTAGCT/AATATGCTCAAT...CGCAG\|GGT | 0 | 1 | 82.317 |
| 77101861 | GT-AG | 0 | 1.000000099473604e-05 | 219 | rna-XM_024150087.1 14424054 | 23 | 13174872 | 13175090 | Eutrema salsugineum 72664 | CAG\|GTACATGGAA...CGTTTCTGGAAT/CTTGGTTTCACG...ATTAG\|GCC | 0 | 1 | 84.485 |
| 77101862 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-XM_024150087.1 14424054 | 24 | 13174695 | 13174775 | Eutrema salsugineum 72664 | TTG\|GTAAGAAAAA...ATCACTTTCTCT/TAGAGACTAACA...TGCAG\|CAT | 0 | 1 | 86.653 |
| 77101863 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-XM_024150087.1 14424054 | 25 | 13174556 | 13174661 | Eutrema salsugineum 72664 | CTG\|GTCAGTAATC...AGGTTCTTACCT/TAGGTTCTTACC...ATTAG\|GTG | 0 | 1 | 87.398 |
| 77101864 | GT-AG | 0 | 1.332776213829402e-05 | 109 | rna-XM_024150087.1 14424054 | 26 | 13174318 | 13174426 | Eutrema salsugineum 72664 | AAG\|GTGACCCATT...GTTGTTATAATG/GTTATAATGACT...AACAG\|GTT | 0 | 1 | 90.312 |
| 77101865 | GT-AG | 0 | 0.0002791426442602 | 101 | rna-XM_024150087.1 14424054 | 27 | 13174156 | 13174256 | Eutrema salsugineum 72664 | AAG\|GTATAATCTC...TGAGATTTGATA/TGAGATTTGATA...TGCAG\|TTT | 1 | 1 | 91.689 |
| 77101866 | GT-AG | 0 | 1.000000099473604e-05 | 78 | rna-XM_024150087.1 14424054 | 28 | 13174022 | 13174099 | Eutrema salsugineum 72664 | TTG\|GTAAGTTGTA...TCTCTCTCAATC/TTCTCTCTCAAT...TATAG\|GTT | 0 | 1 | 92.954 |
| 77101867 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_024150087.1 14424054 | 29 | 13173866 | 13173964 | Eutrema salsugineum 72664 | AAG\|GTTAGTTTCA...TTTGTTTTTTTG/GAATGAATGACT...TTTAG\|GGT | 0 | 1 | 94.241 |
| 77101868 | GT-AG | 0 | 1.000000099473604e-05 | 181 | rna-XM_024150087.1 14424054 | 30 | 13173564 | 13173744 | Eutrema salsugineum 72664 | TGG\|GTACGACATG...GATTCTTTACTT/CGATTCTTTACT...TGCAG\|GTA | 1 | 1 | 96.974 |
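The scored_motifs strings in the rows above follow a visible pattern: three parts separated by vertical bars, with the middle part split by a slash and each half abbreviated with an ellipsis. The sketch below splits a value along those delimiters and checks that the first and last two bases of the middle part reproduce dinucleotide_pair. The field names are hypothetical labels chosen for illustration, and the meaning of the two inner segments is not documented on this page.

```python
def parse_scored_motifs(text: str) -> dict:
    """Split a scored_motifs string on its visible delimiters.

    All keys are hypothetical labels, not names defined by the database.
    """
    upstream_flank, intron_part, downstream_flank = text.split("|")
    left, right = intron_part.split("/")
    five_prime_start, inner_left = left.split("...")
    inner_right, three_prime_end = right.split("...")
    return {
        "upstream_flank": upstream_flank,
        "five_prime_start": five_prime_start,
        "inner_left": inner_left,      # meaning not documented on this page
        "inner_right": inner_right,    # meaning not documented on this page
        "three_prime_end": three_prime_end,
        "downstream_flank": downstream_flank,
        # First two and last two bases of the intron part.
        "terminal_pair": f"{five_prime_start[:2]}-{three_prime_end[-2:]}",
    }


# Value copied from the row with id 77101839, whose dinucleotide_pair is GT-AG.
parsed = parse_scored_motifs("TAG|GTCAGTGTTT...TGTTTCATATCT/ATATATTTCACT...TGTAG|AGT")
print(parsed["terminal_pair"])  # prints "GT-AG"
```

The parser raises ValueError on any string that does not have exactly this shape, which is a deliberate choice for a sketch: a silent fallback would hide rows whose format differs.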
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
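The filter used on this page, transcript_id = 14424054, is the kind of lookup that idx_introns_transcript_id is presumably there to serve. The sketch below builds an abbreviated, hypothetical subset of the schema in an in-memory SQLite database and asks the planner how it would run that query. The exact wording of the plan differs between SQLite versions, so only the index name is checked.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Abbreviated subset of the schema above: three columns and one index.
# The foreign keys are omitted because the referenced tables are not created here.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "transcript_id" INTEGER,
      "ordinal_index" INTEGER,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

# Ask SQLite how it would execute the page's filter, sorted by id.
plan = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT * FROM introns WHERE transcript_id = ? ORDER BY id",
    (14424054,),
).fetchall()

# The human-readable description is the fourth column of each plan row.
details = [row[3] for row in plan]
uses_index = any("idx_introns_transcript_id" in detail for detail in details)

print(details)
print(uses_index)
```

If uses_index comes back False against a real copy of the database, running ANALYZE or checking that the index exists would be reasonable first steps.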