introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
36 rows where transcript_id = 26701855
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 148094898 | GT-AG | 0 | 1.000000099473604e-05 | 2738 | rna-XM_009493409.1 26701855 | 1 | 42532 | 45269 | Pelecanus crispus 36300 | ATC|GTAAGTACAT...TCAGCCTACACG/GTATTACGCAAA...TTTAG|ATT | 2 | 1 | 1.411 |
| 148094899 | GT-AG | 0 | 1.000000099473604e-05 | 1059 | rna-XM_009493409.1 26701855 | 2 | 41280 | 42338 | Pelecanus crispus 36300 | ATG|GTGAGTCTAC...AAGTTCATATCT/TCATATCTGACT...TTCAG|GTC | 0 | 1 | 5.601 |
| 148094900 | GT-AG | 0 | 1.000000099473604e-05 | 737 | rna-XM_009493409.1 26701855 | 3 | 40359 | 41095 | Pelecanus crispus 36300 | AAG|GTTAAAATAC...AAGGCCATAAAT/TGTGAACACATT...AACAG|TAA | 1 | 1 | 9.596 |
| 148094901 | GT-AG | 0 | 3.099120054027468e-05 | 301 | rna-XM_009493409.1 26701855 | 4 | 39987 | 40287 | Pelecanus crispus 36300 | CTC|GTAAGTATAC...AAATGTTTGACA/AAATGTTTGACA...AACAG|AAT | 0 | 1 | 11.138 |
| 148094902 | GT-AG | 0 | 0.0177484956705454 | 1219 | rna-XM_009493409.1 26701855 | 5 | 38682 | 39900 | Pelecanus crispus 36300 | TGA|GTATGTTACT...GAATCCTAATTA/GGAATCCTAATT...TTTAG|GCC | 2 | 1 | 13.005 |
| 148094903 | GT-AG | 0 | 1.000000099473604e-05 | 1105 | rna-XM_009493409.1 26701855 | 6 | 37494 | 38598 | Pelecanus crispus 36300 | ATG|GTAAGAGAAT...TTTGTCTTCTTT/TTGAAACTCACA...TGCAG|AAC | 1 | 1 | 14.807 |
| 148094904 | GT-AG | 0 | 3.960816739092873e-05 | 4864 | rna-XM_009493409.1 26701855 | 7 | 32542 | 37405 | Pelecanus crispus 36300 | AAA|GTAGGTGTGC...TGCTTTTTATTA/TTGCTTTTTATT...CAAAG|ACA | 2 | 1 | 16.717 |
| 148094905 | GT-AG | 0 | 3.391957914889174e-05 | 182 | rna-XM_009493409.1 26701855 | 8 | 32254 | 32435 | Pelecanus crispus 36300 | TCA|GTAAGTACTG...TTTTTCCTACTT/TTTTTTCCTACT...TTTAG|ATT | 0 | 1 | 19.019 |
| 148094906 | GT-AG | 0 | 1.000000099473604e-05 | 2371 | rna-XM_009493409.1 26701855 | 9 | 29759 | 32129 | Pelecanus crispus 36300 | CAA|GTGAGTATAC...TACTCTTTGATA/TACTCTTTGATA...AACAG|GTG | 1 | 1 | 21.711 |
| 148094907 | GT-AG | 0 | 1.000000099473604e-05 | 905 | rna-XM_009493409.1 26701855 | 10 | 28738 | 29642 | Pelecanus crispus 36300 | GTG|GTAAGTAGTA...ATTTCCTTAAGT/CATTTCCTTAAG...AATAG|GCA | 0 | 1 | 24.229 |
| 148094908 | GT-AG | 0 | 1.000000099473604e-05 | 744 | rna-XM_009493409.1 26701855 | 11 | 27826 | 28569 | Pelecanus crispus 36300 | ATG|GTAAAGAAAA...CTTTCCTTCCCT/TGTCTCCTGAGG...GACAG|GTA | 0 | 1 | 27.877 |
| 148094909 | GT-AG | 0 | 1.0652720730131516e-05 | 2373 | rna-XM_009493409.1 26701855 | 12 | 25255 | 27627 | Pelecanus crispus 36300 | ATG|GTAAACAGAA...AAATACTTGATT/AAATACTTGATT...TGCAG|GTT | 0 | 1 | 32.175 |
| 148094910 | GT-AG | 0 | 0.0004146692594461 | 1724 | rna-XM_009493409.1 26701855 | 13 | 23339 | 25062 | Pelecanus crispus 36300 | AAG|GTACTTTCAA...TTTTTATTAACA/TTTTTATTAACA...TTTAG|ATT | 0 | 1 | 36.344 |
| 148094911 | GT-AG | 0 | 1.000000099473604e-05 | 418 | rna-XM_009493409.1 26701855 | 14 | 22771 | 23188 | Pelecanus crispus 36300 | AAA|GTAAGATATC...GCTATTTTATTT/TGCTATTTTATT...GGCAG|GTA | 0 | 1 | 39.601 |
| 148094912 | GT-AG | 0 | 1.000000099473604e-05 | 367 | rna-XM_009493409.1 26701855 | 15 | 22277 | 22643 | Pelecanus crispus 36300 | AAG|GTAAGTGCAA...AGTATCATGACC/CAGGGTGTTACT...CCCAG|GCT | 1 | 1 | 42.358 |
| 148094913 | GT-AG | 0 | 0.0003780998931232 | 855 | rna-XM_009493409.1 26701855 | 16 | 21350 | 22204 | Pelecanus crispus 36300 | TTG|GTACGTCTGC...CTGGTCTTGAAT/TTCACGCTTATT...TGCAG|TCA | 1 | 1 | 43.921 |
| 148094914 | GT-AG | 0 | 1.000000099473604e-05 | 950 | rna-XM_009493409.1 26701855 | 17 | 20199 | 21148 | Pelecanus crispus 36300 | GAT|GTAAAGAATG...CCGCTCTTGTTT/AAAGGTTTCAGA...TACAG|ACT | 1 | 1 | 48.285 |
| 148094915 | GT-AG | 0 | 0.065199325082665 | 249 | rna-XM_009493409.1 26701855 | 18 | 19838 | 20086 | Pelecanus crispus 36300 | CAG|GTATCGTATT...TACATTTTATTT/ATACATTTTATT...CTTAG|CAT | 2 | 1 | 50.716 |
| 148094916 | GT-AG | 0 | 3.080832164846951e-05 | 939 | rna-XM_009493409.1 26701855 | 19 | 18828 | 19766 | Pelecanus crispus 36300 | AAG|GTATAAGTAT...TAATTCTCAACT/GTAATTCTCAAC...CCCAG|GAT | 1 | 1 | 52.258 |
| 148094917 | GT-AG | 0 | 1.000000099473604e-05 | 1133 | rna-XM_009493409.1 26701855 | 20 | 17555 | 18687 | Pelecanus crispus 36300 | CAG|GTAGTATTAC...ATTGCCTTCTCC/CTAAGCCTGATT...CTCAG|GTT | 0 | 1 | 55.297 |
| 148094918 | GT-AG | 0 | 1.9611905015988548e-05 | 1290 | rna-XM_009493409.1 26701855 | 21 | 16052 | 17341 | Pelecanus crispus 36300 | TTG|GTAGGTATAA...AAAGTTTTGACT/AAAGTTTTGACT...TACAG|GCT | 0 | 1 | 59.922 |
| 148094919 | GT-AG | 0 | 5.806878422778635e-05 | 522 | rna-XM_009493409.1 26701855 | 22 | 15478 | 15999 | Pelecanus crispus 36300 | GCT|GTAAGTTACA...AGATCTCTATTT/TTACTATTTATT...TACAG|TAA | 1 | 1 | 61.051 |
| 148094920 | GT-AG | 0 | 1.000000099473604e-05 | 1709 | rna-XM_009493409.1 26701855 | 23 | 13688 | 15396 | Pelecanus crispus 36300 | GAG|GTAAATGTAA...CCAGTTTTAAAT/CCAGTTTTAAAT...AACAG|GGA | 1 | 1 | 62.809 |
| 148094921 | GT-AG | 0 | 7.45256129212771e-05 | 1533 | rna-XM_009493409.1 26701855 | 24 | 11951 | 13483 | Pelecanus crispus 36300 | AAG|GTACACGGGG...ATTCTCTGAACA/TATTCTCTGAAC...TGCAG|GTT | 1 | 1 | 67.238 |
| 148094922 | GT-AG | 0 | 1.000000099473604e-05 | 718 | rna-XM_009493409.1 26701855 | 25 | 11157 | 11874 | Pelecanus crispus 36300 | ATG|GTAAGTGTAT...TCTTCTATAAAT/TCTTCTATAAAT...TGCAG|GCT | 2 | 1 | 68.888 |
| 148094923 | GT-AG | 0 | 1.000000099473604e-05 | 1565 | rna-XM_009493409.1 26701855 | 26 | 9435 | 10999 | Pelecanus crispus 36300 | CAG|GTAGGTCATC...CTTTTCGTATTT/TCGTATTTCATC...TTCAG|GGT | 0 | 1 | 72.297 |
| 148094924 | GT-AG | 0 | 1.000000099473604e-05 | 326 | rna-XM_009493409.1 26701855 | 27 | 9013 | 9338 | Pelecanus crispus 36300 | GTG|GTAGGAAGCC...ATGACCTTTTCA/GTAGGTCTGAAT...CACAG|GTA | 0 | 1 | 74.381 |
| 148094925 | GT-AG | 0 | 1.000000099473604e-05 | 1014 | rna-XM_009493409.1 26701855 | 28 | 7830 | 8843 | Pelecanus crispus 36300 | CAG|GTACTGGTAA...AAGTCTTTACCC/AAAGTCTTTACC...TGCAG|CAA | 1 | 1 | 78.05 |
| 148094926 | GT-AG | 0 | 1.0318319353358831e-05 | 448 | rna-XM_009493409.1 26701855 | 29 | 7215 | 7662 | Pelecanus crispus 36300 | CAG|GTGCCTGACC...ATTTCTTTCTCA/GCTTTTTGCATT...CCTAG|GAC | 0 | 1 | 81.676 |
| 148094927 | GT-AG | 0 | 0.000963533075631 | 583 | rna-XM_009493409.1 26701855 | 30 | 6473 | 7055 | Pelecanus crispus 36300 | GAG|GTACACAACC...TTGGCCTTGGTT/TTTTTTTTTTTT...TGCAG|TTT | 0 | 1 | 85.128 |
| 148094928 | GT-AG | 0 | 0.0066405103130761 | 470 | rna-XM_009493409.1 26701855 | 31 | 5943 | 6412 | Pelecanus crispus 36300 | AGT|GTAAGCTGTA...ATATTTTTAGTG/TGTCTTTTAAAT...CTTAG|ATA | 0 | 1 | 86.431 |
| 148094929 | GT-AG | 0 | 1.000000099473604e-05 | 758 | rna-XM_009493409.1 26701855 | 32 | 5091 | 5848 | Pelecanus crispus 36300 | CTG|GTGAGTGAAA...AGGTCCTCAGTC/CAGGTCCTCAGT...TCCAG|AAA | 1 | 1 | 88.472 |
| 148094930 | GT-AG | 0 | 1.000000099473604e-05 | 1657 | rna-XM_009493409.1 26701855 | 33 | 3265 | 4921 | Pelecanus crispus 36300 | CAG|GTAGGTCTGC...CAGATGTTCACT/CAGATGTTCACT...TAAAG|GTC | 2 | 1 | 92.141 |
| 148094931 | GT-AG | 0 | 1.000000099473604e-05 | 626 | rna-XM_009493409.1 26701855 | 34 | 2551 | 3176 | Pelecanus crispus 36300 | CAG|GTTAGAAACA...TTGTGCTTATTT/TTTGTGCTTATT...TGCAG|TTG | 0 | 1 | 94.051 |
| 148094932 | GT-AG | 0 | 1.000000099473604e-05 | 1039 | rna-XM_009493409.1 26701855 | 35 | 1437 | 2475 | Pelecanus crispus 36300 | GAG|GTAAGCAAAC...TGTTCCTTTTTC/AGTTTACTCATC...TCCAG|CTC | 0 | 1 | 95.68 |
| 148094933 | GT-AG | 0 | 1.000000099473604e-05 | 975 | rna-XM_009493409.1 26701855 | 36 | 353 | 1327 | Pelecanus crispus 36300 | CTG|GTAAGGGCAT...TATGCTTTTCCA/TGGGCTCTCAAA...CACAG|ACA | 1 | 1 | 98.046 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);