introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
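The `start`, `end`, and `length` columns can be cross-checked against one another. The following is a minimal Python sketch, assuming that `start` and `end` are 1-based inclusive coordinates; the schema does not state this, but it is consistent with the rows listed for this transcript. The helper name `intron_length` is hypothetical.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, ASSUMING 1-based inclusive start/end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates of the first intron (ordinal_index 1) of transcript 22607886.
first_intron_length = intron_length(148293781, 148312046)
print(first_intron_length)
```

For that first intron the result is 18266, which matches the value stored in its `length` column.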
37 rows where transcript_id = 22607886
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122607400 | GT-AG | 0 | 1.000000099473604e-05 | 18266 | rna-XM_021194493.2 22607886 | 1 | 148293781 | 148312046 | Mus pahari 10093 | CAG|GTGGGTGTCG...TTCAGATTAACT/TTCAGATTAACT...TATAG|GCT | 2 | 1 | 0.252 |
| 122607401 | GT-AG | 0 | 1.000000099473604e-05 | 3537 | rna-XM_021194493.2 22607886 | 2 | 148312171 | 148315707 | Mus pahari 10093 | AAG|GTAAGAACTC...TTTGTTTTGATA/TTTGTTTTGATA...TTAAG|GTA | 0 | 1 | 2.488 |
| 122607402 | GT-AG | 0 | 0.0236310588805192 | 1892 | rna-XM_021194493.2 22607886 | 3 | 148315823 | 148317714 | Mus pahari 10093 | AAG|GTGCCTTTTA...ATTTTTTTGACT/ATTTTTTTGACT...TCTAG|AAC | 1 | 1 | 4.561 |
| 122607403 | GT-AG | 0 | 1.000000099473604e-05 | 5797 | rna-XM_021194493.2 22607886 | 4 | 148317859 | 148323655 | Mus pahari 10093 | CAG|GTTAGTAGTC...TATTTCTTCTCT/TGGTGACTGATT...TATAG|GTG | 1 | 1 | 7.157 |
| 122607404 | GT-AG | 0 | 0.0001411407901867 | 1782 | rna-XM_021194493.2 22607886 | 5 | 148323820 | 148325601 | Mus pahari 10093 | TCT|GTAGGTAAAA...TTTATTTTAATT/TTTATTTTAATT...TTTAG|ACT | 0 | 1 | 10.114 |
| 122607405 | GT-AG | 0 | 1.000000099473604e-05 | 2490 | rna-XM_021194493.2 22607886 | 6 | 148325739 | 148328228 | Mus pahari 10093 | GAG|GTAGTGATCT...CATTCCTTCATA/CATTCCTTCATA...TTTAG|CAA | 2 | 1 | 12.583 |
| 122607406 | GT-AG | 0 | 0.0005348340378282 | 672 | rna-XM_021194493.2 22607886 | 7 | 148328359 | 148329030 | Mus pahari 10093 | GAG|GTATTTGAAA...AAAACTTTGACC/TTTGACCTAATA...TTTAG|ACA | 0 | 1 | 14.927 |
| 122607407 | GT-AG | 0 | 1.000000099473604e-05 | 2853 | rna-XM_021194493.2 22607886 | 8 | 148329100 | 148331952 | Mus pahari 10093 | GAG|GTAAGGCATC...GCTTCTTTAAAC/GCTTCTTTAAAC...TTTAG|GTT | 0 | 1 | 16.171 |
| 122607408 | GT-AG | 0 | 1.000000099473604e-05 | 792 | rna-XM_021194493.2 22607886 | 9 | 148332043 | 148332834 | Mus pahari 10093 | GAG|GTAAGTATAT...TGATTTTTACTT/TTTTTACTTATT...AATAG|ATG | 0 | 1 | 17.793 |
| 122607409 | GT-AG | 0 | 1.0035495808941124e-05 | 88 | rna-XM_021194493.2 22607886 | 10 | 148332931 | 148333018 | Mus pahari 10093 | GAA|GTAAGTATCA...AACTCCTTTCTA/GCATATGTAACT...TGAAG|GTT | 0 | 1 | 19.524 |
| 122607410 | GT-AG | 0 | 0.0001967182086303 | 3619 | rna-XM_021194493.2 22607886 | 11 | 148333196 | 148336814 | Mus pahari 10093 | CAG|GTAACTAACT...CATGTTTTGATG/CATGTTTTGATG...TACAG|GAT | 0 | 1 | 22.715 |
| 122607411 | GT-AG | 0 | 0.0564258010055856 | 712 | rna-XM_021194493.2 22607886 | 12 | 148336956 | 148337667 | Mus pahari 10093 | AAG|GTATACTAAA...CTTTTCTTAAAA/ACTTTTCTTAAA...TGCAG|GTG | 0 | 1 | 25.257 |
| 122607412 | GT-AG | 0 | 1.000000099473604e-05 | 4385 | rna-XM_021194493.2 22607886 | 13 | 148337793 | 148342177 | Mus pahari 10093 | CAG|GTAATTATTT...TGTTTCTTCTTC/ATAAGATTAATT...ATTAG|TAT | 2 | 1 | 27.51 |
| 122607413 | GT-AG | 0 | 1.000000099473604e-05 | 3242 | rna-XM_021194493.2 22607886 | 14 | 148342299 | 148345540 | Mus pahari 10093 | CAG|GTAAGAACTG...TACGTCATAACT/ATAACTCTGATT...TTCAG|AAC | 0 | 1 | 29.692 |
| 122607414 | GT-AG | 0 | 1.000000099473604e-05 | 2151 | rna-XM_021194493.2 22607886 | 15 | 148345646 | 148347796 | Mus pahari 10093 | AAG|GTACATGATG...TAGCACTTAATA/ATTTGTTTCATA...TGTAG|GTC | 0 | 1 | 31.585 |
| 122607415 | GT-AG | 0 | 1.000000099473604e-05 | 261 | rna-XM_021194493.2 22607886 | 16 | 148347947 | 148348207 | Mus pahari 10093 | AAG|GTAAGTGTAG...ATTGGTTTAACT/ATTGGTTTAACT...TCTAG|GAA | 0 | 1 | 34.289 |
| 122607416 | GT-AG | 0 | 1.000000099473604e-05 | 508 | rna-XM_021194493.2 22607886 | 17 | 148348351 | 148348858 | Mus pahari 10093 | TAA|GTGAGTATTT...GTGACCTTTTCC/TTTTGCCTCAGG...TTTAG|GTT | 2 | 1 | 36.867 |
| 122607417 | GT-AG | 0 | 1.000000099473604e-05 | 1572 | rna-XM_021194493.2 22607886 | 18 | 148349034 | 148350605 | Mus pahari 10093 | AAG|GTAAGGTATA...TTTCCCTAAATC/TTTTCCCTAAAT...TTTAG|GAG | 0 | 1 | 40.022 |
| 122607418 | GT-AG | 0 | 1.000000099473604e-05 | 2923 | rna-XM_021194493.2 22607886 | 19 | 148350810 | 148353732 | Mus pahari 10093 | CTG|GTGAGTAAAG...CATTTCTTTTTT/AATGATTTTACA...ACTAG|GTA | 0 | 1 | 43.699 |
| 122607419 | GT-AG | 0 | 0.0010191047030089 | 1216 | rna-XM_021194493.2 22607886 | 20 | 148354160 | 148355375 | Mus pahari 10093 | AAG|GTATATGTCT...TATATTTTGATT/CTTTTGCTCACT...ACTAG|GAT | 1 | 1 | 51.397 |
| 122607420 | GT-AG | 0 | 1.000000099473604e-05 | 689 | rna-XM_021194493.2 22607886 | 21 | 148355567 | 148356255 | Mus pahari 10093 | GAG|GTAAGTAGTC...ATTTTTTTAATA/ATTTTTTTAATA...TTTAG|GGT | 0 | 1 | 54.84 |
| 122607421 | GT-AG | 0 | 0.0807486764915204 | 365 | rna-XM_021194493.2 22607886 | 22 | 148356410 | 148356774 | Mus pahari 10093 | TTG|GTATGCATTC...GGACTTTTATTT/AGGACTTTTATT...AATAG|ATG | 1 | 1 | 57.617 |
| 122607422 | GT-AG | 0 | 1.000000099473604e-05 | 1415 | rna-XM_021194493.2 22607886 | 23 | 148356885 | 148358299 | Mus pahari 10093 | TTG|GTAATTAATA...TATTCCCTAGTA/ATTTTTTTCAAT...TTTAG|CTG | 0 | 1 | 59.6 |
| 122607423 | GT-AG | 1 | 99.9993936593276 | 1885 | rna-XM_021194493.2 22607886 | 24 | 148358502 | 148360386 | Mus pahari 10093 | CCT|GTATCCTTTA...ATTTCCTTGATG/ACTATTTTAAAA...CTCAG|GTG | 1 | 1 | 63.241 |
| 122607424 | GT-AG | 0 | 1.000000099473604e-05 | 1772 | rna-XM_021194493.2 22607886 | 25 | 148360536 | 148362307 | Mus pahari 10093 | GAG|GTAAGTGGGA...TACATCTTAAAA/CTAATTCTAATA...TTTAG|GCA | 0 | 1 | 65.928 |
| 122607425 | GT-AG | 0 | 1.000000099473604e-05 | 158 | rna-XM_021194493.2 22607886 | 26 | 148362461 | 148362618 | Mus pahari 10093 | CAG|GTAAGTGTGT...ATTTCCTTTTTC/CTTACATTTACA...TCTAG|GAT | 0 | 1 | 68.686 |
| 122607426 | GT-AG | 0 | 1.000000099473604e-05 | 84 | rna-XM_021194493.2 22607886 | 27 | 148362741 | 148362824 | Mus pahari 10093 | GAG|GTAATAAAAA...AATTGCTTACTT/AAATTGCTTACT...TAAAG|GGC | 2 | 1 | 70.885 |
| 122607427 | GT-AG | 0 | 1.000000099473604e-05 | 1738 | rna-XM_021194493.2 22607886 | 28 | 148362993 | 148364730 | Mus pahari 10093 | AAG|GTAAGAGCCG...CTTTCCTTATTT/TCTTTCCTTATT...TAAAG|GTT | 2 | 1 | 73.914 |
| 122607428 | GT-AG | 0 | 1.000000099473604e-05 | 462 | rna-XM_021194493.2 22607886 | 29 | 148364809 | 148365270 | Mus pahari 10093 | TCG|GTAAGAATTC...ATTTATTTATTT/TATTTATTTATT...CTTAG|AAA | 2 | 1 | 75.32 |
| 122607429 | GT-AG | 0 | 1.000000099473604e-05 | 2119 | rna-XM_021194493.2 22607886 | 30 | 148365401 | 148367519 | Mus pahari 10093 | CAG|GTAATTGTTT...GATTCTTTTTCT/TAATTGATGATT...TTCAG|AAC | 0 | 1 | 77.664 |
| 122607430 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-XM_021194493.2 22607886 | 31 | 148367662 | 148367744 | Mus pahari 10093 | CAG|GTAAGGAAGA...AAAACTTTGTTC/CAGGCATTCAAA...GACAG|GTG | 1 | 1 | 80.224 |
| 122607431 | GT-AG | 0 | 1.000000099473604e-05 | 2183 | rna-XM_021194493.2 22607886 | 32 | 148367876 | 148370058 | Mus pahari 10093 | CAG|GTTAGGAATG...CTTTTCTTAGAT/TGTATATTAACT...CACAG|GTT | 0 | 1 | 82.585 |
| 122607432 | GT-AG | 0 | 1.000000099473604e-05 | 1006 | rna-XM_021194493.2 22607886 | 33 | 148370185 | 148371190 | Mus pahari 10093 | CAG|GTAGAGGTTA...GACACTTGAGCT/AGCCCACTTACA...TAAAG|GCG | 0 | 1 | 84.857 |
| 122607433 | GT-AG | 0 | 0.0001952844520699 | 874 | rna-XM_021194493.2 22607886 | 34 | 148371344 | 148372217 | Mus pahari 10093 | CAA|GTACGTATGT...TTTCTGTTGAAT/TTTCTGTTGAAT...CTAAG|TTG | 0 | 1 | 87.615 |
| 122607434 | GT-AG | 0 | 1.000000099473604e-05 | 5216 | rna-XM_021194493.2 22607886 | 35 | 148372430 | 148377645 | Mus pahari 10093 | CCG|GTAAGTAGCT...TTCATTTTAATA/GTATGTTTCATT...TGTAG|ATT | 2 | 1 | 91.437 |
| 122607435 | GT-AG | 0 | 1.000000099473604e-05 | 274 | rna-XM_021194493.2 22607886 | 36 | 148377806 | 148378079 | Mus pahari 10093 | CAG|GTGCGTCATC...TATGGTTTAATT/TATGGTTTAATT...TTTAG|AAA | 0 | 1 | 94.321 |
| 122607436 | GT-AG | 0 | 1.000000099473604e-05 | 1086 | rna-XM_021194493.2 22607886 | 37 | 148378251 | 148379336 | Mus pahari 10093 | AAG|GTAAAGACTT...CTCACTTTATCT/ATAAAACTCACT...ATTAG|GAT | 0 | 1 | 97.404 |
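Of the 37 introns listed, only one is flagged as minor (`is_minor = 1`). A query of the following kind would pick it out. This is a rough sketch using Python's built-in `sqlite3` module against an in-memory table that holds three rows copied from the listing and only a subset of the columns; it is not a copy of the real database.

```python
import sqlite3

# Three rows copied from the listing above:
# (id, is_minor, score, transcript_id, ordinal_index)
rows = [
    (122607422, 0, 1.000000099473604e-05, 22607886, 23),
    (122607423, 1, 99.9993936593276, 22607886, 24),
    (122607424, 0, 1.000000099473604e-05, 22607886, 25),
]

conn = sqlite3.connect(":memory:")
# Reduced illustrative table, not the full introns schema.
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, is_minor INTEGER, score REAL, "
    "transcript_id INTEGER, ordinal_index INTEGER)"
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", rows)

# Minor introns of one transcript, in transcript order.
minor = conn.execute(
    "SELECT id, ordinal_index, score FROM introns "
    "WHERE transcript_id = ? AND is_minor = 1 "
    "ORDER BY ordinal_index",
    (22607886,),
).fetchall()
print(minor)
```

With these three rows, the query returns only intron 122607423 (ordinal_index 24), whose score is close to 100.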
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
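The index on `transcript_id` is what lets a filter such as `transcript_id = 22607886` avoid a full table scan. One way to check this is to ask SQLite for its query plan. The sketch below again uses Python's `sqlite3` module with a reduced stand-in table (three columns, no foreign keys), so it illustrates the mechanism rather than reproducing the real database. The exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced stand-in for the schema above: only the columns needed here,
# and no foreign keys, since the referenced tables are not created.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        is_minor INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (22607886,),
).fetchall()
# The last field of each plan row is a human-readable description.
details = [row[-1] for row in plan]
print(details)
```

If the index is used, the printed description names `idx_introns_transcript_id`.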