introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
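As a rough illustration of how these columns might be queried, the sketch below builds an in-memory SQLite table with a subset of the columns, loads three rows copied from the listing for transcript_id = 22607869, and returns them in transcript order. The helper name `introns_for_transcript` and the reduced column set are assumptions made for the example; they are not part of the database.

```python
import sqlite3

# In-memory stand-in for the real database; only a subset of columns is created.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER,
        in_cds INTEGER
    )
    """
)

# Three rows copied from the listing for transcript_id = 22607869.
rows = [
    (122606834, "GT-AG", 0, 1.000000099473604e-05, 20213, 22607869, 1, 1, 1),
    (122606835, "GT-AG", 0, 0.0024369588770957, 1529, 22607869, 2, 0, 1),
    (122606858, "GT-AG", 0, 1.8962254957251368, 1682, 22607869, 25, 0, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", rows)


def introns_for_transcript(conn, transcript_id):
    """Hypothetical helper: (ordinal_index, length, score, is_minor) in transcript order."""
    cur = conn.execute(
        "SELECT ordinal_index, length, score, is_minor FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


result = introns_for_transcript(conn, 22607869)
# Intron with the highest minor-intron score among the loaded rows.
top = max(result, key=lambda r: r[2])
```

Among the three loaded rows, intron 25 has the highest score (about 1.9 on the 0-100 scale) and is still flagged is_minor = 0, in line with every intron in the listing.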
54 rows where transcript_id = 22607869
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122606834 | GT-AG | 0 | 1.000000099473604e-05 | 20213 | rna-XM_029535051.1 22607869 | 1 | 149398882 | 149419094 | Mus pahari 10093 | AGG\|GTAAGACGAC...TTTTTCTTTTTT/TCTTTTTTTATT...TGCAG\|ATG | 1 | 1 | 1.41 |
| 122606835 | GT-AG | 0 | 0.0024369588770957 | 1529 | rna-XM_029535051.1 22607869 | 2 | 149397297 | 149398825 | Mus pahari 10093 | GAG\|GTGACTTTAT...TTCTCTTTAACA/TTCTCTTTAACA...AACAG\|ATT | 0 | 1 | 2.307 |
| 122606836 | GT-AG | 0 | 1.000000099473604e-05 | 24530 | rna-XM_029535051.1 22607869 | 3 | 149372675 | 149397204 | Mus pahari 10093 | TAA\|GTGAGTTGCT...GTGCCCTTTGTT/TGCTCCCTCATA...TGCAG\|ATT | 2 | 1 | 3.78 |
| 122606837 | GT-AG | 0 | 1.000000099473604e-05 | 2705 | rna-XM_029535051.1 22607869 | 4 | 149369861 | 149372565 | Mus pahari 10093 | GGG\|GTGAGTGCTC...ATTCTCTTACCT/TATTCTCTTACC...TTCAG\|GCC | 0 | 1 | 5.526 |
| 122606838 | GT-AG | 0 | 4.957602570428907e-05 | 7399 | rna-XM_029535051.1 22607869 | 5 | 149362374 | 149369772 | Mus pahari 10093 | GAG\|GTAAGCTGGC...CCATCATTAACT/CCATCATTAACT...TTCAG\|AAG | 1 | 1 | 6.936 |
| 122606839 | GT-AG | 0 | 1.000000099473604e-05 | 554 | rna-XM_029535051.1 22607869 | 6 | 149361653 | 149362206 | Mus pahari 10093 | CAG\|GTGGGAAGGG...GAAGCTTTGGGA/TTGAATCGCACA...TGCAG\|ATC | 0 | 1 | 9.611 |
| 122606840 | GT-AG | 0 | 1.000000099473604e-05 | 2892 | rna-XM_029535051.1 22607869 | 7 | 149358632 | 149361523 | Mus pahari 10093 | GAG\|GTGAGTGGGT...TGGGCCTGAGCC/TAATGTCTGATG...GACAG\|TTA | 0 | 1 | 11.677 |
| 122606841 | GT-AG | 0 | 1.000000099473604e-05 | 78 | rna-XM_029535051.1 22607869 | 8 | 149358491 | 149358568 | Mus pahari 10093 | CGG\|GTAAGGGGCA...TCACCCTTCTTC/CGTCAGCTCACC...TCCAG\|GTT | 0 | 1 | 12.686 |
| 122606842 | GT-AG | 0 | 1.000000099473604e-05 | 1069 | rna-XM_029535051.1 22607869 | 9 | 149357371 | 149358439 | Mus pahari 10093 | AAG\|GTGAGTGACA...TTTTTTTTAAGT/TTTTTTTTAAGT...CCCAG\|ATC | 0 | 1 | 13.503 |
| 122606843 | GT-AG | 0 | 1.000000099473604e-05 | 540 | rna-XM_029535051.1 22607869 | 10 | 149356800 | 149357339 | Mus pahari 10093 | CTG\|GTGAGTAATG...ACATCTTTAGAG/CACATCTTTAGA...GTCAG\|GCC | 1 | 1 | 14.0 |
| 122606844 | GT-AG | 0 | 1.000000099473604e-05 | 618 | rna-XM_029535051.1 22607869 | 11 | 149356066 | 149356683 | Mus pahari 10093 | CCT\|GTGAGTGCCT...GTGATTTTGACC/GTGATTTTGACC...CATAG\|CCT | 0 | 1 | 15.858 |
| 122606845 | GT-AG | 0 | 0.0027997190919781 | 2724 | rna-XM_029535051.1 22607869 | 12 | 149353215 | 149355938 | Mus pahari 10093 | AGA\|GTATGTACTG...GTTGTTTTAAGA/GTTGTTTTAAGA...TCCAG\|TGG | 1 | 1 | 17.892 |
| 122606846 | GT-AG | 0 | 1.000000099473604e-05 | 2580 | rna-XM_029535051.1 22607869 | 13 | 149350531 | 149353110 | Mus pahari 10093 | AAG\|GTTCGTATGT...TTTTCTTTGGCA/TACCTACTGATT...TGCAG\|GTG | 0 | 1 | 19.558 |
| 122606847 | GT-AG | 0 | 1.000000099473604e-05 | 226 | rna-XM_029535051.1 22607869 | 14 | 149350236 | 149350461 | Mus pahari 10093 | AAG\|GTTTGACTTT...TTTTCTGTTTCT/AGTGAATTCATG...GACAG\|TTT | 0 | 1 | 20.663 |
| 122606848 | GT-AG | 0 | 1.000000099473604e-05 | 679 | rna-XM_029535051.1 22607869 | 15 | 149349513 | 149350191 | Mus pahari 10093 | CTG\|GTGAGTTGCA...AAAGCTATAACT/TATAACTTCATA...CACAG\|GGA | 2 | 1 | 21.368 |
| 122606849 | GT-AG | 0 | 1.000000099473604e-05 | 2462 | rna-XM_029535051.1 22607869 | 16 | 149346968 | 149349429 | Mus pahari 10093 | AAG\|GTAAGTCACC...TTTTTTTTAACC/TTTTTTTTAACC...CATAG\|ATT | 1 | 1 | 22.697 |
| 122606850 | GT-AG | 0 | 1.000000099473604e-05 | 1465 | rna-XM_029535051.1 22607869 | 17 | 149345464 | 149346928 | Mus pahari 10093 | CAG\|GTTTATGCTT...ATAGCTTGAACG/GTGCTATTTATG...CTCAG\|CAA | 1 | 1 | 23.322 |
| 122606851 | GT-AG | 0 | 1.000000099473604e-05 | 1173 | rna-XM_029535051.1 22607869 | 18 | 149344166 | 149345338 | Mus pahari 10093 | AAG\|GTTGGTTTAT...TGGTCCTTGTCT/ATGGGAATAATT...TTCAG\|GGA | 0 | 1 | 25.324 |
| 122606852 | GT-AG | 0 | 1.000000099473604e-05 | 6061 | rna-XM_029535051.1 22607869 | 19 | 149337988 | 149344048 | Mus pahari 10093 | GAG\|GTAAATGTCG...TGAACCTTATGT/GTGGTACTGAAC...TTCAG\|AAG | 0 | 1 | 27.198 |
| 122606853 | GT-AG | 0 | 1.000000099473604e-05 | 1076 | rna-XM_029535051.1 22607869 | 20 | 149336734 | 149337809 | Mus pahari 10093 | ATG\|GTAAGCCTAG...TGACTCCTGACA/CAAGTATTCACT...ACCAG\|GGA | 1 | 1 | 30.05 |
| 122606854 | GT-AG | 0 | 1.000000099473604e-05 | 3148 | rna-XM_029535051.1 22607869 | 21 | 149333461 | 149336608 | Mus pahari 10093 | CTG\|GTGAGTTATC...TCTTTCTTGATT/TCTTTCTTGATT...TTCAG\|CAA | 0 | 1 | 32.052 |
| 122606855 | GT-AG | 0 | 1.6473070157074275e-05 | 1858 | rna-XM_029535051.1 22607869 | 22 | 149331496 | 149333353 | Mus pahari 10093 | GAG\|GTACAGTGAC...TAGGTTTTATTT/TTAGGTTTTATT...TATAG\|ATA | 2 | 1 | 33.766 |
| 122606856 | GT-AG | 0 | 0.0002671012708023 | 1269 | rna-XM_029535051.1 22607869 | 23 | 149330034 | 149331302 | Mus pahari 10093 | GAG\|GTAGACCTTG...TTTGCTTTCTTT/TGTAGACTCATT...CCCAG\|CCT | 0 | 1 | 36.857 |
| 122606857 | GT-AG | 0 | 0.0024757037165927 | 2326 | rna-XM_029535051.1 22607869 | 24 | 149327552 | 149329877 | Mus pahari 10093 | AAG\|GTATGCCTTC...GTCACCTAAGAT/TTATTGTTCATC...CACAG\|TAT | 0 | 1 | 39.356 |
| 122606858 | GT-AG | 0 | 1.8962254957251368 | 1682 | rna-XM_029535051.1 22607869 | 25 | 149325738 | 149327419 | Mus pahari 10093 | ATG\|GTACCTTTGG...CGATCTTTGATA/TTGATAGTGATT...AACAG\|TAT | 0 | 1 | 41.47 |
| 122606859 | GT-AG | 0 | 1.000000099473604e-05 | 3871 | rna-XM_029535051.1 22607869 | 26 | 149321703 | 149325573 | Mus pahari 10093 | AAG\|GTGAGTATGT...CCCTCGGTAACA/TTGGGTTCCATT...CACAG\|CCT | 2 | 1 | 44.097 |
| 122606860 | GT-AG | 0 | 1.000000099473604e-05 | 2077 | rna-XM_029535051.1 22607869 | 27 | 149319511 | 149321587 | Mus pahari 10093 | GCG\|GTGAGCCGTT...TGATTCTCATTC/GTGATTCTCATT...CTTAG\|AAT | 0 | 1 | 45.939 |
| 122606861 | GT-AG | 0 | 2.1263594354741103e-05 | 1046 | rna-XM_029535051.1 22607869 | 28 | 149318359 | 149319404 | Mus pahari 10093 | AAG\|GTATGTACAT...CTATTCTCAAAC/ACTATTCTCAAA...TGCAG\|GCT | 1 | 1 | 47.637 |
| 122606862 | GT-AG | 0 | 1.000000099473604e-05 | 2370 | rna-XM_029535051.1 22607869 | 29 | 149315843 | 149318212 | Mus pahari 10093 | AGG\|GTAAGAGGGG...ACGCCCGTACTT/CTGGGATTAAAG...TTTAG\|GCC | 0 | 1 | 49.976 |
| 122606863 | GT-AG | 0 | 1.000000099473604e-05 | 426 | rna-XM_029535051.1 22607869 | 30 | 149315243 | 149315668 | Mus pahari 10093 | CTT\|GTAAGTAACC...ATGATGATGACT/ATGATGATGACT...TTTAG\|GGG | 0 | 1 | 52.763 |
| 122606864 | GT-AG | 0 | 0.0003061383379747 | 1614 | rna-XM_029535051.1 22607869 | 31 | 149313520 | 149315133 | Mus pahari 10093 | ACA\|GTAAGTTTTA...TGTTTCTCATCG/CTGTTTCTCATC...TTCAG\|GAG | 1 | 1 | 54.509 |
| 122606865 | GT-AG | 0 | 1.000000099473604e-05 | 1627 | rna-XM_029535051.1 22607869 | 32 | 149311815 | 149313441 | Mus pahari 10093 | CAG\|GTGAAGGGAA...ATATCTTTTTCT/CTACTGTTAATA...CATAG\|ATC | 1 | 1 | 55.758 |
| 122606866 | GT-AG | 0 | 1.000000099473604e-05 | 93 | rna-XM_029535051.1 22607869 | 33 | 149311540 | 149311632 | Mus pahari 10093 | GTG\|GTAGGTGATA...GGCTATTTGACC/GGCTATTTGACC...TCTAG\|GGT | 0 | 1 | 58.674 |
| 122606867 | GT-AG | 0 | 0.0444669350740738 | 115 | rna-XM_029535051.1 22607869 | 34 | 149311266 | 149311380 | Mus pahari 10093 | AAG\|GTATCGCTTT...TCTGCTTTACTG/CTCTGCTTTACT...CACAG\|GAT | 0 | 1 | 61.221 |
| 122606868 | GT-AG | 0 | 0.0001709948790351 | 913 | rna-XM_029535051.1 22607869 | 35 | 149310254 | 149311166 | Mus pahari 10093 | GAG\|GTACGTATTC...CTTCCCTTCTCT/GTCAGGTTTACT...CCCAG\|ATC | 0 | 1 | 62.806 |
| 122606869 | GT-AG | 0 | 2.1883639926262612e-05 | 2034 | rna-XM_029535051.1 22607869 | 36 | 149308058 | 149310091 | Mus pahari 10093 | GTG\|GTACAGTATC...CAGATCTGGAAA/TCCTGTCAGATC...CCTAG\|TTC | 0 | 1 | 65.401 |
| 122606870 | GT-AG | 0 | 0.0005841560689764 | 3075 | rna-XM_029535051.1 22607869 | 37 | 149304817 | 149307891 | Mus pahari 10093 | AAG\|GTATGTCTAT...GTGTCCTAGATC/TCCTAGATCACA...TGCAG\|CCT | 1 | 1 | 68.06 |
| 122606871 | GT-AG | 0 | 1.2115226002489318e-05 | 2783 | rna-XM_029535051.1 22607869 | 38 | 149301957 | 149304739 | Mus pahari 10093 | AAG\|GTAGGTTGTA...TCCTTCTTCATG/TCCTTCTTCATG...TCCAG\|TGC | 0 | 1 | 69.294 |
| 122606872 | GT-AG | 0 | 0.0019435854257305 | 1352 | rna-XM_029535051.1 22607869 | 39 | 149300548 | 149301899 | Mus pahari 10093 | CAT\|GTATGTATAT...CTGACCTCATAT/CTGCTACTGACC...CCCAG\|CTG | 0 | 1 | 70.207 |
| 122606873 | GT-AG | 0 | 1.8985165036200067e-05 | 2270 | rna-XM_029535051.1 22607869 | 40 | 149298266 | 149300535 | Mus pahari 10093 | AAG\|GTAACGATCG...TTCTTTTTCATT/TTCTTTTTCATT...TCTAG\|GAG | 0 | 1 | 70.399 |
| 122606874 | GT-AG | 0 | 5.024103483859152e-05 | 1518 | rna-XM_029535051.1 22607869 | 41 | 149296649 | 149298166 | Mus pahari 10093 | AAG\|GTACTTGGAC...TCTGTCTTAATG/CTCTGTCTTAAT...TTCAG\|ATC | 0 | 1 | 71.985 |
| 122606875 | GT-AG | 0 | 1.000000099473604e-05 | 1433 | rna-XM_029535051.1 22607869 | 42 | 149295090 | 149296522 | Mus pahari 10093 | AAA\|GTGAGTAAAG...GGGTCCTTGAAG/CTTGAAGTCATC...CTCAG\|GGC | 0 | 1 | 74.003 |
| 122606876 | GT-AG | 0 | 1.000000099473604e-05 | 522 | rna-XM_029535051.1 22607869 | 43 | 149294412 | 149294933 | Mus pahari 10093 | CTG\|GTAAGAGTTT...AACATCTTACCT/TAACATCTTACC...TACAG\|TGT | 0 | 1 | 76.502 |
| 122606877 | GT-AG | 0 | 7.880580898694279e-05 | 4613 | rna-XM_029535051.1 22607869 | 44 | 149289707 | 149294319 | Mus pahari 10093 | CAG\|GTAACATACA...AGGCCCCTGATC/AGGCCCCTGATC...CTTAG\|AAT | 2 | 1 | 77.975 |
| 122606878 | GT-AG | 0 | 1.598368268977974e-05 | 649 | rna-XM_029535051.1 22607869 | 45 | 149288887 | 149289535 | Mus pahari 10093 | TGT\|GTAAGTTGCT...GGAACCTTGTGC/TGTGGTGTCATC...CACAG\|TTC | 2 | 1 | 80.714 |
| 122606879 | GT-AG | 0 | 1.000000099473604e-05 | 2105 | rna-XM_029535051.1 22607869 | 46 | 149286633 | 149288737 | Mus pahari 10093 | TTG\|GTGAGCACAA...TTGTTCCCAACT/GTCAATGTCATT...TCCAG\|AGG | 1 | 1 | 83.101 |
| 122606880 | GT-AG | 0 | 1.000000099473604e-05 | 2074 | rna-XM_029535051.1 22607869 | 47 | 149284419 | 149286492 | Mus pahari 10093 | CAG\|GTAAAAGGGA...TCTTCTTTCGTT/CCATGTGTGATT...TTTAG\|GGA | 0 | 1 | 85.344 |
| 122606881 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-XM_029535051.1 22607869 | 48 | 149284240 | 149284329 | Mus pahari 10093 | GAA\|GTGAGTAAGC...CGGTCCAGGACT/TAGACATTCAAA...TTCAG\|GTA | 2 | 1 | 86.769 |
| 122606882 | GT-AG | 0 | 1.000000099473604e-05 | 3173 | rna-XM_029535051.1 22607869 | 49 | 149280971 | 149284143 | Mus pahari 10093 | AGG\|GTGAGGATAG...GATCTCTTGTCA/TTGTCAATCACA...TACAG\|CTG | 2 | 1 | 88.307 |
| 122606883 | GT-AG | 0 | 1.2373601298082745e-05 | 2166 | rna-XM_029535051.1 22607869 | 50 | 149278663 | 149280828 | Mus pahari 10093 | AAA\|GTAAGCCCTG...CATGTCTTTTCT/CTGATATTGATC...TTTAG\|GAG | 0 | 1 | 90.581 |
| 122606884 | GT-AG | 0 | 0.0016717507982509 | 1414 | rna-XM_029535051.1 22607869 | 51 | 149277149 | 149278562 | Mus pahari 10093 | TGG\|GTATTGCTTA...CTTTTCTGGACC/CTGGACCTCATG...GCCAG\|GTT | 1 | 1 | 92.183 |
| 122606885 | GT-AG | 0 | 0.0003360913168427 | 1930 | rna-XM_029535051.1 22607869 | 52 | 149275028 | 149276957 | Mus pahari 10093 | GCT\|GTATGAGTAT...ATGTGTTTAACC/ATGTGTTTAACC...TGAAG\|GGG | 0 | 1 | 95.243 |
| 122606886 | GT-AG | 0 | 1.000000099473604e-05 | 440 | rna-XM_029535051.1 22607869 | 53 | 149274478 | 149274917 | Mus pahari 10093 | AAA\|GTGAGAGGCC...TAGTCCTTCATA/TAGTCCTTCATA...TGCAG\|CCG | 2 | 1 | 97.005 |
| 122606887 | GT-AG | 0 | 1.000000099473604e-05 | 1811 | rna-XM_029535051.1 22607869 | 54 | 149272519 | 149274329 | Mus pahari 10093 | CCG\|GTACGAAGCC...AAAGTCATATCC/CATGGACTGACA...TTCAG\|AAC | 0 | 1 | 99.375 |
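Two relationships in the rows above can be checked mechanically. In the sampled rows, length equals end - start + 1, so the coordinates appear to be inclusive. The text between the two pipe characters of scored_motifs appears to begin and end with the dinucleotides reported in dinucleotide_pair. The following sketch tests both on three rows copied from the table. The reading of the scored_motifs layout is an assumption inferred from the data, not something the source documents, and the function names are hypothetical.

```python
# (start, end, length, dinucleotide_pair, scored_motifs) for three rows of the listing.
rows = [
    (149398882, 149419094, 20213, "GT-AG",
     "AGG|GTAAGACGAC...TTTTTCTTTTTT/TCTTTTTTTATT...TGCAG|ATG"),
    (149397297, 149398825, 1529, "GT-AG",
     "GAG|GTGACTTTAT...TTCTCTTTAACA/TTCTCTTTAACA...AACAG|ATT"),
    (149358491, 149358568, 78, "GT-AG",
     "CGG|GTAAGGGGCA...TCACCCTTCTTC/CGTCAGCTCACC...TCCAG|GTT"),
]


def span_length(start, end):
    """Length in base pairs of the closed interval [start, end]."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs):
    """Assumed layout: flank|intron start ... intron end|flank."""
    intron = scored_motifs.split("|")[1]  # text between the two pipes
    return f"{intron[:2]}-{intron[-2:]}"


# One (length matches, dinucleotides match) pair per row.
checks = [
    (span_length(s, e) == n, terminal_dinucleotides(m) == pair)
    for s, e, n, pair, m in rows
]
```

Start coordinates fall as ordinal_index rises throughout the table, which would be consistent with a transcript annotated on the reverse strand, although the table itself does not record strand.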
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
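To see what the indexes above mean for queries, the sketch below recreates a reduced version of the schema in an in-memory SQLite database and inspects the query plan for two filters. It is an illustration only: the table is empty, only four columns and two of the indexes are recreated, and `plan_details` is a hypothetical helper. The exact plan text varies between SQLite versions, so the code looks only for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE "introns" (
        "id" INTEGER,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        "score" REAL,
        PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
    CREATE INDEX [idx_introns_score] ON [introns] ([score]);
    """
)


def plan_details(conn, sql, params=()):
    """Hypothetical helper: the 'detail' column of EXPLAIN QUERY PLAN."""
    return [row[3] for row in conn.execute("EXPLAIN QUERY PLAN " + sql, params)]


# transcript_id has an index in the schema, so the planner can search it.
indexed = plan_details(
    conn, "SELECT id FROM introns WHERE transcript_id = ?", (22607869,)
)

# ordinal_index has no index in the schema, so this filter scans the table.
unindexed = plan_details(
    conn, "SELECT id FROM introns WHERE ordinal_index = ?", (1,)
)

uses_index = any("idx_introns_transcript_id" in d for d in indexed)
```

Filtering on an indexed column such as transcript_id lets SQLite search the index, while a filter on ordinal_index alone falls back to a full scan. On the real table, pairing an ordinal_index condition with a transcript_id condition would let the planner narrow the rows through the index first.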