introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
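The column descriptions do not say how `start`, `end`, and `length` relate. The rows listed for this transcript are consistent with 1-based, inclusive coordinates, so that `length = end - start + 1`. The sketch below checks that reading against two rows; the helper name `intron_length` is hypothetical, and the coordinate convention is an inference from the data, not something the schema states.

```python
def intron_length(start: int, end: int) -> int:
    """Intron length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# (start, end, length) copied from the rows with ids 122606586 and 122606596.
samples = [
    (149905760, 150000877, 95118),
    (150102648, 150102733, 86),
]

# Collect any row where the assumed convention disagrees with the stored length.
mismatches = [s for s in samples if intron_length(s[0], s[1]) != s[2]]
print(mismatches)  # prints []
```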
33 rows where transcript_id = 22607860
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122606586 | GT-AG | 0 | 1.000000099473604e-05 | 95118 | rna-XM_021196332.2 22607860 | 2 | 149905760 | 150000877 | Mus pahari 10093 | CTG|GTAAGAGTTT...CTTTTCTAAGAA/ACTAGATTAATA...CACAG|ATC | 0 | 1 | 14.722 |
| 122606587 | GT-AG | 0 | 0.0001200723198858 | 31450 | rna-XM_021196332.2 22607860 | 3 | 150001167 | 150032616 | Mus pahari 10093 | AAG|GTAACGTCTG...TCAGTCTTGGTA/AGGGATCTCAGT...TCCAG|AAC | 1 | 1 | 17.712 |
| 122606588 | GT-AG | 0 | 1.000000099473604e-05 | 45552 | rna-XM_021196332.2 22607860 | 4 | 150032934 | 150078485 | Mus pahari 10093 | GAG|GTAAGGGACC...TATGTCTTGAGT/TATGTCTTGAGT...TCCAG|GTG | 0 | 1 | 20.991 |
| 122606589 | GT-AG | 0 | 1.000000099473604e-05 | 3212 | rna-XM_021196332.2 22607860 | 5 | 150078632 | 150081843 | Mus pahari 10093 | CAG|GTAAGGAAGT...TTAGGCTTAAAA/ATGTCATTTATT...AATAG|GTC | 2 | 1 | 22.502 |
| 122606590 | GT-AG | 0 | 1.000000099473604e-05 | 1787 | rna-XM_021196332.2 22607860 | 6 | 150082103 | 150083889 | Mus pahari 10093 | GAG|GTGAGCGGGC...GTATTCTTGAAC/GTATTCTTGAAC...TGTAG|TTT | 0 | 1 | 25.181 |
| 122606591 | GT-AG | 0 | 2.0311735577999216e-05 | 10067 | rna-XM_021196332.2 22607860 | 7 | 150084096 | 150094162 | Mus pahari 10093 | GCA|GTAAGTCCCA...GTATCTTTTTTT/CTTCTGGTTATA...ATCAG|GTT | 2 | 1 | 27.312 |
| 122606592 | GT-AG | 0 | 1.000000099473604e-05 | 2780 | rna-XM_021196332.2 22607860 | 8 | 150094839 | 150097618 | Mus pahari 10093 | CAG|GTAAGGGGTG...GGACCATTAATA/TGGGGATTCACT...TGCAG|GAG | 0 | 1 | 34.306 |
| 122606593 | GT-AG | 0 | 1.000000099473604e-05 | 1076 | rna-XM_021196332.2 22607860 | 9 | 150097802 | 150098877 | Mus pahari 10093 | AGG|GTAGAGTAAA...CAGCCCAGGATC/GGGAGAATAATA...CCCAG|GGA | 0 | 1 | 36.199 |
| 122606594 | GT-AG | 0 | 1.000000099473604e-05 | 738 | rna-XM_021196332.2 22607860 | 10 | 150098996 | 150099733 | Mus pahari 10093 | TGG|GTAATGAATG...TTTCCTTTGATC/TTTCCTTTGATC...TGCAG|AGG | 1 | 1 | 37.42 |
| 122606595 | GC-AG | 0 | 1.000000099473604e-05 | 2682 | rna-XM_021196332.2 22607860 | 11 | 150099843 | 150102524 | Mus pahari 10093 | CAG|GCCAGTCTCT...ACGGCCATATTT/TTGCCAGTGACG...TGCAG|TGC | 2 | 1 | 38.547 |
| 122606596 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_021196332.2 22607860 | 12 | 150102648 | 150102733 | Mus pahari 10093 | CAG|GTAGGACACG...CAGCTCTAAAAG/GTGTTTTCCATT...CTCAG|CAT | 2 | 1 | 39.82 |
| 122606597 | GT-AG | 0 | 1.000000099473604e-05 | 2712 | rna-XM_021196332.2 22607860 | 13 | 150102871 | 150105582 | Mus pahari 10093 | TAG|GTTGGTTTAA...GGCTTCTGAATA/AGGCTTCTGAAT...TTCAG|ATT | 1 | 1 | 41.237 |
| 122606598 | GT-AG | 0 | 0.0526422803005727 | 1585 | rna-XM_021196332.2 22607860 | 14 | 150105822 | 150107406 | Mus pahari 10093 | CAG|GTATCACATG...CTTTTTTTAATT/CTTTTTTTAATT...TGCAG|AAA | 0 | 1 | 43.71 |
| 122606599 | GT-AG | 0 | 0.0004571678464848 | 84 | rna-XM_021196332.2 22607860 | 15 | 150107469 | 150107552 | Mus pahari 10093 | AAG|GTAACTCCTC...GTTGCTCTAACG/GTTGCTCTAACG...TTTAG|GTT | 2 | 1 | 44.351 |
| 122606600 | GT-AG | 0 | 1.000000099473604e-05 | 3714 | rna-XM_021196332.2 22607860 | 16 | 150107719 | 150111432 | Mus pahari 10093 | CAG|GTACAGAGAG...AGGTTTTTATCT/GATTCATTCATT...CACAG|GTC | 0 | 1 | 46.069 |
| 122606601 | GT-AG | 0 | 0.0007121666463862 | 1375 | rna-XM_021196332.2 22607860 | 17 | 150111541 | 150112915 | Mus pahari 10093 | AAG|GTACCGCTTC...CTGACTCTAATG/AGTCTTCTGATG...CACAG|GAA | 0 | 1 | 47.186 |
| 122606602 | GT-AG | 0 | 1.000000099473604e-05 | 1195 | rna-XM_021196332.2 22607860 | 18 | 150113033 | 150114227 | Mus pahari 10093 | AAG|GTTAGCTCCA...TTTCTTTTATAC/CTTTCTTTTATA...TGTAG|AGT | 0 | 1 | 48.396 |
| 122606603 | GT-AG | 0 | 0.006621280852564 | 8892 | rna-XM_021196332.2 22607860 | 19 | 150114387 | 150123278 | Mus pahari 10093 | AAG|GTACTCCTCT...CAGTCTTTAATT/CAGTCTTTAATT...TTCAG|GCT | 0 | 1 | 50.041 |
| 122606604 | GT-AG | 0 | 1.000000099473604e-05 | 2441 | rna-XM_021196332.2 22607860 | 20 | 150123409 | 150125849 | Mus pahari 10093 | ATG|GTAAGAGACA...TTCTCCTTACCT/ATTCTCCTTACC...TGTAG|CAG | 1 | 1 | 51.386 |
| 122606605 | GT-AG | 0 | 1.4061194964916922e-05 | 855 | rna-XM_021196332.2 22607860 | 21 | 150125892 | 150126746 | Mus pahari 10093 | AAG|GTAAACGTCA...CTCTTTTTCACT/CTCTTTTTCACT...TTCAG|ACA | 1 | 1 | 51.821 |
| 122606606 | GT-AG | 0 | 1.000000099473604e-05 | 998 | rna-XM_021196332.2 22607860 | 22 | 150126869 | 150127866 | Mus pahari 10093 | AAG|GTGGGAGATC...GTGTCATTTTCC/AATGGTGTCATT...GAAAG|GTG | 0 | 1 | 53.083 |
| 122606607 | GT-AG | 0 | 1.000000099473604e-05 | 8439 | rna-XM_021196332.2 22607860 | 23 | 150127985 | 150136423 | Mus pahari 10093 | CAG|GTAAGAGTAG...TTTGGCTTATTT/TTTATTTTCATT...AACAG|GCC | 1 | 1 | 54.304 |
| 122606608 | GT-AG | 0 | 1.765814916347299e-05 | 1039 | rna-XM_021196332.2 22607860 | 24 | 150136556 | 150137594 | Mus pahari 10093 | AAG|GTAGTGTTCC...GAAGTTTTGACA/GAAGTTTTGACA...TTTAG|GTT | 1 | 1 | 55.669 |
| 122606609 | GT-AG | 0 | 1.000000099473604e-05 | 5078 | rna-XM_021196332.2 22607860 | 25 | 150137883 | 150142960 | Mus pahari 10093 | ATG|GTAAGGAGCC...GTAATCTTGGCT/TAATGTGTAATC...TGCAG|ACC | 1 | 1 | 58.649 |
| 122606610 | GT-AG | 0 | 1.000000099473604e-05 | 1600 | rna-XM_021196332.2 22607860 | 26 | 150143125 | 150144724 | Mus pahari 10093 | ACT|GTGAGTACGC...GTGACCTTGATC/TTCTCTTTCATC...TTCAG|ATC | 0 | 1 | 60.346 |
| 122606611 | GT-AG | 0 | 1.000000099473604e-05 | 2068 | rna-XM_021196332.2 22607860 | 27 | 150144987 | 150147054 | Mus pahari 10093 | GAG|GTAGAATGAA...GCGTTGTTAACA/GCGTTGTTAACA...TCCAG|GAT | 1 | 1 | 63.056 |
| 122606612 | GT-AG | 0 | 1.000000099473604e-05 | 1101 | rna-XM_021196332.2 22607860 | 28 | 150147174 | 150148274 | Mus pahari 10093 | TTG|GTAATTAAGT...GTGGTCTTCCCT/ATGGAATTCATG...GACAG|ATG | 0 | 1 | 64.287 |
| 122606613 | GC-AG | 0 | 1.000000099473604e-05 | 2427 | rna-XM_021196332.2 22607860 | 29 | 150148404 | 150150830 | Mus pahari 10093 | CAG|GCAAGTCCAC...GTTCTCTTTGCT/ATGAGCCTCATG...TTCAG|GTT | 0 | 1 | 65.622 |
| 122606614 | GT-AG | 0 | 1.1418437999970863e-05 | 7127 | rna-XM_021196332.2 22607860 | 30 | 150151041 | 150158167 | Mus pahari 10093 | GAA|GTAAGTGTAT...TTCACTTTAAAA/CATATTTTCACT...TGCAG|ACC | 0 | 1 | 67.794 |
| 122606615 | GT-AG | 0 | 0.0637206594367727 | 1566 | rna-XM_021196332.2 22607860 | 31 | 150158327 | 150159892 | Mus pahari 10093 | CAG|GTAACCTCTT...CTGACCCTAACT/ATTTGATTCAGC...TTCAG|ACC | 0 | 1 | 69.439 |
| 122606616 | GT-AG | 0 | 8.727417086296541e-05 | 900 | rna-XM_021196332.2 22607860 | 32 | 150160112 | 150161011 | Mus pahari 10093 | CAG|GTAACGTGGA...TTTTTTTTTTCT/CCTGGATTTACC...CACAG|GCA | 0 | 1 | 71.705 |
| 122621588 | GT-AG | 0 | 1.000000099473604e-05 | 38880 | rna-XM_021196332.2 22607860 | 1 | 149865339 | 149904218 | Mus pahari 10093 | AAG|GTGAGCTTGA...AATGCCATAATG/TAATGTTTGACA...TACAG|GGC | | 0 | 2.338 |
| 122621589 | GT-AG | 0 | 1.000000099473604e-05 | 1671 | rna-XM_021196332.2 22607860 | 33 | 150161235 | 150162905 | Mus pahari 10093 | AAG|GTACTGTAGC...TGCTTCTTCATG/AATACACTAATT...TTCAG|GGG | | 0 | 74.012 |
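The view above is the `introns` table filtered on `transcript_id = 22607860` and sorted by `id`, which places the first and last introns (ordinal 1 and 33) at the bottom. A minimal sketch of the same filter, ordered by `ordinal_index` instead, is shown here against an in-memory SQLite database. It loads only a reduced set of columns and three rows copied from the table; the function name `introns_for_transcript` and the extra row for transcript `1` are hypothetical and exist only to exercise the filter.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced version of the introns table: only the columns used in this sketch.
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, dinucleotide_pair TEXT, length INTEGER, "
    "transcript_id INTEGER, ordinal_index INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        # Copied from the table above.
        (122606586, "GT-AG", 95118, 22607860, 2),
        (122606595, "GC-AG", 2682, 22607860, 11),
        (122621588, "GT-AG", 38880, 22607860, 1),
        # HYPOTHETICAL row for another transcript, not real data.
        (1, "GT-AG", 100, 1, 1),
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return the introns of one transcript in transcript order."""
    cur = conn.execute(
        "SELECT id, ordinal_index, dinucleotide_pair, length "
        "FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


result = introns_for_transcript(conn, 22607860)
for row in result:
    print(row)
```

Passing the transcript id as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with untrusted input.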
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
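The indexes above suggest that lookups by `transcript_id` do not need a full table scan. One way to check this, sketched here under the assumption that the database is SQLite (as the bracketed identifier syntax implies), is to ask the planner with `EXPLAIN QUERY PLAN`. The table below is a cut-down stand-in with only three columns, so the plan on the real database may differ in detail.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Cut-down stand-in for the real schema: just enough to ask the planner.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        is_minor INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (22607860,),
).fetchall()

# The human-readable description is the fourth column of each plan row.
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

If the index is being used, the printed plan mentions `idx_introns_transcript_id`; the exact wording varies between SQLite versions.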