introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, 1 if the intron is a minor intron, 0 otherwise
- score: REAL, probability that the intron is minor, expressed as a percentage (0-100)
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron within the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, 1 if the intron is within the coding sequence, 0 otherwise (e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
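The column descriptions can be mirrored as a small record type. The sketch below is illustrative only: the `Intron` class and the `length_matches_coordinates` helper are hypothetical names, not part of the database. It also checks one relationship that holds in the rows shown on this page, namely that `length` equals `end - start + 1`, which suggests inclusive coordinates. That is an inference from the sample rows, not something the schema states.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Intron:
    """Hypothetical record mirroring a subset of the introns columns."""
    id: int
    dinucleotide_pair: str
    is_minor: int            # 1 = minor intron, 0 = not
    score: float             # percentage, 0-100
    length: int              # base pairs
    ordinal_index: int
    start: int
    end: int
    phase: Optional[int]     # None for introns outside the coding sequence
    in_cds: int


def length_matches_coordinates(intron: Intron) -> bool:
    # Assumption inferred from the sample rows: coordinates are inclusive,
    # so the length is end - start + 1.
    return intron.length == intron.end - intron.start + 1


# Two rows copied from the table for transcript_id = 22607881.
first = Intron(122607227, "GT-AG", 0, 1.000000099473604e-05, 53192,
               1, 104103288, 104156479, 1, 1)
minor = Intron(122607267, "GT-AG", 1, 99.99986273144084, 116,
               41, 104571739, 104571854, 1, 1)

for intron in (first, minor):
    print(intron.id, length_matches_coordinates(intron))
```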
51 rows where transcript_id = 22607881
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122607227 | GT-AG | 0 | 1.000000099473604e-05 | 53192 | rna-XM_021220207.2 22607881 | 1 | 104103288 | 104156479 | Mus pahari 10093 | TGG\|GTGAGCAGCG...TCATTTCTACCT/AAAATAATCATT...CACAG\|CCT | 1 | 1 | 0.813 |
| 122607228 | GT-AG | 0 | 1.000000099473604e-05 | 6227 | rna-XM_021220207.2 22607881 | 2 | 104156564 | 104162790 | Mus pahari 10093 | AAG\|GTAAGTGTGC...CTATCCTGTGCT/CCTGTGCTAACC...TTTAG\|GGT | 1 | 1 | 2.296 |
| 122607229 | GT-AG | 0 | 1.000000099473604e-05 | 3840 | rna-XM_021220207.2 22607881 | 3 | 104162832 | 104166671 | Mus pahari 10093 | AAG\|GTAAGTCTTT...AACGCTTTTTCT/TTTGCAGTCAGT...CCAAG\|GGT | 0 | 1 | 3.021 |
| 122607230 | GT-AG | 0 | 1.000000099473604e-05 | 3454 | rna-XM_021220207.2 22607881 | 4 | 104166728 | 104170181 | Mus pahari 10093 | AGG\|GTGAGTGTTG...GCTTCTTTCTCC/AATGAAGTCACA...CCCAG\|GCA | 2 | 1 | 4.01 |
| 122607231 | GT-AG | 0 | 1.000000099473604e-05 | 3306 | rna-XM_021220207.2 22607881 | 5 | 104170279 | 104173584 | Mus pahari 10093 | GTG\|GTAAGAAACT...TGTTTTTTAAAT/TGTTTTTTAAAT...TGTAG\|CAA | 0 | 1 | 5.723 |
| 122607232 | GT-AG | 0 | 0.0001206802300283 | 3013 | rna-XM_021220207.2 22607881 | 6 | 104173734 | 104176746 | Mus pahari 10093 | CAG\|GTTTGTTTGT...CCTCCCTTGGTT/CCTTGGTTCATG...TCTAG\|AAT | 2 | 1 | 8.355 |
| 122607233 | GT-AG | 0 | 0.0001450509639043 | 605 | rna-XM_021220207.2 22607881 | 7 | 104176883 | 104177487 | Mus pahari 10093 | AAA\|GTAGGTTCCA...CTCACATTGACT/AGGTTGCTCACA...TTCAG\|TCT | 0 | 1 | 10.758 |
| 122607234 | GT-AG | 0 | 1.000000099473604e-05 | 1342 | rna-XM_021220207.2 22607881 | 8 | 104177646 | 104178987 | Mus pahari 10093 | CAG\|GTAGGTGACT...TTTTTTTTTTCT/GGGAGTTTAATT...TCAAG\|TGA | 2 | 1 | 13.549 |
| 122607235 | GT-AG | 0 | 1.000000099473604e-05 | 405 | rna-XM_021220207.2 22607881 | 9 | 104179070 | 104179474 | Mus pahari 10093 | ACG\|GTGAGTGCAC...CTTGTGTTGAAT/CTTGTGTTGAAT...TCTAG\|GAT | 0 | 1 | 14.997 |
| 122607236 | GT-AG | 0 | 1.000000099473604e-05 | 7284 | rna-XM_021220207.2 22607881 | 10 | 104179611 | 104186894 | Mus pahari 10093 | CTG\|GTAAATACTC...TTTTTGTTATTT/CTTTTTGTTATT...CCCAG\|TGA | 1 | 1 | 17.4 |
| 122607237 | GT-AG | 0 | 4.560543791901168e-05 | 3383 | rna-XM_021220207.2 22607881 | 11 | 104186968 | 104190350 | Mus pahari 10093 | GGC\|GTAAGTATGA...TGCCCCTTTGTC/GGTTGCCCCATG...TCCAG\|GCT | 2 | 1 | 18.689 |
| 122607238 | GT-AG | 0 | 4.429067572592335e-05 | 4024 | rna-XM_021220207.2 22607881 | 12 | 104190494 | 104194517 | Mus pahari 10093 | AAG\|GTGTGTATCT...TTGTCTTTAGCC/ATTGTCTTTAGC...TCCAG\|GTT | 1 | 1 | 21.215 |
| 122607239 | GT-AG | 0 | 1.000000099473604e-05 | 3701 | rna-XM_021220207.2 22607881 | 13 | 104194644 | 104198344 | Mus pahari 10093 | CAG\|GTAAGGTTAG...TTGCTATTAAAG/AAAGATATAATT...CCCAG\|GTG | 1 | 1 | 23.441 |
| 122607240 | GT-AG | 0 | 4.0468397910708624 | 1303 | rna-XM_021220207.2 22607881 | 14 | 104198470 | 104199772 | Mus pahari 10093 | GAG\|GTATCTCTTT...GCCACCTTAGCT/TCTTTTTTTATG...TAAAG\|CAC | 0 | 1 | 25.649 |
| 122607241 | GT-AG | 0 | 1.000000099473604e-05 | 881 | rna-XM_021220207.2 22607881 | 15 | 104199872 | 104200752 | Mus pahari 10093 | AAG\|GTATTGGATT...CCCCATTTGATT/TAGGTACTCACG...TCCAG\|GTG | 0 | 1 | 27.398 |
| 122607242 | GT-AG | 0 | 1.000000099473604e-05 | 4966 | rna-XM_021220207.2 22607881 | 16 | 104200826 | 104205791 | Mus pahari 10093 | ACT\|GTGAGTAGTG...TTTGTCCTGACT/TTTGTCCTGACT...TGCAG\|CCA | 1 | 1 | 28.688 |
| 122607243 | GT-AG | 0 | 1.000000099473604e-05 | 718 | rna-XM_021220207.2 22607881 | 17 | 104205896 | 104206613 | Mus pahari 10093 | AAG\|GTGCGCACAT...TGTTGTTTGACA/TGTTGTTTGACA...CACAG\|GCT | 0 | 1 | 30.525 |
| 122607244 | GT-AG | 0 | 1.000000099473604e-05 | 4813 | rna-XM_021220207.2 22607881 | 18 | 104206798 | 104211610 | Mus pahari 10093 | ACG\|GTGAGTTCTA...ATGTCCTTATTG/AATGTCCTTATT...TGCAG\|TGG | 1 | 1 | 33.775 |
| 122607245 | GT-AG | 0 | 7.063904333093662e-05 | 4699 | rna-XM_021220207.2 22607881 | 19 | 104211709 | 104216407 | Mus pahari 10093 | AAG\|GTAACATGGG...AACCTCTTGTCT/AGTTCAGTAACC...TGCAG\|TTC | 0 | 1 | 35.506 |
| 122607246 | GT-AG | 0 | 1.000000099473604e-05 | 308 | rna-XM_021220207.2 22607881 | 20 | 104216498 | 104216805 | Mus pahari 10093 | CTG\|GTAAGGGGCC...AATGTTTTGGTT/GTCCCACTCATG...GATAG\|GTC | 0 | 1 | 37.096 |
| 122607247 | GT-AG | 0 | 1.45234906322488e-05 | 6397 | rna-XM_021220207.2 22607881 | 21 | 104216907 | 104223303 | Mus pahari 10093 | CAC\|GTGAGTTCTC...GAATCCTTGATT/CCTTGATTGACA...ACCAG\|GAA | 2 | 1 | 38.88 |
| 122607248 | GT-AG | 0 | 1.000000099473604e-05 | 14336 | rna-XM_021220207.2 22607881 | 22 | 104223439 | 104237774 | Mus pahari 10093 | CCA\|GTAAGTGTTC...GGGTCTTTTCCT/CTTTTCCTCAGG...TGCAG\|GCT | 2 | 1 | 41.265 |
| 122607249 | GT-AG | 0 | 1.000000099473604e-05 | 45739 | rna-XM_021220207.2 22607881 | 23 | 104237884 | 104283622 | Mus pahari 10093 | AAG\|GTAGGTGCTA...CCTACCTGAATT/ATTCTGCTCACC...TGCAG\|GGG | 0 | 1 | 43.19 |
| 122607250 | GT-AG | 0 | 5.007665581871892e-05 | 3271 | rna-XM_021220207.2 22607881 | 24 | 104283694 | 104286964 | Mus pahari 10093 | CAG\|GTAAGCTTCC...TGCTTCTGATCA/CTGCTTCTGATC...TATAG\|CAA | 2 | 1 | 44.444 |
| 122607251 | GT-AG | 0 | 1.000000099473604e-05 | 22622 | rna-XM_021220207.2 22607881 | 25 | 104287072 | 104309693 | Mus pahari 10093 | ATG\|GTGAGTGGAT...TTCCCACTGACT/CACTGACTGACA...TCCAG\|ACT | 1 | 1 | 46.335 |
| 122607252 | GT-AG | 0 | 1.000000099473604e-05 | 2051 | rna-XM_021220207.2 22607881 | 26 | 104309822 | 104311872 | Mus pahari 10093 | GTG\|GTGAGTCATG...TGGCTTTTATCG/TTGGCTTTTATC...TCCAG\|GGA | 0 | 1 | 48.596 |
| 122607253 | GT-AG | 0 | 1.000000099473604e-05 | 108806 | rna-XM_021220207.2 22607881 | 27 | 104311969 | 104420774 | Mus pahari 10093 | ATT\|GTAAGTGCCC...TCTTTCCTATCT/TCCTATCTAATT...TACAG\|GGG | 0 | 1 | 50.291 |
| 122607254 | GT-AG | 0 | 1.000000099473604e-05 | 8428 | rna-XM_021220207.2 22607881 | 28 | 104420877 | 104429304 | Mus pahari 10093 | GTG\|GTGAGTATCT...AGCTTCGTCACT/AGCTTCGTCACT...TGCAG\|GAT | 0 | 1 | 52.093 |
| 122607255 | GT-AG | 0 | 1.8717206690691584e-05 | 80894 | rna-XM_021220207.2 22607881 | 29 | 104429400 | 104510293 | Mus pahari 10093 | CAA\|GTAAGTTCAG...TTTCTCTTGGTC/GAGTGTGTAATT...TTCAG\|AGT | 2 | 1 | 53.771 |
| 122607256 | GT-AG | 0 | 1.000000099473604e-05 | 4165 | rna-XM_021220207.2 22607881 | 30 | 104510373 | 104514537 | Mus pahari 10093 | CAG\|GTAAAAGGAG...TCAGCCTTATCT/CTCAGCCTTATC...TTCAG\|CTG | 0 | 1 | 55.167 |
| 122607257 | GT-AG | 0 | 1.000000099473604e-05 | 9603 | rna-XM_021220207.2 22607881 | 31 | 104514639 | 104524241 | Mus pahari 10093 | CAA\|GTAAGTCCTC...AGGTCTGTAAAC/AGGTCTGTAAAC...TCTAG\|GTA | 2 | 1 | 56.951 |
| 122607258 | GT-AG | 0 | 1.000000099473604e-05 | 8313 | rna-XM_021220207.2 22607881 | 32 | 104524301 | 104532613 | Mus pahari 10093 | TGG\|GTGAGTAGCT...CATTTCTTGATT/CATTTCTTGATT...GAAAG\|GTC | 1 | 1 | 57.993 |
| 122607259 | GT-AG | 0 | 1.000000099473604e-05 | 9429 | rna-XM_021220207.2 22607881 | 33 | 104532763 | 104542191 | Mus pahari 10093 | ATG\|GTAAGAGTCT...TTATTTATACTT/TAATTATTTATA...TATAG\|TTT | 0 | 1 | 60.625 |
| 122607260 | GT-AG | 0 | 1.000000099473604e-05 | 184 | rna-XM_021220207.2 22607881 | 34 | 104542278 | 104542461 | Mus pahari 10093 | AAT\|GTAAGTGTTG...GTTTCTTTCTTG/TATGTAATTATT...TTCAG\|CCT | 2 | 1 | 62.144 |
| 122607261 | GT-AG | 0 | 1.000000099473604e-05 | 4313 | rna-XM_021220207.2 22607881 | 35 | 104542619 | 104546931 | Mus pahari 10093 | CTG\|GTGAGCGAAA...TTTTTTTTAAAC/TTTTTTTTAAAC...TTCAG\|AAT | 0 | 1 | 64.918 |
| 122607262 | GT-AG | 0 | 5.4625589936951706e-05 | 1417 | rna-XM_021220207.2 22607881 | 36 | 104546973 | 104548389 | Mus pahari 10093 | AAG\|GTATGTGGAT...GAGTCCTTAAGA/TCCTATTTCATC...ACCAG\|GTA | 2 | 1 | 65.642 |
| 122607263 | GT-AG | 0 | 1.000000099473604e-05 | 3588 | rna-XM_021220207.2 22607881 | 37 | 104548481 | 104552068 | Mus pahari 10093 | AAG\|GTAAGAAAAA...CATTCCTTGTTT/GTATTGTTCATT...CACAG\|TGG | 0 | 1 | 67.25 |
| 122607264 | GT-AG | 0 | 1.000000099473604e-05 | 14552 | rna-XM_021220207.2 22607881 | 38 | 104552189 | 104566740 | Mus pahari 10093 | AAG\|GTAAGACACA...TCTCTCTTCTCT/CTCTGTGTCACT...CACAG\|ATG | 0 | 1 | 69.369 |
| 122607265 | GT-AG | 0 | 1.000000099473604e-05 | 781 | rna-XM_021220207.2 22607881 | 39 | 104566831 | 104567611 | Mus pahari 10093 | CTG\|GTCAGTCCTG...CAACCCTGAAAT/TGAGTTTTCATT...TCCAG\|AAA | 0 | 1 | 70.959 |
| 122607266 | GT-AG | 0 | 1.709690642708268e-05 | 3880 | rna-XM_021220207.2 22607881 | 40 | 104567717 | 104571596 | Mus pahari 10093 | CGG\|GTAAGTTTGA...TGTTATTTAACA/TGTTATTTAACA...TCCAG\|GGG | 0 | 1 | 72.814 |
| 122607267 | GT-AG | 1 | 99.99986273144084 | 116 | rna-XM_021220207.2 22607881 | 41 | 104571739 | 104571854 | Mus pahari 10093 | AGC\|GTATCCTTTA...TCTTCCTTAACC/TCTTCCTTAACC...CTCAG\|ACA | 1 | 1 | 75.322 |
| 122607268 | GT-AG | 0 | 0.0003955018614256 | 1956 | rna-XM_021220207.2 22607881 | 42 | 104571934 | 104573889 | Mus pahari 10093 | AAG\|GTAACCACGC...TTCTGGTTAAAA/AAAAGTCTAATG...CACAG\|TTT | 2 | 1 | 76.718 |
| 122607269 | GT-AG | 0 | 1.000000099473604e-05 | 3318 | rna-XM_021220207.2 22607881 | 43 | 104573975 | 104577292 | Mus pahari 10093 | GCG\|GTAATTAAAA...ACCCTCCTGACA/ACCCTCCTGACA...TACAG\|AAT | 0 | 1 | 78.219 |
| 122607270 | GT-AG | 0 | 1.000000099473604e-05 | 1572 | rna-XM_021220207.2 22607881 | 44 | 104577380 | 104578951 | Mus pahari 10093 | ATG\|GTAAGAATAC...ATCTTCTTTTCC/GTGATGCCCACC...TGCAG\|GTA | 0 | 1 | 79.756 |
| 122607271 | GC-AG | 0 | 1.000000099473604e-05 | 833 | rna-XM_021220207.2 22607881 | 45 | 104579129 | 104579961 | Mus pahari 10093 | AAG\|GCACGTGACA...AGTGCCATGATA/CATGTGCTAAAA...CACAG\|GCT | 0 | 1 | 82.883 |
| 122607272 | GT-AG | 0 | 1.000000099473604e-05 | 5098 | rna-XM_021220207.2 22607881 | 46 | 104580046 | 104585143 | Mus pahari 10093 | CAG\|GTTTGTATAT...CCCTCCCAAATC/AGGAACATCACC...TTCAG\|ATT | 0 | 1 | 84.367 |
| 122607273 | GT-AG | 0 | 4.2414271876505424e-05 | 6070 | rna-XM_021220207.2 22607881 | 47 | 104585282 | 104591351 | Mus pahari 10093 | ATG\|GTAAGCGGGC...TGGGCCTTAACA/AGGGATCTGACC...CTCAG\|CCC | 0 | 1 | 86.804 |
| 122607274 | GT-AG | 0 | 1.000000099473604e-05 | 4735 | rna-XM_021220207.2 22607881 | 48 | 104591498 | 104596232 | Mus pahari 10093 | TGG\|GTGAGTGGGT...ATTACTTTAGAT/CATTACTTTAGA...TTTAG\|ATT | 2 | 1 | 89.384 |
| 122607275 | GT-AG | 0 | 2.986300675922144e-05 | 3650 | rna-XM_021220207.2 22607881 | 49 | 104596432 | 104600081 | Mus pahari 10093 | ACG\|GTAATTTAAC...CAAACCTTCACT/CAAACCTTCACT...TGTAG\|ATA | 0 | 1 | 92.899 |
| 122607276 | GT-AG | 0 | 1.000000099473604e-05 | 1115 | rna-XM_021220207.2 22607881 | 50 | 104600236 | 104601350 | Mus pahari 10093 | CCA\|GTAAGTGGAA...CCCTCTATAACC/GACTTTGTGATC...TCCAG\|GTC | 1 | 1 | 95.619 |
| 122607277 | GT-AG | 0 | 1.000000099473604e-05 | 3924 | rna-XM_021220207.2 22607881 | 51 | 104601503 | 104605426 | Mus pahari 10093 | CGG\|GTAAGTGGGA...GGTTTCTTCATA/GGTTTCTTCATA...CCTAG\|CAA | 0 | 1 | 98.304 |
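In the rows above, each `scored_motifs` value appears to have three `|`-separated parts: a short upstream flank, the intron-side motifs, and a short downstream flank. That reading is an assumption based on the visible values, not a documented format. Under it, the first and last two bases of the middle part should reproduce `dinucleotide_pair`. The helper name `terminal_dinucleotides` below is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a dinucleotide pair such as 'GT-AG' from a scored_motifs string.

    Assumes the layout 'upstream|intron motifs|downstream' seen in the sample
    rows; raises ValueError if the string does not split into three parts.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron_part = parts[1]
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# Values copied from ordinal_index 41 (the minor intron) and 45 (GC-AG).
minor_motifs = "AGC|GTATCCTTTA...TCTTCCTTAACC/TCTTCCTTAACC...CTCAG|ACA"
gc_ag_motifs = "AAG|GCACGTGACA...AGTGCCATGATA/CATGTGCTAAAA...CACAG|GCT"

print(terminal_dinucleotides(minor_motifs))
print(terminal_dinucleotides(gc_ag_motifs))
```

Comparing the derived pair with the stored `dinucleotide_pair` column is a cheap sanity check when loading an export of this table.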
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
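As a rough illustration of how this schema supports the `transcript_id = 22607881` filter used on this page, the sketch below recreates the table and two of its indexes in an in-memory SQLite database using Python's standard `sqlite3` module. It loads three rows copied from the table (with `scored_motifs` left as NULL for brevity) and runs a filtered query. The referenced `transcripts` and `genomes` tables are not created, so foreign key enforcement is switched off explicitly; this is a toy setup, not the real database.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
  "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
  "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER,
  "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
"""

conn = sqlite3.connect(":memory:")
# The parent tables are absent in this sketch, so keep enforcement off.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(SCHEMA)

# Three rows copied from the page; scored_motifs omitted (None) for brevity.
rows = [
    (122607266, "GT-AG", 0, 1.709690642708268e-05, 3880, 22607881, 40,
     104567717, 104571596, 10093, None, 0, 1, 72.814),
    (122607267, "GT-AG", 1, 99.99986273144084, 116, 22607881, 41,
     104571739, 104571854, 10093, None, 1, 1, 75.322),
    (122607271, "GC-AG", 0, 1.000000099473604e-05, 833, 22607881, 45,
     104579129, 104579961, 10093, None, 0, 1, 82.883),
]
conn.executemany(
    "INSERT INTO introns VALUES (?,?,?,?,?,?,?,?,?,?,?,?,?,?)", rows
)

# Minor introns of one transcript, ordered by position in the transcript.
minor_introns = conn.execute(
    "SELECT ordinal_index, length FROM introns "
    "WHERE transcript_id = ? AND is_minor = 1 ORDER BY ordinal_index",
    (22607881,),
).fetchall()
print(minor_introns)

# Ask SQLite how it would resolve the transcript filter; the last column
# of each plan row holds the human-readable detail text.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (22607881,),
).fetchall()
plan_text = " ".join(str(row[-1]) for row in plan)
print(plan_text)
```

The exact wording of the query plan differs between SQLite versions, so it is printed rather than compared against a fixed string.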