introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
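The column notes above can be mirrored as a typed record. The following is a minimal sketch, not part of the database itself: the class name `IntronRow` and the `problems` helper are hypothetical, and the range checks are only one reading of the column descriptions.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class IntronRow:
    """One row of the introns table (hypothetical helper type)."""
    id: int
    dinucleotide_pair: str
    is_minor: int              # 1 = minor intron, 0 = not
    score: float               # probability of being minor, in percent
    length: int                # base pairs
    transcript_id: int
    ordinal_index: int         # 1 for the first intron, 2 for the second, ...
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]       # 0, 1, 2, or None outside the coding sequence
    in_cds: int                # 1 = within the coding sequence, 0 = not
    relative_position: float   # percent of coding length

    def problems(self) -> List[str]:
        """List values that fall outside the documented ranges."""
        found = []
        if self.is_minor not in (0, 1):
            found.append("is_minor must be 0 or 1")
        if not 0.0 <= self.score <= 100.0:
            found.append("score must lie in 0-100")
        if self.phase not in (None, 0, 1, 2):
            found.append("phase must be 0, 1, 2, or None")
        if self.in_cds not in (0, 1):
            found.append("in_cds must be 0 or 1")
        if self.ordinal_index < 1:
            found.append("ordinal_index starts at 1")
        return found


# Values copied from the first intron of transcript 32191379 on this page.
first = IntronRow(
    id=179732591, dinucleotide_pair="GT-AG", is_minor=0,
    score=8.994195670891188e-05, length=28364, transcript_id=32191379,
    ordinal_index=1, start=893308633, end=893336996, taxonomy_id=7009,
    scored_motifs="AAG|GTAACAAGCT...ATTTTCTTAAAT/TATTTTCTTAAA...TCCAG|GCA",
    phase=1, in_cds=1, relative_position=11.412,
)
print(first.problems())
```

`phase` is typed as `Optional[int]` because the notes allow null for introns outside the coding sequence.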
38 rows where transcript_id = 32191379
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179732591 | GT-AG | 0 | 8.994195670891188e-05 | 28364 | rna-XM_047142958.1 32191379 | 1 | 893308633 | 893336996 | Schistocerca americana 7009 | AAG\|GTAACAAGCT...ATTTTCTTAAAT/TATTTTCTTAAA...TCCAG\|GCA | 1 | 1 | 11.412 |
| 179732592 | GT-AG | 0 | 1.4837336629216656e-05 | 5883 | rna-XM_047142958.1 32191379 | 2 | 893337158 | 893343040 | Schistocerca americana 7009 | AAG\|GTATAAGCTG...TGTTTTTTATAA/TTGTTTTTTATA...TATAG\|GTG | 0 | 1 | 13.227 |
| 179732593 | GT-AG | 0 | 0.0001523155445052 | 6223 | rna-XM_047142958.1 32191379 | 3 | 893343163 | 893349385 | Schistocerca americana 7009 | CGT\|GTAAGTATTA...ATATTTTTACTA/TTTTTACTAATA...CATAG\|AGA | 2 | 1 | 14.603 |
| 179732594 | GT-AG | 0 | 1.000000099473604e-05 | 405 | rna-XM_047142958.1 32191379 | 4 | 893349595 | 893349999 | Schistocerca americana 7009 | ATG\|GTAAGAGAGA...TAAGATTTAACA/TAAGATTTAACA...TTCAG\|CTG | 1 | 1 | 16.96 |
| 179732595 | GT-AG | 0 | 1.000000099473604e-05 | 534 | rna-XM_047142958.1 32191379 | 5 | 893350200 | 893350733 | Schistocerca americana 7009 | GTT\|GTAAGTAACG...AGTTCTGCAACA/CACAAACTGATA...TACAG\|CAA | 0 | 1 | 19.215 |
| 179732596 | GT-AG | 0 | 1.000000099473604e-05 | 4510 | rna-XM_047142958.1 32191379 | 6 | 893350876 | 893355385 | Schistocerca americana 7009 | AAG\|GTAATAAGAA...TCTATCTAAACT/TTCTATCTAAAC...TTCAG\|TGA | 1 | 1 | 20.816 |
| 179732597 | GT-AG | 0 | 1.000000099473604e-05 | 18386 | rna-XM_047142958.1 32191379 | 7 | 893355491 | 893373876 | Schistocerca americana 7009 | ATG\|GTGAGTTGTA...ATACCTTTACCT/GATATACTTATC...TACAG\|GAA | 1 | 1 | 22.0 |
| 179732598 | GT-AG | 0 | 0.0003050645080369 | 923 | rna-XM_047142958.1 32191379 | 8 | 893374084 | 893375006 | Schistocerca americana 7009 | CAG\|GTATTTCGTT...AAATGTTTAATT/AAATGTTTAATT...TTTAG\|GCA | 1 | 1 | 24.335 |
| 179732599 | GT-AG | 0 | 1.000000099473604e-05 | 14668 | rna-XM_047142958.1 32191379 | 9 | 893375190 | 893389857 | Schistocerca americana 7009 | ATG\|GTAAGTACGT...TATGTTATGATA/TATGTTATGATA...TACAG\|GTA | 1 | 1 | 26.398 |
| 179732600 | GT-AG | 0 | 0.0001210494245964 | 3768 | rna-XM_047142958.1 32191379 | 10 | 893390080 | 893393847 | Schistocerca americana 7009 | TAG\|GTAAGCTACT...TTGTTTTTAATT/TTGTTTTTAATT...TTCAG\|GGC | 1 | 1 | 28.902 |
| 179732601 | GT-AG | 0 | 1.000000099473604e-05 | 3563 | rna-XM_047142958.1 32191379 | 11 | 893393997 | 893397559 | Schistocerca americana 7009 | CAG\|GTCAGTATGA...ATATATTTATTA/AATATATTTATT...TCCAG\|GTG | 0 | 1 | 30.582 |
| 179732602 | GT-AG | 0 | 0.0064434468812728 | 668 | rna-XM_047142958.1 32191379 | 12 | 893397705 | 893398372 | Schistocerca americana 7009 | AAG\|GTATCACACA...TGTTTCTAAATT/TTGTTTCTAAAT...TCTAG\|ATG | 1 | 1 | 32.217 |
| 179732603 | GT-AG | 0 | 1.000000099473604e-05 | 2729 | rna-XM_047142958.1 32191379 | 13 | 893398625 | 893401353 | Schistocerca americana 7009 | GAG\|GTGAGATTTG...ATTTTTTTATCT/AATTTTTTTATC...TTTAG\|TTG | 1 | 1 | 35.059 |
| 179732604 | GT-AG | 0 | 0.000251214874054 | 2627 | rna-XM_047142958.1 32191379 | 14 | 893401539 | 893404165 | Schistocerca americana 7009 | AAT\|GTAAGTTTTT...GAGTTCTTATGC/TGAGTTCTTATG...TCCAG\|CAA | 0 | 1 | 37.145 |
| 179732605 | GT-AG | 0 | 1.000000099473604e-05 | 35301 | rna-XM_047142958.1 32191379 | 15 | 893404683 | 893439983 | Schistocerca americana 7009 | AAG\|GTAAGTAATA...TGTGTTTTACTG/TTGTGTTTTACT...TTTAG\|CAG | 1 | 1 | 42.975 |
| 179732606 | GT-AG | 0 | 1.000000099473604e-05 | 6265 | rna-XM_047142958.1 32191379 | 16 | 893440430 | 893446694 | Schistocerca americana 7009 | AAG\|GTAATGTATA...ATATTTTTGGTT/GCAGTTATCATT...GACAG\|AAT | 0 | 1 | 48.004 |
| 179732607 | GT-AG | 0 | 0.0007339140657012 | 11264 | rna-XM_047142958.1 32191379 | 17 | 893446895 | 893458158 | Schistocerca americana 7009 | AAG\|GTAACATTTC...ACATTTTTATTG/AACATTTTTATT...TGCAG\|TGG | 2 | 1 | 50.259 |
| 179732608 | GT-AG | 0 | 1.000000099473604e-05 | 2161 | rna-XM_047142958.1 32191379 | 18 | 893458559 | 893460719 | Schistocerca americana 7009 | AAA\|GTGAGTTTCT...CATTACTTACTT/CCATTACTTACT...TTAAG\|CCG | 0 | 1 | 54.77 |
| 179732609 | GT-AG | 0 | 1.000000099473604e-05 | 2536 | rna-XM_047142958.1 32191379 | 19 | 893460867 | 893463402 | Schistocerca americana 7009 | CAG\|GTTAATTTTA...ATTACCTTGCTT/CTTAGTATAATT...TCTAG\|GTC | 0 | 1 | 56.428 |
| 179732610 | GT-AG | 0 | 1.000000099473604e-05 | 10571 | rna-XM_047142958.1 32191379 | 20 | 893463566 | 893474136 | Schistocerca americana 7009 | GAG\|GTAAGTAGCC...CTGTTTTTATGT/TCTGTTTTTATG...TACAG\|AAG | 1 | 1 | 58.266 |
| 179732611 | GT-AG | 0 | 0.0095511150719127 | 22239 | rna-XM_047142958.1 32191379 | 21 | 893474273 | 893496511 | Schistocerca americana 7009 | CAG\|GTATACACAA...TTCTTTTTGATC/TTCTTTTTGATC...ATCAG\|GTT | 2 | 1 | 59.799 |
| 179732612 | GC-AG | 0 | 1.000000099473604e-05 | 14634 | rna-XM_047142958.1 32191379 | 22 | 893496766 | 893511399 | Schistocerca americana 7009 | AAG\|GCAAGTCTGA...AAATGCTTGATG/GTTTTTCTAATT...TGCAG\|TGC | 1 | 1 | 62.664 |
| 179732613 | GT-AG | 0 | 1.000000099473604e-05 | 12246 | rna-XM_047142958.1 32191379 | 23 | 893511565 | 893523810 | Schistocerca americana 7009 | AAG\|GTAATTTAAT...AATTTTTTACTA/TAATTTTTTACT...TGAAG\|GAG | 1 | 1 | 64.524 |
| 179732614 | GT-AG | 0 | 1.000000099473604e-05 | 13147 | rna-XM_047142958.1 32191379 | 24 | 893523925 | 893537071 | Schistocerca americana 7009 | CAG\|GTAATAATTT...TGTGTTTTAAAT/TGTGTTTTAAAT...CAAAG\|GGA | 1 | 1 | 65.81 |
| 179732615 | GT-AG | 0 | 2.2461599607323164e-05 | 1744 | rna-XM_047142958.1 32191379 | 25 | 893537150 | 893538893 | Schistocerca americana 7009 | CTG\|GTAAGTTATT...ATTTTCTTACTT/TATTTTCTTACT...TTTAG\|GTC | 1 | 1 | 66.689 |
| 179732616 | GT-AG | 0 | 0.002718554275422 | 6015 | rna-XM_047142958.1 32191379 | 26 | 893538971 | 893544985 | Schistocerca americana 7009 | CAG\|GTATCTGTGC...AGAAAATTAGTG/AAGAAAATTAGT...TACAG\|GTT | 0 | 1 | 67.558 |
| 179732617 | GT-AG | 0 | 1.000000099473604e-05 | 5028 | rna-XM_047142958.1 32191379 | 27 | 893545148 | 893550175 | Schistocerca americana 7009 | CAG\|GTAATACAAA...TTTTCTTTATTT/GTTTTCTTTATT...TACAG\|ACT | 0 | 1 | 69.384 |
| 179732618 | GT-AG | 0 | 1.000000099473604e-05 | 1252 | rna-XM_047142958.1 32191379 | 28 | 893550326 | 893551577 | Schistocerca americana 7009 | CAG\|GTAATGTATT...TGTTATTTAATA/CTGTTATTTAAT...AATAG\|GTG | 0 | 1 | 71.076 |
| 179732619 | GT-AG | 0 | 1.000000099473604e-05 | 5341 | rna-XM_047142958.1 32191379 | 29 | 893551917 | 893557257 | Schistocerca americana 7009 | CAG\|GTAATGCCTG...TGCTTCTGAAAA/AGTTGTCTGATA...GACAG\|ACT | 0 | 1 | 74.899 |
| 179732620 | GT-AG | 0 | 1.000000099473604e-05 | 2217 | rna-XM_047142958.1 32191379 | 30 | 893557598 | 893559814 | Schistocerca americana 7009 | AAG\|GTAAGAAAAT...TGGTTGTTAAAA/AAGAATTTTATG...TTCAG\|CAC | 1 | 1 | 78.733 |
| 179732621 | GT-AG | 0 | 1.000000099473604e-05 | 2733 | rna-XM_047142958.1 32191379 | 31 | 893560027 | 893562759 | Schistocerca americana 7009 | CAG\|GTAATTGGTG...ATTACATCAATA/ATGTAAATTACA...TACAG\|ATC | 0 | 1 | 81.123 |
| 179732622 | GT-AG | 0 | 1.000000099473604e-05 | 11189 | rna-XM_047142958.1 32191379 | 32 | 893562998 | 893574186 | Schistocerca americana 7009 | AGA\|GTGAGTGGGG...ATACCTTTAATT/CATTTTTTAATG...TCTAG\|CAA | 1 | 1 | 83.807 |
| 179732623 | GT-AG | 0 | 0.0003505253668252 | 815 | rna-XM_047142958.1 32191379 | 33 | 893574461 | 893575275 | Schistocerca americana 7009 | CAA\|GTAAGCCTCA...CTCTTTTTATTT/TTTTATTTAATT...TTTAG\|ATC | 2 | 1 | 86.897 |
| 179732624 | GT-AG | 0 | 1.000000099473604e-05 | 4635 | rna-XM_047142958.1 32191379 | 34 | 893575481 | 893580115 | Schistocerca americana 7009 | CAG\|GTAATATCAT...TTTTCTCCAACA/CAATGAGTAATT...TTCAG\|AAG | 0 | 1 | 89.208 |
| 179732625 | GT-AG | 0 | 1.000000099473604e-05 | 1647 | rna-XM_047142958.1 32191379 | 35 | 893580421 | 893582067 | Schistocerca americana 7009 | AAA\|GTAAGTAATA...GTGTGCATAACT/GTTGATGTTATT...TGCAG\|GTT | 2 | 1 | 92.648 |
| 179732626 | GT-AG | 0 | 3.138449662593481e-05 | 12388 | rna-XM_047142958.1 32191379 | 36 | 893582236 | 893594623 | Schistocerca americana 7009 | TCA\|GTAAGTACCC...TTATCATTAATC/GGATATCTCATT...TCTAG\|GTT | 2 | 1 | 94.542 |
| 179732627 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_047142958.1 32191379 | 37 | 893594820 | 893594905 | Schistocerca americana 7009 | CAG\|GTAAATATTA...TTTCCATTAATT/TAATTATTTATC...TGTAG\|GCT | 0 | 1 | 96.752 |
| 179732628 | GT-AG | 0 | 1.000000099473604e-05 | 9271 | rna-XM_047142958.1 32191379 | 38 | 893594991 | 893604261 | Schistocerca americana 7009 | TGG\|GTAAGTCCAG...TGGTTCTAATTT/TTGGTTCTAATT...TCCAG\|ATC | 1 | 1 | 97.711 |
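The rows above appear to follow two relationships that the column notes do not spell out: `length` equals `end - start + 1` (inclusive coordinates), and `dinucleotide_pair` matches the two bases just inside each `|` boundary in `scored_motifs`. The sketch below checks both against three rows copied from the table. The function names are hypothetical, and reading `|` as the exon/intron boundary is an assumption drawn from these rows, not a documented format.

```python
def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both inside it."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Read the intron's first and last two bases from the motif string.

    Assumes the first '|' sits just before the intron's first base and
    the last '|' sits just after its last base.
    """
    first_bar = scored_motifs.index("|")
    last_bar = scored_motifs.rindex("|")
    donor = scored_motifs[first_bar + 1:first_bar + 3]
    acceptor = scored_motifs[last_bar - 2:last_bar]
    return f"{donor}-{acceptor}"


# (dinucleotide_pair, length, start, end, scored_motifs) for introns 1, 22, 37.
rows = [
    ("GT-AG", 28364, 893308633, 893336996,
     "AAG|GTAACAAGCT...ATTTTCTTAAAT/TATTTTCTTAAA...TCCAG|GCA"),
    ("GC-AG", 14634, 893496766, 893511399,
     "AAG|GCAAGTCTGA...AAATGCTTGATG/GTTTTTCTAATT...TGCAG|TGC"),
    ("GT-AG", 86, 893594820, 893594905,
     "CAG|GTAAATATTA...TTTCCATTAATT/TAATTATTTATC...TGTAG|GCT"),
]

for pair, length, start, end, motifs in rows:
    assert inclusive_length(start, end) == length
    assert terminal_dinucleotides(motifs) == pair
print("all checks passed")
```

Only three rows are checked here, so both relationships remain observations about this page rather than guarantees about the whole table.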
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
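To illustrate how the `transcript_id` index supports the filter shown on this page, here is a minimal sketch using Python's built-in `sqlite3` module and an in-memory database. The table is a reduced, hypothetical subset of the schema above (four columns, no foreign keys), loaded with three rows copied from the page. It is not a copy of the real database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced subset of the introns schema, enough to show the filter.
conn.executescript("""
CREATE TABLE introns (
  id INTEGER PRIMARY KEY,
  length INTEGER,
  transcript_id INTEGER,
  ordinal_index INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

# First three introns of transcript 32191379, as listed on this page.
conn.executemany(
    "INSERT INTO introns (id, length, transcript_id, ordinal_index) "
    "VALUES (?, ?, ?, ?)",
    [
        (179732591, 28364, 32191379, 1),
        (179732592, 5883, 32191379, 2),
        (179732593, 6223, 32191379, 3),
    ],
)

query = (
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY id"
)
rows = conn.execute(query, (32191379,)).fetchall()
for row in rows:
    print(row)

# Ask SQLite how it would run the query; the last column of each plan
# row is a human-readable description that names any index used.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (32191379,)).fetchall()
for step in plan:
    print(step[-1])
```

The exact wording of the plan text varies between SQLite versions, so it is safer to look for the index name in it than to compare the full string.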