introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
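The column list above can be read as a record type. The following is a minimal sketch, not part of the database itself: the `Intron` class name and its validation rules are hypothetical, and mapping the 1/0 INTEGER flags to `bool` is an assumption made for readability. The example values are copied from the first row of the table on this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical in-memory mirror of one row of the introns table."""
    id: int
    dinucleotide_pair: str
    is_minor: bool              # stored as INTEGER 1/0
    score: float                # probability of being minor, in percent (0-100)
    length: int                 # base pairs
    transcript_id: int          # references transcripts(id)
    ordinal_index: int          # 1 for the first intron of the transcript
    start: int
    end: int
    taxonomy_id: int            # references genomes(taxonomy_id)
    scored_motifs: str
    phase: Optional[int]        # 0, 1 or 2; None outside the coding sequence
    in_cds: bool                # stored as INTEGER 1/0
    relative_position: float    # percent of coding length

    def __post_init__(self) -> None:
        # Reject values that contradict the column descriptions.
        if self.phase is not None and self.phase not in (0, 1, 2):
            raise ValueError(f"phase must be 0, 1, 2 or None, got {self.phase}")
        if not 0.0 <= self.score <= 100.0:
            raise ValueError(f"score must lie in [0, 100], got {self.score}")


first = Intron(
    id=82387222,
    dinucleotide_pair="GT-AG",
    is_minor=False,
    score=1.000000099473604e-05,
    length=4072,
    transcript_id=15214366,
    ordinal_index=1,
    start=374731662,
    end=374735733,
    taxonomy_id=260995,
    scored_motifs="CAG|GTGAGTGGTT...GATCCTATGACT/GATCCTATGACT...TGCAG|ATG",
    phase=1,
    in_cds=True,
    relative_position=1.667,
)
print(first.dinucleotide_pair, first.phase, first.is_minor)
```

Keeping `phase` optional reflects the null value documented for introns outside the coding sequence.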
33 rows where transcript_id = 15214366
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
82387222 | GT-AG | 0 | 1.000000099473604e-05 | 4072 | rna-XM_033915540.1 15214366 | 1 | 374731662 | 374735733 | Geotrypetes seraphini 260995 | CAG\|GTGAGTGGTT...GATCCTATGACT/GATCCTATGACT...TGCAG\|ATG | 1 | 1 | 1.667 |
82387223 | GT-AG | 0 | 0.0004458745107572 | 16732 | rna-XM_033915540.1 15214366 | 2 | 374735848 | 374752579 | Geotrypetes seraphini 260995 | GTG\|GTAAGCTATG...TTGCTTTTAAAC/ATTTTGTTAAAT...ACCAG\|ATG | 1 | 1 | 3.459 |
82387224 | GT-AG | 0 | 0.0161699265218939 | 12402 | rna-XM_033915540.1 15214366 | 3 | 374752700 | 374765101 | Geotrypetes seraphini 260995 | GTG\|GTATGCAAAA...TAATTTTTAGCC/TTAATTTTTAGC...ACAAG\|AAT | 1 | 1 | 5.346 |
82387225 | GT-AG | 0 | 0.0001070308402597 | 4722 | rna-XM_033915540.1 15214366 | 4 | 374765330 | 374770051 | Geotrypetes seraphini 260995 | GTG\|GTATGTGCAG...TGTTTCTTCTCC/CCAGCTCTGAAA...TTCAG\|TAT | 1 | 1 | 8.931 |
82387226 | GT-AG | 0 | 0.0006201822145545 | 9223 | rna-XM_033915540.1 15214366 | 5 | 374770163 | 374779385 | Geotrypetes seraphini 260995 | GTG\|GTAACTCTAG...TTCATCTTATGG/TTTGTGTTCATC...CTTAG\|GTG | 1 | 1 | 10.676 |
82387227 | GT-AG | 0 | 0.0038993777436089 | 12920 | rna-XM_033915540.1 15214366 | 6 | 374779506 | 374792425 | Geotrypetes seraphini 260995 | CTG\|GTATGTTTAA...AAATTCTTCTTA/AAATGATTTACA...TACAG\|ATG | 1 | 1 | 12.563 |
82387228 | GT-AG | 0 | 1.000000099473604e-05 | 1807 | rna-XM_033915540.1 15214366 | 7 | 374792552 | 374794358 | Geotrypetes seraphini 260995 | TAG\|GTAAGTACTG...AAGCCATTAATG/AAGCCATTAATG...TGTAG\|ACA | 1 | 1 | 14.544 |
82387229 | GT-AG | 0 | 1.000000099473604e-05 | 23985 | rna-XM_033915540.1 15214366 | 8 | 374794566 | 374818550 | Geotrypetes seraphini 260995 | CAG\|GTAAGTTCTA...ATATTTTTGTTT/CATATATTAATG...CCTAG\|ACA | 1 | 1 | 17.799 |
82387230 | GT-AG | 0 | 2.307772543848091e-05 | 3137 | rna-XM_033915540.1 15214366 | 9 | 374818720 | 374821856 | Geotrypetes seraphini 260995 | AAG\|GTAGGCAGTG...TTGTCTGTAACT/TTGTCTGTAACT...TGCAG\|GTA | 2 | 1 | 20.456 |
82387231 | GT-AG | 0 | 0.2956946237966693 | 3413 | rna-XM_033915540.1 15214366 | 10 | 374822026 | 374825438 | Geotrypetes seraphini 260995 | GAG\|GTATCGTGAA...AAAACCTTAACT/AATGTATTTAAA...TTTAG\|TCC | 0 | 1 | 23.113 |
82387232 | GT-AG | 0 | 1.000000099473604e-05 | 5319 | rna-XM_033915540.1 15214366 | 11 | 374825659 | 374830977 | Geotrypetes seraphini 260995 | AAA\|GTGAGTCAAA...ATTTTTTTTCCT/TGTATTTTCATG...TGTAG\|GTA | 1 | 1 | 26.572 |
82387233 | GT-AG | 0 | 1.000000099473604e-05 | 9953 | rna-XM_033915540.1 15214366 | 12 | 374831101 | 374841053 | Geotrypetes seraphini 260995 | GAC\|GTAAGTAAAC...GATGCTTTTGTG/AAGGGATTCAAT...TTTAG\|ATC | 1 | 1 | 28.506 |
82387234 | GT-AG | 0 | 1.000000099473604e-05 | 10251 | rna-XM_033915540.1 15214366 | 13 | 374841276 | 374851526 | Geotrypetes seraphini 260995 | GAG\|GTGAACAAAC...TTTTGCTTATAA/ATTTTGCTTATA...CTCAG\|TCT | 1 | 1 | 31.997 |
82387235 | GT-AG | 0 | 1.000000099473604e-05 | 125 | rna-XM_033915540.1 15214366 | 14 | 374851833 | 374851957 | Geotrypetes seraphini 260995 | TAG\|GTAAGTACTG...TTTTTCCTACCT/GTATGATTAATA...TCTAG\|GTT | 1 | 1 | 36.808 |
82387236 | GC-AG | 0 | 1.000000099473604e-05 | 8969 | rna-XM_033915540.1 15214366 | 15 | 374852162 | 374861130 | Geotrypetes seraphini 260995 | AAG\|GCAAGTCTTT...TTACTCTTGCAT/CTTGCATTCATC...TGCAG\|CTA | 1 | 1 | 40.016 |
82387237 | GT-AG | 0 | 1.000000099473604e-05 | 3961 | rna-XM_033915540.1 15214366 | 16 | 374861251 | 374865211 | Geotrypetes seraphini 260995 | AAG\|GTAAGTATCC...CTGAGCTTAATA/CTGAGCTTAATA...TCCAG\|ATT | 1 | 1 | 41.903 |
82387238 | GT-AG | 0 | 1.000000099473604e-05 | 14574 | rna-XM_033915540.1 15214366 | 17 | 374865386 | 374879959 | Geotrypetes seraphini 260995 | CAG\|GTAATAGTTT...TCGCTTTTAGAT/GTAAATTTTATG...TTCAG\|GTA | 1 | 1 | 44.638 |
82387239 | GT-AG | 0 | 1.000000099473604e-05 | 1232 | rna-XM_033915540.1 15214366 | 18 | 374880176 | 374881407 | Geotrypetes seraphini 260995 | GTG\|GTGAGTTGCT...TTCAATTTAATT/TTCAATTTAATT...TGCAG\|TTC | 1 | 1 | 48.035 |
82387240 | GT-AG | 0 | 5.282069888173096e-05 | 6236 | rna-XM_033915540.1 15214366 | 19 | 374881644 | 374887879 | Geotrypetes seraphini 260995 | AAG\|GTACTTAACA...GATCCTTTAGAA/ATGTCTTTCATT...TCCAG\|AAT | 0 | 1 | 51.745 |
82387241 | GT-AG | 0 | 1.000000099473604e-05 | 21451 | rna-XM_033915540.1 15214366 | 20 | 374888010 | 374909460 | Geotrypetes seraphini 260995 | AAT\|GTAAGACTTT...CAATTCTTACCA/CCAATTCTTACC...AACAG\|GTT | 1 | 1 | 53.789 |
82387242 | GT-AG | 0 | 1.4211831044483653e-05 | 19594 | rna-XM_033915540.1 15214366 | 21 | 374909643 | 374929236 | Geotrypetes seraphini 260995 | ACT\|GTAAGTAATA...AGATTCTTGCTG/ACTAATCTGACT...TTCAG\|AGG | 0 | 1 | 56.651 |
82387243 | GT-AG | 0 | 1.000000099473604e-05 | 13326 | rna-XM_033915540.1 15214366 | 22 | 374929421 | 374942746 | Geotrypetes seraphini 260995 | CAG\|GTAAGAGCTG...TGTATTGTAATG/ATAGAAATCACT...CGTAG\|GTT | 1 | 1 | 59.544 |
82387244 | GT-AG | 0 | 0.0001959020534911 | 6391 | rna-XM_033915540.1 15214366 | 23 | 374942960 | 374949350 | Geotrypetes seraphini 260995 | AAG\|GTTTGTTTGA...ATGTTTTTAAAC/AAATTTCTAATT...TTCAG\|GAA | 1 | 1 | 62.893 |
82387245 | GT-AG | 0 | 1.000000099473604e-05 | 2550 | rna-XM_033915540.1 15214366 | 24 | 374949535 | 374952084 | Geotrypetes seraphini 260995 | TGG\|GTAAGATAAC...TTTTTCTTACTG/GTTTTTCTTACT...AAAAG\|GTT | 2 | 1 | 65.786 |
82387246 | GT-AG | 0 | 0.0116427570276862 | 92 | rna-XM_033915540.1 15214366 | 25 | 374952287 | 374952378 | Geotrypetes seraphini 260995 | AAG\|GTATTCCACT...GGACTCTTATCC/TTTTTTTTCAAT...CACAG\|TTG | 0 | 1 | 68.962 |
82387247 | GT-AG | 0 | 0.0012324760872758 | 5682 | rna-XM_033915540.1 15214366 | 26 | 374952544 | 374958225 | Geotrypetes seraphini 260995 | AAG\|GTATGTCTTT...TGAGACTTGATT/TGAGACTTGATT...TTAAG\|ATT | 0 | 1 | 71.557 |
82387248 | GT-AG | 0 | 0.0001921913685529 | 13857 | rna-XM_033915540.1 15214366 | 27 | 374958427 | 374972283 | Geotrypetes seraphini 260995 | CAG\|GTACTCCTAC...TTTATTTTCATT/TTTATTTTCATT...TCTAG\|GTA | 0 | 1 | 74.717 |
82387249 | GT-AG | 0 | 1.000000099473604e-05 | 6030 | rna-XM_033915540.1 15214366 | 28 | 374972825 | 374978854 | Geotrypetes seraphini 260995 | AGA\|GTGAGTATAC...TTTTCCTTGCAC/CATTTGTTTATG...TGCAG\|TTC | 1 | 1 | 83.223 |
82387250 | GT-AG | 0 | 1.000000099473604e-05 | 3626 | rna-XM_033915540.1 15214366 | 29 | 374978978 | 374982603 | Geotrypetes seraphini 260995 | ATG\|GTAAATGTTC...ATTGTCATATTG/TATATTGTCATA...ATTAG\|GAA | 1 | 1 | 85.157 |
82387251 | GT-AG | 0 | 0.0004069061552471 | 948 | rna-XM_033915540.1 15214366 | 30 | 374982709 | 374983656 | Geotrypetes seraphini 260995 | AAG\|GTATTGTCTG...TCTTTCTTGCTT/CCGATATTTATT...GGCAG\|GCA | 1 | 1 | 86.808 |
82387252 | GT-AG | 0 | 1.000000099473604e-05 | 6261 | rna-XM_033915540.1 15214366 | 31 | 374983894 | 374990154 | Geotrypetes seraphini 260995 | CAT\|GTAAGTACTG...GCTTCATTATTA/AGCTGCTTCATT...AATAG\|GTG | 1 | 1 | 90.535 |
82387253 | GT-AG | 0 | 1.000000099473604e-05 | 3471 | rna-XM_033915540.1 15214366 | 32 | 374990594 | 374994064 | Geotrypetes seraphini 260995 | ACG\|GTGAGTATAT...GCAGTGTTGAAA/ATGGATGTAACT...GATAG\|GAC | 2 | 1 | 97.437 |
82387254 | GT-AG | 0 | 1.000000099473604e-05 | 4017 | rna-XM_033915540.1 15214366 | 33 | 374994204 | 374998220 | Geotrypetes seraphini 260995 | CAG\|GTAAGAGCAT...CTACTTTTATAT/TCTACTTTTATA...TACAG\|GAA | 0 | 1 | 99.623 |
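
The rows above appear consistent with 1-based, inclusive coordinates, so that `length = end - start + 1`. The page does not state this convention explicitly; it is inferred from the listed values. A minimal sketch that checks it on three rows copied from the table (the helper name `intron_length` is hypothetical):

```python
# (start, end, length) copied from the rows with ordinal_index 1, 14 and 25.
ROWS = [
    (374731662, 374735733, 4072),
    (374851833, 374851957, 125),
    (374952287, 374952378, 92),
]


def intron_length(start: int, end: int) -> int:
    """Length under the assumption of 1-based, inclusive coordinates."""
    return end - start + 1


# Collect any row whose stored length disagrees with the computed one.
mismatches = [row for row in ROWS if intron_length(row[0], row[1]) != row[2]]
print(mismatches)
```

An empty list means the three sampled rows agree with the assumed convention; it does not prove the convention holds for the whole database.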
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
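
To illustrate how the schema above supports the `transcript_id = 15214366` filter shown on this page, here is a minimal sketch using Python's standard `sqlite3` module with an in-memory database. It is an illustration only: the referenced `transcripts` and `genomes` tables are not created (foreign-key enforcement is switched off explicitly), only three rows from the table are loaded, and `scored_motifs` is left unset for brevity.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
  "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
  "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT,
  "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

# Three rows copied from the table (ordinal_index 1, 15 and 25).
ROWS = [
    (82387222, "GT-AG", 0, 1.000000099473604e-05, 4072, 15214366, 1,
     374731662, 374735733, 260995, 1, 1, 1.667),
    (82387236, "GC-AG", 0, 1.000000099473604e-05, 8969, 15214366, 15,
     374852162, 374861130, 260995, 1, 1, 40.016),
    (82387246, "GT-AG", 0, 0.0116427570276862, 92, 15214366, 25,
     374952287, 374952378, 260995, 0, 1, 68.962),
]

conn = sqlite3.connect(":memory:")
# The parent tables are absent in this sketch, so keep enforcement off.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(SCHEMA)
conn.executemany(
    'INSERT INTO introns (id, dinucleotide_pair, is_minor, score, length, '
    'transcript_id, ordinal_index, "start", "end", taxonomy_id, phase, '
    'in_cds, relative_position) VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)',
    ROWS,
)

# Same filter as the page, with a bound parameter instead of string formatting.
result = conn.execute(
    "SELECT ordinal_index, dinucleotide_pair, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (15214366,),
).fetchall()
print(result)
conn.close()
```

The column names `start` and `end` are quoted in the `INSERT` because `END` is an SQL keyword in SQLite; quoting `start` as well is only for symmetry.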