introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
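The rows listed for transcript 3982015 are consistent with start and end both being inclusive coordinates, so that length = end - start + 1. This is an inference from the listed values, not something the database states. The sketch below checks that relationship on three of those rows; `inclusive_length` is a hypothetical helper name.

```python
# (length, start, end) copied from three rows listed for transcript 3982015.
rows = [
    (6037, 149821807, 149827843),  # ordinal_index 1
    (1366, 149820261, 149821626),  # ordinal_index 2
    (2828, 149794436, 149797263),  # ordinal_index 26
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end both lie inside the intron.

    Assumption: coordinates are inclusive on both sides (inferred from the data).
    """
    return end - start + 1


for length, start, end in rows:
    # Each stored length should match the length derived from the coordinates.
    assert inclusive_length(start, end) == length
```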
26 rows where transcript_id = 3982015
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 20494897 | GT-AG | 0 | 1.000000099473604e-05 | 6037 | rna-XM_036832070.1 3982015 | 1 | 149821807 | 149827843 | Balaenoptera musculus 9771 | TCG\|GTGAGCTGGG...CCTCTCTTAGTT/TTAGTTTTGATA...TTTAG\|GCC | 1 | 1 | 1.13 |
| 20494898 | GT-AG | 0 | 1.000000099473604e-05 | 1366 | rna-XM_036832070.1 3982015 | 2 | 149820261 | 149821626 | Balaenoptera musculus 9771 | AAC\|GTGAGTAATC...TTTGTCTTGTTT/CAGTCCCAAATC...TATAG\|GTA | 1 | 1 | 4.637 |
| 20494899 | GT-AG | 0 | 1.000000099473604e-05 | 427 | rna-XM_036832070.1 3982015 | 3 | 149819734 | 149820160 | Balaenoptera musculus 9771 | AGG\|GTGAGTCACC...CTTTCCTAAAAT/TGATCACTCATC...TTTAG\|ATA | 2 | 1 | 6.585 |
| 20494900 | GT-AG | 0 | 1.000000099473604e-05 | 560 | rna-XM_036832070.1 3982015 | 4 | 149819088 | 149819647 | Balaenoptera musculus 9771 | AAT\|GTGAGTAGAA...CTTTTCCTATCT/TAACATTTAAAT...CCAAG\|CTA | 1 | 1 | 8.26 |
| 20494901 | GT-AG | 0 | 1.000000099473604e-05 | 1895 | rna-XM_036832070.1 3982015 | 5 | 149816794 | 149818688 | Balaenoptera musculus 9771 | AGG\|GTGAGTTGGG...AAGTTCTTATTT/AAAGTTCTTATT...CTCAG\|GGT | 1 | 1 | 16.034 |
| 20494902 | GT-AG | 0 | 2.752895608808031e-05 | 82 | rna-XM_036832070.1 3982015 | 6 | 149816535 | 149816616 | Balaenoptera musculus 9771 | AAG\|GTGCCTAAAG...TGTTTCCTGACT/TGTTTCCTGACT...CCTAG\|CAA | 1 | 1 | 19.482 |
| 20494903 | GT-AG | 0 | 1.000000099473604e-05 | 636 | rna-XM_036832070.1 3982015 | 7 | 149815796 | 149816431 | Balaenoptera musculus 9771 | GGG\|GTGAGTGCAA...TACCCCTTATTG/CTACCCCTTATT...TTAAG\|GTA | 2 | 1 | 21.488 |
| 20494904 | GT-AG | 0 | 1.000000099473604e-05 | 1170 | rna-XM_036832070.1 3982015 | 8 | 149814540 | 149815709 | Balaenoptera musculus 9771 | AAC\|GTGAGTAGAA...TGAATCTTATTG/TTGAATCTTATT...TGCAG\|GGA | 1 | 1 | 23.164 |
| 20494905 | GT-AG | 0 | 1.000000099473604e-05 | 1402 | rna-XM_036832070.1 3982015 | 9 | 149812748 | 149814149 | Balaenoptera musculus 9771 | GTG\|GTGAGTAGAA...TTTTTCCTAATG/TTTTTCCTAATG...TAAAG\|ATA | 1 | 1 | 30.762 |
| 20494906 | GT-AG | 0 | 0.1523802390766773 | 85 | rna-XM_036832070.1 3982015 | 10 | 149812474 | 149812558 | Balaenoptera musculus 9771 | AAG\|GTACCCTAAA...TCATCTTTGGCT/ACTTTGCTCATC...CTTAG\|AGG | 1 | 1 | 34.444 |
| 20494907 | GT-AG | 0 | 1.000000099473604e-05 | 227 | rna-XM_036832070.1 3982015 | 11 | 149812147 | 149812373 | Balaenoptera musculus 9771 | AGG\|GTGAGTGTCT...ATAGTTTTGAAT/ATAGTTTTGAAT...TCCAG\|GTA | 2 | 1 | 36.392 |
| 20494908 | GT-AG | 0 | 0.0007430862524172 | 521 | rna-XM_036832070.1 3982015 | 12 | 149811543 | 149812063 | Balaenoptera musculus 9771 | AAG\|GTATGTTAAG...CTCTCTGTAATT/AATCTTGTCATC...CCTAG\|AAA | 1 | 1 | 38.009 |
| 20494909 | GT-AG | 0 | 1.000000099473604e-05 | 602 | rna-XM_036832070.1 3982015 | 13 | 149810533 | 149811134 | Balaenoptera musculus 9771 | AGG\|GTGGGTATTC...AATTCTGTATCT/CTTTCTATCATT...TTTAG\|ATT | 1 | 1 | 45.958 |
| 20494910 | GT-AG | 0 | 1.000000099473604e-05 | 79 | rna-XM_036832070.1 3982015 | 14 | 149810277 | 149810355 | Balaenoptera musculus 9771 | AAG\|GTTCCAGGCT...ATTGCCTTGAGT/TGACTTCTCATC...TTTAG\|TGG | 1 | 1 | 49.406 |
| 20494911 | GT-AG | 0 | 1.000000099473604e-05 | 294 | rna-XM_036832070.1 3982015 | 15 | 149809892 | 149810185 | Balaenoptera musculus 9771 | AGG\|GTGAGTGAGG...AATGTCATGACT/TTAGTGGTCACC...CCTAG\|GTA | 2 | 1 | 51.179 |
| 20494912 | GT-AG | 0 | 1.000000099473604e-05 | 919 | rna-XM_036832070.1 3982015 | 16 | 149808896 | 149809814 | Balaenoptera musculus 9771 | AAG\|GTGAGTAGAA...TGGTGTTTAATA/TAACTTTTCATC...TCCAG\|ATA | 1 | 1 | 52.679 |
| 20494913 | GT-AG | 0 | 1.000000099473604e-05 | 386 | rna-XM_036832070.1 3982015 | 17 | 149808105 | 149808490 | Balaenoptera musculus 9771 | AAG\|GTAAAAACTC...GTGTTCTTGAGT/TTGGTATTTATG...TTTAG\|GCT | 1 | 1 | 60.569 |
| 20494914 | GT-AG | 0 | 0.0001487129079849 | 74 | rna-XM_036832070.1 3982015 | 18 | 149807854 | 149807927 | Balaenoptera musculus 9771 | AAG\|GTACTTTCAG...TCTCTTTTATAT/TTCTCTTTTATA...TCTAG\|AAG | 1 | 1 | 64.017 |
| 20494915 | GT-AG | 0 | 1.000000099473604e-05 | 355 | rna-XM_036832070.1 3982015 | 19 | 149807411 | 149807765 | Balaenoptera musculus 9771 | TGG\|GTAAGTATGA...TGATTTTTAGGA/CGTATTATAATC...TCTAG\|GTA | 2 | 1 | 65.732 |
| 20494916 | GT-AG | 0 | 2.2252787945917965e-05 | 863 | rna-XM_036832070.1 3982015 | 20 | 149806465 | 149807327 | Balaenoptera musculus 9771 | AAG\|GTAAGTTAAA...ACTTTCTTAATC/ACTTTCTTAATC...TGCAG\|TTA | 1 | 1 | 67.349 |
| 20494917 | GT-AG | 0 | 1.000000099473604e-05 | 1043 | rna-XM_036832070.1 3982015 | 21 | 149805029 | 149806071 | Balaenoptera musculus 9771 | AGG\|GTAAGATATT...ATCTTTTTAAAT/ATCTTTTTAAAT...CTAAG\|CTT | 1 | 1 | 75.005 |
| 20494918 | GT-AG | 0 | 2.554421420747407e-05 | 1089 | rna-XM_036832070.1 3982015 | 22 | 149803754 | 149804842 | Balaenoptera musculus 9771 | AAG\|GTGTGTTGTT...GATTCCTTTAAT/ATATGACTCAGC...TTCAG\|AGG | 1 | 1 | 78.628 |
| 20494919 | GT-AG | 0 | 1.000000099473604e-05 | 1328 | rna-XM_036832070.1 3982015 | 23 | 149802237 | 149803564 | Balaenoptera musculus 9771 | CAA\|GTAAGTACAA...ATTTTCATGATT/AATATTTTCATG...CTTAG\|GCT | 1 | 1 | 82.311 |
| 20494920 | GT-AG | 0 | 5.02886526406455e-05 | 696 | rna-XM_036832070.1 3982015 | 24 | 149801517 | 149802212 | Balaenoptera musculus 9771 | GCG\|GTAAGTTTTC...ATTTTTCTAGCA/TAGCATTTGATA...TCTAG\|GTT | 1 | 1 | 82.778 |
| 20494921 | GT-AG | 0 | 1.6581209135064046e-05 | 4071 | rna-XM_036832070.1 3982015 | 25 | 149797370 | 149801440 | Balaenoptera musculus 9771 | ATG\|GTAAGTTCAA...TTTTTCTTTTTT/CAATGAGTAAAG...TTTAG\|CAA | 2 | 1 | 84.259 |
| 20508172 | GT-AG | 0 | 0.3493050533144604 | 2828 | rna-XM_036832070.1 3982015 | 26 | 149794436 | 149797263 | Balaenoptera musculus 9771 | CTG\|GTATCTAATG...GTCTCCTAAATA/TAGAAATTCATC...TTCAG\|TAT | | 0 | 86.324 |
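The filter behind this listing (transcript_id = 3982015) can be reproduced locally. The following is a minimal sketch using Python's built-in sqlite3 module and an in-memory table that holds only a subset of the columns and three of the rows above. It assumes the phase of intron 26 is null, following the phase column description for introns outside the coding sequence; the variable names are illustrative.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced version of the introns table: only the columns used in this sketch.
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        score REAL,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER,
        in_cds INTEGER
    )
    """
)

# Three rows copied from the listing; phase of the last one is assumed NULL.
sample = [
    (20494897, 1.000000099473604e-05, 3982015, 1, 1, 1),
    (20494906, 0.1523802390766773, 3982015, 10, 1, 1),
    (20508172, 0.3493050533144604, 3982015, 26, None, 0),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample)

# Same filter as the page: all introns of one transcript, ordered by id.
by_transcript = conn.execute(
    "SELECT id, ordinal_index FROM introns WHERE transcript_id = ? ORDER BY id",
    (3982015,),
).fetchall()

# Introns outside the coding sequence (in_cds = 0) carry no phase.
non_coding = conn.execute(
    "SELECT ordinal_index, phase FROM introns "
    "WHERE transcript_id = ? AND in_cds = 0",
    (3982015,),
).fetchall()
```

Against the full database the first query would return all 26 rows; here it returns only the three sample rows.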
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
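As a rough illustration of why the transcript_id index matters for lookups like the one on this page, the sketch below creates the table and that one index in an in-memory SQLite database and asks SQLite for its query plan. It assumes only Python's standard sqlite3 module; the referenced transcripts and genomes tables are not created, which SQLite tolerates at table-creation time. The exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

# Table definition and one index, copied from the schema above.
DDL = """
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(DDL)

# Ask SQLite how it would run the per-transcript lookup.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (3982015,),
).fetchall()

# The last column of each plan row is a human-readable description.
plan_details = [row[-1] for row in plan]
print(plan_details)
```

If the index is picked up, one of the printed plan lines names idx_introns_transcript_id, meaning the lookup is an index search and not a scan of the whole table.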