introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
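As a rough illustration of how these columns fit together, the sketch below models one row as a Python dataclass. The class name `Intron` and the two helper properties are hypothetical and not part of the database. The sketch assumes `score` is a percentage, as the description states, and that `dinucleotide_pair` lists the 5' dinucleotide before the 3' one. The sample values are copied from the first row of the table on this page.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Intron:
    """One row of the introns table (hypothetical helper class)."""
    id: int
    dinucleotide_pair: str
    is_minor: int
    score: float
    length: int
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]  # None for introns outside the coding sequence
    in_cds: int
    relative_position: float

    @property
    def minor_probability(self) -> float:
        """Score rescaled from a percentage (0-100) to a fraction (0-1)."""
        return self.score / 100.0

    @property
    def terminal_dinucleotides(self) -> Tuple[str, str]:
        """Split e.g. 'GT-AG' into its 5' and 3' dinucleotides."""
        five_prime, three_prime = self.dinucleotide_pair.split("-")
        return five_prime, three_prime


# Values taken from the first row of the table on this page.
first = Intron(
    id=122608204, dinucleotide_pair="GT-AG", is_minor=0,
    score=1.000000099473604e-05, length=1592, transcript_id=22607919,
    ordinal_index=1, start=155169241, end=155170832, taxonomy_id=10093,
    scored_motifs="TGG|GTGAGAAATT...TGTCTTTTAAAA/TGTCTTTTAAAA...CCCAG|AAT",
    phase=0, in_cds=1, relative_position=0.649,
)

print(first.terminal_dinucleotides)
print(bool(first.is_minor))
```

Typing `phase` as `Optional[int]` mirrors the null that the column description allows for introns outside the coding sequence.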
31 rows where transcript_id = 22607919
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122608204 | GT-AG | 0 | 1.000000099473604e-05 | 1592 | rna-XM_021205166.2 22607919 | 1 | 155169241 | 155170832 | Mus pahari 10093 | TGG\|GTGAGAAATT...TGTCTTTTAAAA/TGTCTTTTAAAA...CCCAG\|AAT | 0 | 1 | 0.649 |
| 122608205 | GT-AG | 0 | 1.000000099473604e-05 | 8701 | rna-XM_021205166.2 22607919 | 2 | 155171007 | 155179707 | Mus pahari 10093 | CAG\|GTACAGTAAC...GCAGCCTTATAA/GTGACCCTAACA...CTCAG\|GTG | 0 | 1 | 4.413 |
| 122608206 | GT-AG | 0 | 1.000000099473604e-05 | 1617 | rna-XM_021205166.2 22607919 | 3 | 155179834 | 155181450 | Mus pahari 10093 | TGG\|GTAAGGTTCA...CGGTCCTTGGTG/TTGGTGGTGACA...TCCAG\|CTC | 0 | 1 | 7.138 |
| 122608207 | GT-AG | 0 | 1.000000099473604e-05 | 206 | rna-XM_021205166.2 22607919 | 4 | 155181586 | 155181791 | Mus pahari 10093 | AAG\|GTAAGGGGGA...TTTGCCTTGAAT/TTTGCCTTGAAT...CACAG\|GAT | 0 | 1 | 10.058 |
| 122608208 | GT-AG | 0 | 1.000000099473604e-05 | 543 | rna-XM_021205166.2 22607919 | 5 | 155181900 | 155182442 | Mus pahari 10093 | CAT\|GTGAGACTCT...AAGTTCTTATTT/AAAGTTCTTATT...TTCAG\|GCC | 0 | 1 | 12.395 |
| 122608209 | GT-AG | 0 | 1.000000099473604e-05 | 1347 | rna-XM_021205166.2 22607919 | 6 | 155182499 | 155183845 | Mus pahari 10093 | TAG\|GTAAGAAATG...ACTGACATAGCA/AGCAGACTGACA...TCCAG\|CAC | 2 | 1 | 13.606 |
| 122608210 | GT-AG | 0 | 0.0004330657755291 | 3005 | rna-XM_021205166.2 22607919 | 7 | 155184078 | 155187082 | Mus pahari 10093 | CTG\|GTAACTGTGA...AGTCCTGTAATG/CTGTAATGCACA...CCTAG\|GAA | 0 | 1 | 18.624 |
| 122608211 | GT-AG | 0 | 1.000000099473604e-05 | 1137 | rna-XM_021205166.2 22607919 | 8 | 155187247 | 155188383 | Mus pahari 10093 | GAA\|GTGAGTATGC...TATCACTTAATC/CTGAGTATCACT...TGCAG\|ATT | 2 | 1 | 22.172 |
| 122608212 | GT-AG | 0 | 1.000000099473604e-05 | 2940 | rna-XM_021205166.2 22607919 | 9 | 155188562 | 155191501 | Mus pahari 10093 | AAG\|GTAAGTGGTC...CTGGTCTTATCT/TCTGGTCTTATC...TTTAG\|GCA | 0 | 1 | 26.022 |
| 122608213 | GT-AG | 0 | 1.000000099473604e-05 | 964 | rna-XM_021205166.2 22607919 | 10 | 155191757 | 155192720 | Mus pahari 10093 | CAG\|GTAAAGAATG...TTCGTCTTTCTG/CGGAGGTACATT...GGCAG\|GTC | 0 | 1 | 31.538 |
| 122608214 | GT-AG | 0 | 1.000000099473604e-05 | 1562 | rna-XM_021205166.2 22607919 | 11 | 155192787 | 155194348 | Mus pahari 10093 | AAG\|GTAAGAAGGA...GGTGCCTTCTCA/TGCCTTCTCAGG...ATTAG\|ATC | 0 | 1 | 32.966 |
| 122608215 | GT-AG | 0 | 1.000000099473604e-05 | 496 | rna-XM_021205166.2 22607919 | 12 | 155194487 | 155194982 | Mus pahari 10093 | CTG\|GTGAGCAAGC...AGTTTCTTATAG/GAGTTTCTTATA...CCTAG\|GTG | 0 | 1 | 35.951 |
| 122608216 | GT-AG | 0 | 3.4861860039725774e-05 | 1126 | rna-XM_021205166.2 22607919 | 13 | 155195130 | 155196255 | Mus pahari 10093 | CAG\|GTACATAGGA...ATTCTCTTATTC/TTATTCCTGACT...TCTAG\|GCC | 0 | 1 | 39.13 |
| 122608217 | GT-AG | 0 | 2.22270113424806e-05 | 680 | rna-XM_021205166.2 22607919 | 14 | 155196341 | 155197020 | Mus pahari 10093 | TTG\|GTAAACAAAT...TTTCCCTTCCTC/ACGTAACTAAAA...AATAG\|ATA | 1 | 1 | 40.969 |
| 122608218 | GT-AG | 0 | 1.000000099473604e-05 | 1455 | rna-XM_021205166.2 22607919 | 15 | 155197088 | 155198542 | Mus pahari 10093 | AGA\|GTGAGTTCGA...ACATCCTTCAAA/ACATCCTTCAAA...TTCAG\|TGT | 2 | 1 | 42.418 |
| 122608219 | GT-AG | 0 | 1.000000099473604e-05 | 2302 | rna-XM_021205166.2 22607919 | 16 | 155198670 | 155200971 | Mus pahari 10093 | AAG\|GTGAGTGGAA...ATGGTCTGATCT/GATGGTCTGATC...TTTAG\|GGC | 0 | 1 | 45.165 |
| 122608220 | GT-AG | 0 | 4.2048416010534114e-05 | 1364 | rna-XM_021205166.2 22607919 | 17 | 155201149 | 155202512 | Mus pahari 10093 | AAA\|GTACGTGGGG...TGTTGCTTAACA/ACGACTTTCATC...CTCAG\|GGT | 0 | 1 | 48.994 |
| 122608221 | GT-AG | 0 | 1.000000099473604e-05 | 981 | rna-XM_021205166.2 22607919 | 18 | 155202681 | 155203661 | Mus pahari 10093 | AAG\|GTGAGGAGGG...TGGTTTTTGTTT/AGGAGAATGATG...CACAG\|ACT | 0 | 1 | 52.628 |
| 122608222 | GT-AG | 0 | 1.000000099473604e-05 | 2613 | rna-XM_021205166.2 22607919 | 19 | 155203843 | 155206455 | Mus pahari 10093 | CAG\|GTGTGTAAGC...TATCCCCTAACT/TAACTCCTAACA...CTTAG\|TCA | 1 | 1 | 56.543 |
| 122608223 | GT-AG | 0 | 0.0015485101581111 | 258 | rna-XM_021205166.2 22607919 | 20 | 155206583 | 155206840 | Mus pahari 10093 | CAG\|GTAGCTCGTG...TTTCCCTTTCTC/TACATGCTCAAG...CTCAG\|CTC | 2 | 1 | 59.291 |
| 122608224 | GT-AG | 0 | 4.384231518176341e-05 | 857 | rna-XM_021205166.2 22607919 | 21 | 155206977 | 155207833 | Mus pahari 10093 | AAG\|GTAAACACTC...AATTCTTTACCC/TAATTCTTTACC...TGTAG\|GTA | 0 | 1 | 62.232 |
| 122608225 | GT-AG | 0 | 0.0059510546951134 | 143 | rna-XM_021205166.2 22607919 | 22 | 155208054 | 155208196 | Mus pahari 10093 | AAG\|GTATTTTTAG...ATGGTATTAACA/ATGGTATTAACA...TCTAG\|GTA | 1 | 1 | 66.991 |
| 122608226 | GT-AG | 0 | 0.0006483500984529 | 2483 | rna-XM_021205166.2 22607919 | 23 | 155208352 | 155210834 | Mus pahari 10093 | GGT\|GTAAGTTTCT...TTCTCCTAATCT/TTTCTCCTAATC...CCTAG\|GAT | 0 | 1 | 70.344 |
| 122608227 | GT-AG | 0 | 1.000000099473604e-05 | 1651 | rna-XM_021205166.2 22607919 | 24 | 155211112 | 155212762 | Mus pahari 10093 | AGA\|GTGTCAGGAT...GGGGAGAGAAAG/AGGGGAGAGAAA...AGGAG\|GGA | 1 | 1 | 76.336 |
| 122608228 | GT-AG | 0 | 1.000000099473604e-05 | 2318 | rna-XM_021205166.2 22607919 | 25 | 155212833 | 155215150 | Mus pahari 10093 | CAG\|GTGAGGCCTC...TGACCCTTCTCT/CAATGACTGACC...CGCAG\|GTG | 2 | 1 | 77.85 |
| 122608229 | GT-AG | 0 | 1.000000099473604e-05 | 1959 | rna-XM_021205166.2 22607919 | 26 | 155215278 | 155217236 | Mus pahari 10093 | AAC\|GTGAGTCTGA...TTTTTCTTCTTC/TGTGTGCTGATA...TGTAG\|ATC | 0 | 1 | 80.597 |
| 122608230 | GT-AG | 0 | 1.000000099473604e-05 | 329 | rna-XM_021205166.2 22607919 | 27 | 155217339 | 155217667 | Mus pahari 10093 | GAG\|GTAAGAGTTA...AATTTCTCATCC/GAATTTCTCATC...GCCAG\|GCG | 0 | 1 | 82.803 |
| 122608231 | GT-AG | 0 | 1.000000099473604e-05 | 1053 | rna-XM_021205166.2 22607919 | 28 | 155217812 | 155218864 | Mus pahari 10093 | AAG\|GTGGGTGATA...TGCATGTTGAAA/TGCATGTTGAAA...CACAG\|GTC | 0 | 1 | 85.918 |
| 122608232 | GT-AG | 0 | 1.000000099473604e-05 | 3219 | rna-XM_021205166.2 22607919 | 29 | 155219024 | 155222242 | Mus pahari 10093 | CAG\|GTTAGTTAGC...TAAGCCCTGATG/CCTCTGCTGACA...TGCAG\|GAC | 0 | 1 | 89.358 |
| 122608233 | GT-AG | 0 | 1.000000099473604e-05 | 2424 | rna-XM_021205166.2 22607919 | 30 | 155222410 | 155224833 | Mus pahari 10093 | GAG\|GTAAGCTACT...TGGTCTTTGGAA/TTGGAACTCATA...CGCAG\|CAT | 2 | 1 | 92.97 |
| 122608234 | GT-AG | 0 | 1.000000099473604e-05 | 340 | rna-XM_021205166.2 22607919 | 31 | 155225029 | 155225368 | Mus pahari 10093 | CAA\|GTGAGTGTGT...CTTTCCTTTCCT/GACTTCCTAAAG...TTCAG\|GAT | 2 | 1 | 97.188 |
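The rows above appear to use 1-based, inclusive coordinates, so that `length = end - start + 1`. The sketch below checks this for introns 5 to 7 and derives the lengths of the exons between them. It also tests a tentative reading of `phase`: if the transcript lies on the forward strand (coordinates increase with `ordinal_index`) and the intervening exons are entirely coding, the phase of the next intron should equal the current phase plus the exon length, modulo 3. Both conditions are assumptions, and the function names are illustrative only.

```python
# (ordinal_index, start, end, length, phase), copied from introns 5-7 in the table.
ROWS = [
    (5, 155181900, 155182442, 543, 0),
    (6, 155182499, 155183845, 1347, 2),
    (7, 155184078, 155187082, 3005, 0),
]


def intron_length(start: int, end: int) -> int:
    """Length under the assumption of 1-based, inclusive coordinates."""
    return end - start + 1


def exon_gaps(rows):
    """Bases between consecutive introns, i.e. the intervening exon lengths."""
    ordered = sorted(rows)  # sorts by ordinal_index, the first tuple element
    return [nxt[1] - cur[2] - 1 for cur, nxt in zip(ordered, ordered[1:])]


def predicted_phases(first_phase: int, gaps):
    """Propagate phase across exons: next = (current + exon length) mod 3."""
    phases = [first_phase]
    for gap in gaps:
        phases.append((phases[-1] + gap) % 3)
    return phases


lengths_match = all(intron_length(s, e) == n for _, s, e, n, _ in ROWS)
gaps = exon_gaps(ROWS)
phases = predicted_phases(ROWS[0][4], gaps)

print(lengths_match)  # True
print(gaps)           # [56, 232]
print(phases)         # [0, 2, 0]
```

The predicted phases match the `phase` column for these three rows, which is consistent with the assumptions but does not prove them for other transcripts.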
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
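To show how the schema and the `transcript_id` index support the filtered view on this page, here is a minimal sketch using Python's standard `sqlite3` module and an in-memory database. It loads only the first two rows of the table and creates only one of the indexes. The `transcripts` and `genomes` parent tables are not created, so foreign key enforcement is explicitly switched off. This setup is an assumption made for illustration, not a description of how the database is actually built or served.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
"""

# First two rows of the table on this page.
ROWS = [
    (122608204, "GT-AG", 0, 1.000000099473604e-05, 1592, 22607919, 1,
     155169241, 155170832, 10093,
     "TGG|GTGAGAAATT...TGTCTTTTAAAA/TGTCTTTTAAAA...CCCAG|AAT", 0, 1, 0.649),
    (122608205, "GT-AG", 0, 1.000000099473604e-05, 8701, 22607919, 2,
     155171007, 155179707, 10093,
     "CAG|GTACAGTAAC...GCAGCCTTATAA/GTGACCCTAACA...CTCAG|GTG", 0, 1, 4.413),
]

conn = sqlite3.connect(":memory:")
# Parent tables are absent in this sketch, so keep FK enforcement off.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(SCHEMA)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    ROWS,
)

QUERY = """
SELECT id, ordinal_index, length
FROM introns
WHERE transcript_id = ?
ORDER BY ordinal_index
"""

result = conn.execute(QUERY, (22607919,)).fetchall()

# Ask SQLite how it would run the query; the fourth column is the plan detail.
plan = conn.execute("EXPLAIN QUERY PLAN " + QUERY, (22607919,)).fetchall()
plan_text = " ".join(row[3] for row in plan)

print(result)
print(plan_text)
conn.close()
```

The exact wording of the query plan varies between SQLite versions, but it should name `idx_introns_transcript_id`, indicating that the equality filter is served by the index rather than a full table scan.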