introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
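The link between `scored_motifs` and `dinucleotide_pair` can be illustrated with a minimal sketch. It assumes, based only on the rows listed on this page, that a `scored_motifs` value has three `|`-separated parts and that the middle part is the abbreviated intron sequence, so its first and last two bases are the terminal dinucleotides. The helper name `terminal_dinucleotides` is hypothetical and not part of the database.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the 'XX-YY' terminal dinucleotide pair from a scored_motifs string.

    Assumption (not confirmed by the source): the text between the two '|'
    separators is the intron sequence, abbreviated with '...' and '/'.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"expected 3 '|'-separated parts, got {len(parts)}")
    intron = parts[1]
    if len(intron) < 4:
        raise ValueError("intron segment too short to hold two dinucleotides")
    # First two bases (donor end) and last two bases (acceptor end).
    return f"{intron[:2]}-{intron[-2:]}"


# Value copied from the first row of the table for transcript 15319320.
example = "CAG|GTAAAGTTCT...CTTGTTTTGCCT/AAAGTGCCCACT...GCTAG|GAA"
print(terminal_dinucleotides(example))  # GT-AG
```

For the example row this agrees with the stored `dinucleotide_pair` value `GT-AG`; whether the same holds for every row in the database is not verified here.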
24 rows where transcript_id = 15319320
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 82923664 | GT-AG | 0 | 1.000000099473604e-05 | 1555 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 1 | 2501058 | 2502612 | Glareola pratincola 43316 | CAG\|GTAAAGTTCT...CTTGTTTTGCCT/AAAGTGCCCACT...GCTAG\|GAA | 1 | 1 | 2.182 |
| 82923665 | GT-AG | 0 | 1.000000099473604e-05 | 793 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 2 | 2500141 | 2500933 | Glareola pratincola 43316 | CAG\|GTACTAGAGA...CCATCTTTATTT/AATATTGTGACC...TTCAG\|GTT | 2 | 1 | 5.606 |
| 82923666 | GT-AG | 0 | 0.0014661404158973 | 769 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 3 | 2499176 | 2499944 | Glareola pratincola 43316 | CAG\|GTATGCAACA...TTTTTCTTTTTT/GTGAAACTCAGA...TACAG\|AGT | 0 | 1 | 11.019 |
| 82923667 | GT-AG | 0 | 1.1468861532425655e-05 | 1079 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 4 | 2498001 | 2499079 | Glareola pratincola 43316 | CAG\|GTAAACAAGT...ATACCCTCAACA/CATACCCTCAAC...GGCAG\|CCT | 0 | 1 | 13.67 |
| 82923668 | GT-AG | 0 | 1.000000099473604e-05 | 501 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 5 | 2497359 | 2497859 | Glareola pratincola 43316 | AGA\|GTGAGTGGCG...CTTGCTTTGCTG/GCTTTGCTGAAT...CCTAG\|ATG | 0 | 1 | 17.564 |
| 82923669 | GT-AG | 0 | 1.000000099473604e-05 | 214 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 6 | 2497002 | 2497215 | Glareola pratincola 43316 | GAG\|GTGAGGGCCT...TGATTTTTATTT/TTTTTTCTGATC...TTTAG\|GGG | 2 | 1 | 21.513 |
| 82923670 | GT-AG | 0 | 1.6061859002497328e-05 | 1953 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 7 | 2494949 | 2496901 | Glareola pratincola 43316 | GAG\|GTACATAACC...CTTCTCTTTTCT/GGACTGTTCATG...GATAG\|GTG | 0 | 1 | 24.275 |
| 82923671 | GT-AG | 0 | 1.000000099473604e-05 | 1906 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 8 | 2492881 | 2494786 | Glareola pratincola 43316 | AAG\|GTAAAGTCCT...TATATCTTATTA/TTATATCTTATT...TCTAG\|ATC | 0 | 1 | 28.749 |
| 82923672 | GT-AG | 0 | 1.000000099473604e-05 | 702 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 9 | 2492026 | 2492727 | Glareola pratincola 43316 | GCA\|GTAAGTGAAT...ACTGTCATAATG/AGAATGCTTACT...CTCAG\|TTT | 0 | 1 | 32.974 |
| 82923673 | GT-AG | 0 | 1.000000099473604e-05 | 1435 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 10 | 2490360 | 2491794 | Glareola pratincola 43316 | AAG\|GTAGGATGGC...ATTTTCTTGTCT/GTCTTTTTCAAT...CTCAG\|GTG | 0 | 1 | 39.354 |
| 82923674 | GT-AG | 0 | 1.000000099473604e-05 | 585 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 11 | 2489700 | 2490284 | Glareola pratincola 43316 | AAG\|GTAAGGGGTC...TTACCTTTGACA/CAGGTGTTTACC...TCCAG\|ATT | 0 | 1 | 41.425 |
| 82923675 | GT-AG | 0 | 0.0001309481124854 | 940 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 12 | 2488676 | 2489615 | Glareola pratincola 43316 | CAG\|GTATGTCCTG...GATCTCTTGCAC/AGTTAATTAATT...CACAG\|CTT | 0 | 1 | 43.745 |
| 82923676 | GT-AG | 0 | 0.0006119753508802 | 408 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 13 | 2488159 | 2488566 | Glareola pratincola 43316 | AAG\|GTATGTTGCA...TGCTTCTCACCT/CTGCTTCTCACC...TGTAG\|ATA | 1 | 1 | 46.755 |
| 82923677 | GT-AG | 0 | 1.000000099473604e-05 | 657 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 14 | 2487323 | 2487979 | Glareola pratincola 43316 | CTG\|GTAAGTGGCA...TTATCTTTGTCC/TTTGTCCTCAGT...TACAG\|CTG | 0 | 1 | 51.698 |
| 82923678 | GT-AG | 0 | 0.0109288768491083 | 1306 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 15 | 2485890 | 2487195 | Glareola pratincola 43316 | GTG\|GTATGCAGTG...GTTGGCTTATTT/TGTTGGCTTATT...TCCAG\|ACA | 1 | 1 | 55.206 |
| 82923679 | GT-AG | 0 | 1.000000099473604e-05 | 1284 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 16 | 2484460 | 2485743 | Glareola pratincola 43316 | CAG\|GTACAGGTGC...TGTTTCTTCCCT/AAGCAATTCACA...TTCAG\|GAT | 0 | 1 | 59.238 |
| 82923680 | GT-AG | 0 | 1.000000099473604e-05 | 624 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 17 | 2483662 | 2484285 | Glareola pratincola 43316 | GAG\|GTGAGTGAGA...TTCTCCTAAAAC/ATTCTCCTAAAA...TTCAG\|CTC | 0 | 1 | 64.043 |
| 82923681 | GT-AG | 0 | 1.7574051793640918e-05 | 665 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 18 | 2482808 | 2483472 | Glareola pratincola 43316 | AAG\|GTAGGCCTTA...TTGTTTTTTAAT/TTGTTTTTTAAT...TGCAG\|CAC | 0 | 1 | 69.263 |
| 82923682 | GT-AG | 0 | 1.000000099473604e-05 | 435 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 19 | 2482184 | 2482618 | Glareola pratincola 43316 | TTG\|GTTCCAGCAA...TCTTTCTCATTT/CTCTTTCTCATT...CACAG\|TCT | 0 | 1 | 74.482 |
| 82923683 | GT-AG | 0 | 1.000000099473604e-05 | 4171 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 20 | 2477697 | 2481867 | Glareola pratincola 43316 | CAG\|GTAAGTACTG...AAAGTCTGACAT/GAAAGTCTGACA...GGCAG\|CTC | 1 | 1 | 83.209 |
| 82923684 | GT-AG | 0 | 9.769596615996702e-05 | 1223 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 21 | 2476418 | 2477640 | Glareola pratincola 43316 | AAG\|GTAAGCTCTC...GTTTTCTTCATT/GTTTTCTTCATT...GACAG\|CTG | 0 | 1 | 84.756 |
| 82923685 | GT-AG | 0 | 0.0002601065536248 | 399 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 22 | 2475852 | 2476250 | Glareola pratincola 43316 | GGC\|GTTCCACAGC...CAGCCTTTCATG/CAGCCTTTCATG...TTCAG\|CTG | 2 | 1 | 89.368 |
| 82923686 | GT-AG | 0 | 7.495025375678856e-05 | 1386 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 23 | 2474330 | 2475715 | Glareola pratincola 43316 | CCA\|GTAAGTCCTC...GTGGCTTTGAAG/GTGGCTTTGAAG...TTCAG\|AAA | 0 | 1 | 93.123 |
| 82923687 | GT-AG | 0 | 1.000000099473604e-05 | 726 | rna-gnl\|WGS:VWPO\|GLAPRA_R12986_mrna 15319320 | 24 | 2473532 | 2474257 | Glareola pratincola 43316 | GCG\|GTAAGTAGAT...CAGTGCTGAGCG/TGCTGACTCATT...CACAG\|CTC | 0 | 1 | 95.112 |
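As a quick consistency check on the coordinate columns, the sketch below tests whether `length` equals `end - start + 1`, which would mean `start` and `end` are inclusive positions. The four tuples are copied from the table for this transcript; the inclusive-coordinate reading is an inference from those values rather than something the column descriptions state, and the function name `inclusive_length` is hypothetical.

```python
# (ordinal_index, start, end, length) copied from rows of the table above.
rows = [
    (1, 2501058, 2502612, 1555),
    (2, 2500141, 2500933, 793),
    (3, 2499176, 2499944, 769),
    (20, 2477697, 2481867, 4171),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a span whose start and end positions are both included."""
    return end - start + 1


# Collect the ordinal index of any intron whose stored length disagrees.
mismatches = [
    ordinal
    for ordinal, start, end, length in rows
    if inclusive_length(start, end) != length
]
print(mismatches)  # []
```

The sampled rows all satisfy the relation. Note also that `start` decreases as `ordinal_index` increases in this table, which would be consistent with a transcript on the reverse strand, although the page itself does not state the strand.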
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
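The filter used for this page (`transcript_id = 15319320`) is the kind of lookup the `idx_introns_transcript_id` index is meant to serve. The following is a minimal sketch using Python's standard `sqlite3` module against an in-memory database. It is a reduced stand-in, not the real database: only a subset of the columns is created, the foreign keys are omitted, and just two rows from the table for this transcript are inserted.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced copy of the schema: a subset of columns, no foreign keys.
conn.executescript(
    """
    CREATE TABLE introns (
        "id" INTEGER PRIMARY KEY,
        "length" INTEGER,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        "start" INTEGER,
        "end" INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# Two rows copied from the table for transcript 15319320.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        (82923665, 793, 15319320, 2, 2500141, 2500933),
        (82923664, 1555, 15319320, 1, 2501058, 2502612),
    ],
)

query = (
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
result = conn.execute(query, (15319320,)).fetchall()
print(result)  # [(82923664, 1, 1555), (82923665, 2, 793)]

# Ask SQLite how it would run the query; the last field of each plan row
# is a human-readable description whose wording varies by SQLite version.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (15319320,)).fetchall()
for row in plan:
    print(row[-1])
```

On this reduced table the plan should mention `idx_introns_transcript_id`; the exact plan text depends on the SQLite version, and the planner's choice on the full database may differ once table statistics are available.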