introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
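As an illustration of how `phase` and `in_cds` work together, the sketch below tallies introns by phase. It is a minimal example, not the database's own tooling: it uses a reduced three-column stand-in for the `introns` table in an in-memory SQLite database, loaded with the `id`, `phase`, and `in_cds` values of the first six rows listed on this page.

```python
import sqlite3

# (id, phase, in_cds) copied from the first six rows of the table on this page.
rows = [
    (115634383, 0, 1),
    (115634384, 0, 1),
    (115634385, 1, 1),
    (115634386, 1, 1),
    (115634387, 0, 1),
    (115634388, 2, 1),
]

conn = sqlite3.connect(":memory:")
# Reduced, illustrative table: only the columns needed for this tally.
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, phase INTEGER, in_cds INTEGER)"
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?)", rows)

# phase is null outside the coding sequence, so restrict the tally to in_cds = 1.
phase_counts = dict(
    conn.execute(
        "SELECT phase, COUNT(*) FROM introns "
        "WHERE in_cds = 1 GROUP BY phase ORDER BY phase"
    ).fetchall()
)
print(phase_counts)  # {0: 3, 1: 2, 2: 1}
```

The same `GROUP BY phase` query should run unchanged against the full table, where an index on `phase` exists.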
37 rows where transcript_id = 21436556
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115634383 | GT-AG | 0 | 1.000000099473604e-05 | 775 | rna-XM_034060462.1 21436556 | 1 | 102062027 | 102062801 | Melopsittacus undulatus 13146 | CAG\|GTAAGGCAGC...CTGTCTTTGGCC/TTGGCCATCAGT...CTCAG\|GAC | 0 | 1 | 3.759 |
| 115634384 | GT-AG | 0 | 1.000000099473604e-05 | 954 | rna-XM_034060462.1 21436556 | 2 | 102062943 | 102063896 | Melopsittacus undulatus 13146 | AAG\|GTACAAGTCA...TCTACATTCACC/ATTACATTCATT...GACAG\|CTT | 0 | 1 | 6.478 |
| 115634385 | GT-AG | 0 | 1.000000099473604e-05 | 504 | rna-XM_034060462.1 21436556 | 3 | 102064054 | 102064557 | Melopsittacus undulatus 13146 | CAG\|GTGAGGATTC...GTTTCTTTGTTT/CTTTGTTTCACT...TGCAG\|TGG | 1 | 1 | 9.505 |
| 115634386 | GT-AG | 0 | 0.0112065806467936 | 1024 | rna-XM_034060462.1 21436556 | 4 | 102064669 | 102065692 | Melopsittacus undulatus 13146 | AAG\|GTAACTTCTA...TAATTTTTAATT/TAATTTTTAATT...TGCAG\|GAA | 1 | 1 | 11.644 |
| 115634387 | GT-AG | 0 | 0.0014999348707132 | 254 | rna-XM_034060462.1 21436556 | 5 | 102065887 | 102066140 | Melopsittacus undulatus 13146 | AAG\|GTACATTTTA...ATCTCTTTGTTT/CCATGTTTAATT...CGAAG\|AAA | 0 | 1 | 15.385 |
| 115634388 | GT-AG | 0 | 0.0080485313811347 | 233 | rna-XM_034060462.1 21436556 | 6 | 102066362 | 102066594 | Melopsittacus undulatus 13146 | AGG\|GTATGCAGTA...TTTTTTTTCTCT/AGTGAGGTAATG...TTCAG\|AGA | 2 | 1 | 19.645 |
| 115634389 | GT-AG | 0 | 1.000000099473604e-05 | 398 | rna-XM_034060462.1 21436556 | 7 | 102066677 | 102067074 | Melopsittacus undulatus 13146 | GAG\|GTGAGAACTT...GTTTTCTTTTTA/TTTCTTTTTAAA...TGAAG\|ATG | 0 | 1 | 21.226 |
| 115634390 | GT-AG | 0 | 4.390432573291285e-05 | 821 | rna-XM_034060462.1 21436556 | 8 | 102067163 | 102067983 | Melopsittacus undulatus 13146 | CAG\|GTAAGCATTC...GAGGTTTTAATT/ATTGTTTTCAAT...CCAAG\|GAA | 1 | 1 | 22.923 |
| 115634391 | GT-AG | 0 | 1.000000099473604e-05 | 757 | rna-XM_034060462.1 21436556 | 9 | 102068158 | 102068914 | Melopsittacus undulatus 13146 | CAT\|GTCAGTACTG...CAAGCCTTCACT/TGTAGTCTGACT...TGCAG\|TGC | 1 | 1 | 26.277 |
| 115634392 | GT-AG | 0 | 1.000000099473604e-05 | 1468 | rna-XM_034060462.1 21436556 | 10 | 102069052 | 102070519 | Melopsittacus undulatus 13146 | GGG\|GTGAGTGATT...TTTTTTTTAATA/TTTTTTTTAATA...CGTAG\|GCT | 0 | 1 | 28.918 |
| 115634393 | GT-AG | 0 | 1.000000099473604e-05 | 1181 | rna-XM_034060462.1 21436556 | 11 | 102070613 | 102071793 | Melopsittacus undulatus 13146 | GAG\|GTTAGTACCA...GCTGTTTTACTT/TGCTGTTTTACT...TTCAG\|GTA | 0 | 1 | 30.711 |
| 115634394 | GT-AG | 0 | 1.0247296007092146e-05 | 8643 | rna-XM_034060462.1 21436556 | 12 | 102071885 | 102080527 | Melopsittacus undulatus 13146 | CAG\|GTTTGTCTCT...ACAACGTTAATA/TAAACACTAATA...TCAAG\|GGG | 1 | 1 | 32.466 |
| 115634395 | GT-AG | 0 | 0.0108170530775754 | 1783 | rna-XM_034060462.1 21436556 | 13 | 102080691 | 102082473 | Melopsittacus undulatus 13146 | CAC\|GTATGTAGCT...ATTTTTTTAATC/ATTTTTTTAATC...TGCAG\|CCA | 2 | 1 | 35.608 |
| 115634396 | GT-AG | 0 | 0.000153074622781 | 1929 | rna-XM_034060462.1 21436556 | 14 | 102082667 | 102084595 | Melopsittacus undulatus 13146 | GAG\|GTATGGCTTG...TCCTCCTTCATC/GATTACTTCACA...TGTAG\|CTG | 0 | 1 | 39.329 |
| 115634397 | GT-AG | 0 | 4.764926024897767e-05 | 522 | rna-XM_034060462.1 21436556 | 15 | 102084655 | 102085176 | Melopsittacus undulatus 13146 | TTG\|GTAAGCTGAG...AGACACTTAATA/CACTTAATAATT...TACAG\|TTT | 2 | 1 | 40.467 |
| 115634398 | GT-AG | 0 | 0.0003274300206153 | 1706 | rna-XM_034060462.1 21436556 | 16 | 102085352 | 102087057 | Melopsittacus undulatus 13146 | AAG\|GTTTGTTTGC...CTCATTTTAATT/AATTGTCTCATT...TTTAG\|ATG | 0 | 1 | 43.84 |
| 115634399 | GT-AG | 0 | 1.469155271230668e-05 | 205 | rna-XM_034060462.1 21436556 | 17 | 102087152 | 102087356 | Melopsittacus undulatus 13146 | TAG\|GTAAAGTTTC...GGACCCTGAACT/AGGACCCTGAAC...TCTAG\|ATG | 1 | 1 | 45.653 |
| 115634400 | GT-AG | 0 | 1.000000099473604e-05 | 301 | rna-XM_034060462.1 21436556 | 18 | 102087488 | 102087788 | Melopsittacus undulatus 13146 | AAG\|GTACGCAAGT...TCTATTCTGACA/TCTATTCTGACA...CCTAG\|GGT | 0 | 1 | 48.178 |
| 115634401 | GT-AG | 0 | 1.000000099473604e-05 | 1802 | rna-XM_034060462.1 21436556 | 19 | 102087948 | 102089749 | Melopsittacus undulatus 13146 | AAG\|GTAAGTTACT...TTATACTTGCTA/GCTATGCAGACA...CCCAG\|GTA | 0 | 1 | 51.243 |
| 115634402 | GT-AG | 0 | 1.000000099473604e-05 | 248 | rna-XM_034060462.1 21436556 | 20 | 102089890 | 102090137 | Melopsittacus undulatus 13146 | AAG\|GTAAGGGGGA...ATCTCATTAGCA/TGTGTGTTTATT...TCCAG\|TCA | 2 | 1 | 53.943 |
| 115634403 | GT-AG | 0 | 1.000000099473604e-05 | 1616 | rna-XM_034060462.1 21436556 | 21 | 102090292 | 102091907 | Melopsittacus undulatus 13146 | AAG\|GTAAATGCTC...CTATCCTTTCTG/AGTCAGCTGAAT...GGCAG\|CTC | 0 | 1 | 56.912 |
| 115634404 | GT-AG | 0 | 1.674642826188475e-05 | 1574 | rna-XM_034060462.1 21436556 | 22 | 102092019 | 102093592 | Melopsittacus undulatus 13146 | CTG\|GTAGGTTTAA...GACTCTTTGCCT/ATTTGCTTCATT...TTCAG\|GGT | 0 | 1 | 59.051 |
| 115634405 | GT-AG | 0 | 1.000000099473604e-05 | 1021 | rna-XM_034060462.1 21436556 | 23 | 102093724 | 102094744 | Melopsittacus undulatus 13146 | CAG\|GTAAATCAAG...TATGCTTTAATA/TATGCTTTAATA...GACAG\|TGA | 2 | 1 | 61.577 |
| 115634406 | GT-AG | 0 | 0.0008700990963354 | 468 | rna-XM_034060462.1 21436556 | 24 | 102094931 | 102095398 | Melopsittacus undulatus 13146 | CCC\|GTAAGTCTTG...GCTTTTTTAATT/GCTTTTTTAATT...TTAAG\|CTC | 2 | 1 | 65.163 |
| 115634407 | GT-AG | 0 | 1.000000099473604e-05 | 1214 | rna-XM_034060462.1 21436556 | 25 | 102095467 | 102096680 | Melopsittacus undulatus 13146 | CAG\|GTACGGTAAT...TGCTTCTTTGCT/CATTTCCTCAGA...TTCAG\|AGT | 1 | 1 | 66.474 |
| 115634408 | GT-AG | 0 | 1.000000099473604e-05 | 518 | rna-XM_034060462.1 21436556 | 26 | 102096833 | 102097350 | Melopsittacus undulatus 13146 | AAG\|GTAAGTGGGG...TTTATCTTGATT/TTTATCTTGATT...AATAG\|GTG | 0 | 1 | 69.404 |
| 115634409 | GT-AG | 0 | 2.840200691951572e-05 | 924 | rna-XM_034060462.1 21436556 | 27 | 102097455 | 102098378 | Melopsittacus undulatus 13146 | CAG\|GTACAGTGCA...CCACTCTTATTT/CATGTACTGACA...CTCAG\|GTT | 2 | 1 | 71.409 |
| 115634410 | GT-AG | 0 | 1.000000099473604e-05 | 836 | rna-XM_034060462.1 21436556 | 28 | 102098450 | 102099285 | Melopsittacus undulatus 13146 | TTG\|GTAAGAATGA...TTGTCCTTTCTC/GAGTCTTTCAGT...TTCAG\|ACA | 1 | 1 | 72.778 |
| 115634411 | GT-AG | 0 | 1.000000099473604e-05 | 989 | rna-XM_034060462.1 21436556 | 29 | 102099504 | 102100492 | Melopsittacus undulatus 13146 | GAG\|GTCAGCTGTT...ACAGTTTTAACC/ACAGTTTTAACC...TGCAG\|CCT | 0 | 1 | 76.981 |
| 115634412 | GT-AG | 0 | 1.000000099473604e-05 | 1835 | rna-XM_034060462.1 21436556 | 30 | 102100628 | 102102462 | Melopsittacus undulatus 13146 | ATA\|GTAAGATTGT...GCTCCCTGCACT/GCACTACTCAAC...TGCAG\|GTG | 0 | 1 | 79.584 |
| 115634413 | GT-AG | 0 | 1.000000099473604e-05 | 1177 | rna-XM_034060462.1 21436556 | 31 | 102102586 | 102103762 | Melopsittacus undulatus 13146 | AAG\|GTAAACTGAC...GGAGGTTTATTG/TGGAGGTTTATT...TTCAG\|CAT | 0 | 1 | 81.955 |
| 115634414 | GT-AG | 0 | 6.054127284324926e-05 | 91 | rna-XM_034060462.1 21436556 | 32 | 102103858 | 102103948 | Melopsittacus undulatus 13146 | TCT\|GTAAGTACCT...TGATTCTTTACA/CCAGCTCTGATT...CTCAG\|GTT | 2 | 1 | 83.786 |
| 115634415 | GT-AG | 0 | 2.039646979882708e-05 | 527 | rna-XM_034060462.1 21436556 | 33 | 102104034 | 102104560 | Melopsittacus undulatus 13146 | GGA\|GTAAGTATTT...CCAGTTCTGACT/TTCTGACTCATT...ACCAG\|GTT | 0 | 1 | 85.425 |
| 115634416 | GT-AG | 0 | 3.093003222084759e-05 | 892 | rna-XM_034060462.1 21436556 | 34 | 102104762 | 102105653 | Melopsittacus undulatus 13146 | CTG\|GTAAGCTCAA...GTCTTCTTATGT/TGTCTTCTTATG...CACAG\|GTG | 0 | 1 | 89.3 |
| 115634417 | GT-AG | 0 | 1.000000099473604e-05 | 990 | rna-XM_034060462.1 21436556 | 35 | 102105816 | 102106805 | Melopsittacus undulatus 13146 | CAG\|GTAAGATCAG...TTTCCCTTTCCT/ACAAAAATCAGG...TGCAG\|CTG | 0 | 1 | 92.423 |
| 115634418 | GT-AG | 0 | 0.0797917092702018 | 579 | rna-XM_034060462.1 21436556 | 36 | 102106960 | 102107538 | Melopsittacus undulatus 13146 | TGG\|GTATTCTGCA...CTGTGTTTGATA/CTGTGTTTGATA...TGCAG\|ATG | 1 | 1 | 95.392 |
| 115634419 | GT-AG | 0 | 1.8025077604363007e-05 | 2621 | rna-XM_034060462.1 21436556 | 37 | 102107660 | 102110280 | Melopsittacus undulatus 13146 | AAG\|GTACAGTTAA...CTCCCTGTAACT/CTGTAACTGAAC...TTCAG\|GAG | 2 | 1 | 97.725 |
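
In the rows above, `length` appears to equal `end - start + 1`, which would mean `start` and `end` are inclusive coordinates. The page does not state this convention, so treat it as an inference from the listed values. The sketch below checks it on three rows copied from the table; `inclusive_length` is a hypothetical helper name, not part of the database.

```python
# (id, length, start, end) copied from three rows of the table above.
rows = [
    (115634383, 775, 102062027, 102062801),
    (115634385, 504, 102064054, 102064557),
    (115634414, 91, 102103858, 102103948),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end positions are both included."""
    return end - start + 1


# Collect the ids of any rows whose stored length disagrees with the coordinates.
mismatches = [
    intron_id
    for intron_id, length, start, end in rows
    if inclusive_length(start, end) != length
]
print(mismatches)  # []
```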
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
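
To show how the schema and its indexes support the filter used on this page (`transcript_id = 21436556`), the sketch below recreates the table and the `idx_introns_transcript_id` index in an empty in-memory SQLite database and asks the planner how it would run that filter. It assumes a local Python with the standard `sqlite3` module; the referenced `transcripts` and `genomes` tables are not created, which SQLite tolerates at table-creation time. The exact wording of the plan varies between SQLite versions, so the example only prints it.

```python
import sqlite3

# Schema copied from this page, with only the index relevant to the filter.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask the planner how it would execute the page's filter; no rows are needed.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (21436556,),
).fetchall()

# The fourth column of each plan row holds the human-readable step description.
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

If the printed plan names `idx_introns_transcript_id`, the filter is resolved through the index rather than a full table scan.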