introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
25 rows where transcript_id = 21436542
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, is_minor, score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115633951 | GT-AG | 0 | 1.000000099473604e-05 | 982 | rna-XM_034072702.1 21436542 | 1 | 102845359 | 102846340 | Melopsittacus undulatus 13146 | AAG|GTTAGAGCCA...GTTTCTTTATAT/TGTTTCTTTATA...TTAAG|GTC | 0 | 1 | 4.297 |
| 115633952 | AT-AC | 1 | 99.99999999673064 | 982 | rna-XM_034072702.1 21436542 | 2 | 102846460 | 102847441 | Melopsittacus undulatus 13146 | TTC|ATATCCTTTT...TTTGCCTTCACT/TTTGCCTTCACT...CTCAC|ATT | 2 | 1 | 6.326 |
| 115633953 | GT-AG | 0 | 8.092446230431844e-05 | 1564 | rna-XM_034072702.1 21436542 | 3 | 102847550 | 102849113 | Melopsittacus undulatus 13146 | TGA|GTAAGTATCT...TTTCCTTTGGCA/TAATTGCTAATA...CTTAG|ATT | 2 | 1 | 8.167 |
| 115633954 | GT-AG | 0 | 1.000000099473604e-05 | 778 | rna-XM_034072702.1 21436542 | 4 | 102849243 | 102850020 | Melopsittacus undulatus 13146 | GGC|GTAAGTGGTA...AATGTCCTGACT/AATGTCCTGACT...TACAG|GTA | 2 | 1 | 10.367 |
| 115633955 | GT-AG | 0 | 1.000000099473604e-05 | 654 | rna-XM_034072702.1 21436542 | 5 | 102850113 | 102850766 | Melopsittacus undulatus 13146 | CAG|GTAAGAAACC...TTTTTCTTCATT/TTTTTCTTCATT...CCTAG|GAC | 1 | 1 | 11.935 |
| 115633956 | GT-AG | 0 | 0.0116338315665101 | 2692 | rna-XM_034072702.1 21436542 | 6 | 102850989 | 102853680 | Melopsittacus undulatus 13146 | CAG|GTAACTTATC...TTCTTTTTAACT/TTCTTTTTAACT...TATAG|CAC | 1 | 1 | 15.72 |
| 115633957 | GT-AG | 0 | 1.000000099473604e-05 | 671 | rna-XM_034072702.1 21436542 | 7 | 102853739 | 102854409 | Melopsittacus undulatus 13146 | TGG|GTAAGTACAC...TACTTTTTATTT/TTTATTTTTACT...CTCAG|GGT | 2 | 1 | 16.709 |
| 115633958 | GT-AG | 0 | 0.0007184316054411 | 491 | rna-XM_034072702.1 21436542 | 8 | 102854552 | 102855042 | Melopsittacus undulatus 13146 | CAG|GTACCAGCTT...TATTTTTTATTT/CTATTTTTTATT...TACAG|ACC | 0 | 1 | 19.13 |
| 115633959 | GT-AG | 0 | 0.0245785499133715 | 939 | rna-XM_034072702.1 21436542 | 9 | 102855241 | 102856179 | Melopsittacus undulatus 13146 | GAG|GTATACATTA...ATGTTTTTGTTT/TTTTGTTTTAAT...TTCAG|TCA | 0 | 1 | 22.506 |
| 115633960 | GT-AG | 0 | 0.0022895346104238 | 327 | rna-XM_034072702.1 21436542 | 10 | 102856357 | 102856683 | Melopsittacus undulatus 13146 | TTG|GTAATCTATT...TTTTCCTTGTCT/TCAGAATTTATT...TTCAG|TCT | 0 | 1 | 25.524 |
| 115633961 | GT-AG | 0 | 1.000000099473604e-05 | 681 | rna-XM_034072702.1 21436542 | 11 | 102857089 | 102857769 | Melopsittacus undulatus 13146 | CTG|GTAAGGAAAA...ATCTCCCTAATC/CTTAATTTTATT...AGCAG|CCA | 0 | 1 | 32.43 |
| 115633962 | GT-AG | 0 | 1.000000099473604e-05 | 1941 | rna-XM_034072702.1 21436542 | 12 | 102857897 | 102859837 | Melopsittacus undulatus 13146 | AGG|GTAAGTGGTT...TTTTCTTTTGCA/TGATAACTAATT...TTTAG|AAC | 1 | 1 | 34.595 |
| 115633963 | GT-AG | 0 | 3.273214350700077e-05 | 344 | rna-XM_034072702.1 21436542 | 13 | 102860077 | 102860420 | Melopsittacus undulatus 13146 | TTG|GTAAGCATAC...TGCTTTTTAGAC/GATAAATTCATG...CACAG|GTT | 0 | 1 | 38.67 |
| 115633964 | GT-AG | 0 | 0.0001651210423708 | 735 | rna-XM_034072702.1 21436542 | 14 | 102860622 | 102861356 | Melopsittacus undulatus 13146 | CTG|GTAAGCTTCA...TATGTTTTATGT/CTATGTTTTATG...CGCAG|CTG | 0 | 1 | 42.097 |
| 115633965 | GT-AG | 0 | 1.000000099473604e-05 | 1670 | rna-XM_034072702.1 21436542 | 15 | 102861711 | 102863380 | Melopsittacus undulatus 13146 | GTG|GTGAGTTTTT...GTGACTTTAAAC/TGTTATTTCAAT...TTCAG|GTT | 0 | 1 | 48.133 |
| 115633966 | GT-AG | 0 | 1.000000099473604e-05 | 2158 | rna-XM_034072702.1 21436542 | 16 | 102863819 | 102865976 | Melopsittacus undulatus 13146 | CAG|GTAATGAATT...CCTTTCTTATTT/ACCTTTCTTATT...TGTAG|TTT | 0 | 1 | 55.601 |
| 115633967 | GT-AG | 0 | 1.000000099473604e-05 | 3044 | rna-XM_034072702.1 21436542 | 17 | 102866098 | 102869141 | Melopsittacus undulatus 13146 | AAG|GTAAGTCTTT...TGTACATTACTT/CATACGTTCATT...TTTAG|GTT | 1 | 1 | 57.664 |
| 115633968 | GT-AG | 0 | 1.000000099473604e-05 | 675 | rna-XM_034072702.1 21436542 | 18 | 102869297 | 102869971 | Melopsittacus undulatus 13146 | CTG|GTTAGTATAG...TCATCTCTGACT/ATTTATCTCAAT...TTCAG|GCA | 0 | 1 | 60.307 |
| 115633969 | GT-AG | 0 | 0.0007601413881528 | 262 | rna-XM_034072702.1 21436542 | 19 | 102870146 | 102870407 | Melopsittacus undulatus 13146 | GGA|GTAAGTTCTG...AATGCTTTATTT/TAATGCTTTATT...TTCAG|TTC | 0 | 1 | 63.274 |
| 115633970 | GT-AG | 0 | 1.000000099473604e-05 | 1609 | rna-XM_034072702.1 21436542 | 20 | 102870516 | 102872124 | Melopsittacus undulatus 13146 | AGG|GTAAGACTGA...CTATCTTTAAGG/AGGTTTTTGAGC...CCAAG|GTG | 0 | 1 | 65.115 |
| 115633971 | GT-AG | 0 | 1.000000099473604e-05 | 485 | rna-XM_034072702.1 21436542 | 21 | 102872404 | 102872888 | Melopsittacus undulatus 13146 | GTG|GTAAGTGCTG...TTTTCTATAATC/AAAACTTTTACC...TCTAG|GCA | 0 | 1 | 69.872 |
| 115633972 | GT-AG | 0 | 1.000000099473604e-05 | 1468 | rna-XM_034072702.1 21436542 | 22 | 102872943 | 102874410 | Melopsittacus undulatus 13146 | GAG|GTCAGTTGTT...TGCGTCTTATTT/TGATTTTTCATG...TGCAG|AAA | 0 | 1 | 70.793 |
| 115633973 | GT-AG | 0 | 0.2877327709765469 | 621 | rna-XM_034072702.1 21436542 | 23 | 102874553 | 102875173 | Melopsittacus undulatus 13146 | TAA|GTATACTGCT...ATTTCTTCAAAT/CATTTCTTCAAA...CTTAG|GTG | 1 | 1 | 73.214 |
| 115633974 | GT-AG | 0 | 1.000000099473604e-05 | 391 | rna-XM_034072702.1 21436542 | 24 | 102875275 | 102875665 | Melopsittacus undulatus 13146 | CTG|GTAAGACTGT...TATTTCTTTTTT/CCTACTTTCATA...TTTAG|AAC | 0 | 1 | 74.936 |
| 115633975 | GT-AG | 0 | 1.000000099473604e-05 | 1425 | rna-XM_034072702.1 21436542 | 25 | 102875937 | 102877361 | Melopsittacus undulatus 13146 | TTG|GTAAGTAAAA...TTATCCTTTCTA/TGATTTCTAACC...TTTAG|CCT | 1 | 1 | 79.557 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);