introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
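The column descriptions do not state the coordinate convention for start and end. The rows in this view are consistent with inclusive coordinates, so that length = end - start + 1. The following is a minimal sketch of that check; the helper name `intron_length` is hypothetical, and the three (start, end, length) triples are copied from the rows with ids 115634758, 115634759, and 115634765.

```python
# Hypothetical helper. Assumes `start` and `end` are both inclusive,
# which matches the sample rows but is not stated in the column descriptions.
def intron_length(start: int, end: int) -> int:
    return end - start + 1

# (start, end, length) triples copied from the table for this transcript.
samples = [
    (41026013, 41028481, 2469),
    (41028609, 41028875, 267),
    (41053406, 41053481, 76),
]

# Collect any rows where the stored length disagrees with the derived one.
mismatches = [s for s in samples if intron_length(s[0], s[1]) != s[2]]
print(mismatches)
```

An empty list means the stored length agrees with the inclusive-coordinate assumption for every sampled row.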
23 rows where transcript_id = 21436568
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115634758 | GT-AG | 0 | 1.000000099473604e-05 | 2469 | rna-XM_005152550.3 21436568 | 3 | 41026013 | 41028481 | Melopsittacus undulatus 13146 | AAA|GTAAGAAAGT...TTTTTTTTTTTT/CCTCTTCCCATT...CCCAG|TGT | 2 | 1 | 6.081 |
| 115634759 | GT-AG | 0 | 1.000000099473604e-05 | 267 | rna-XM_005152550.3 21436568 | 4 | 41028609 | 41028875 | Melopsittacus undulatus 13146 | ACA|GTAAGTGCAT...TTGCTTTTACTC/TTTATTTTCATT...TGTAG|GAT | 0 | 1 | 8.605 |
| 115634760 | GT-AG | 0 | 1.000000099473604e-05 | 10673 | rna-XM_005152550.3 21436568 | 5 | 41029047 | 41039719 | Melopsittacus undulatus 13146 | GTG|GTAAGTAAAT...TCTTTTTTAATA/TATTTTTTTATT...CCCAG|GAA | 0 | 1 | 12.003 |
| 115634761 | GT-AG | 0 | 0.0038824244165216 | 1048 | rna-XM_005152550.3 21436568 | 6 | 41039923 | 41040970 | Melopsittacus undulatus 13146 | ACA|GTAGGTTTGC...TTCTTTTTACTC/GTTCTTTTTACT...AAAAG|CTT | 2 | 1 | 16.037 |
| 115634762 | GT-AG | 0 | 1.000000099473604e-05 | 4527 | rna-XM_005152550.3 21436568 | 7 | 41041407 | 41045933 | Melopsittacus undulatus 13146 | AGG|GTAAGTTAAC...ATTTTTCTATTT/CAACTGCTAATA...TCCAG|CTT | 0 | 1 | 24.702 |
| 115634763 | GT-AG | 0 | 1.000000099473604e-05 | 4509 | rna-XM_005152550.3 21436568 | 8 | 41046105 | 41050613 | Melopsittacus undulatus 13146 | CAG|GTAATACACA...TACTTTTTGAAT/TACTTTTTGAAT...TCTAG|GGA | 0 | 1 | 28.1 |
| 115634764 | GT-AG | 0 | 2.0948703907994935e-05 | 2420 | rna-XM_005152550.3 21436568 | 9 | 41050799 | 41053218 | Melopsittacus undulatus 13146 | GAA|GTAAGTTATG...AAGTTATTATCT/TATCTTCTAAAA...TGTAG|GTG | 2 | 1 | 31.777 |
| 115634765 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-XM_005152550.3 21436568 | 10 | 41053406 | 41053481 | Melopsittacus undulatus 13146 | GAG|GTAAGTAAGT...TAAGCCTTTGTG/TAACAACTAAAT...CTCAG|TGG | 0 | 1 | 35.493 |
| 115634766 | GT-AG | 0 | 0.0001014351647292 | 878 | rna-XM_005152550.3 21436568 | 11 | 41053564 | 41054441 | Melopsittacus undulatus 13146 | TCA|GTAAGTACTT...TGGTTTTTAAAT/TGGTTTTTAAAT...CCCAG|GGA | 1 | 1 | 37.122 |
| 115634767 | GT-AG | 0 | 0.0559051015705138 | 1042 | rna-XM_005152550.3 21436568 | 12 | 41054504 | 41055545 | Melopsittacus undulatus 13146 | TGT|GTATGTAGTC...GTCATCTTGATA/ATATTGCTTATT...TACAG|ACA | 0 | 1 | 38.355 |
| 115634768 | GT-AG | 0 | 1.000000099473604e-05 | 2430 | rna-XM_005152550.3 21436568 | 13 | 41055650 | 41058079 | Melopsittacus undulatus 13146 | CAG|GTAAAGCACC...TTAATCTTACTC/TTTAATCTTACT...TTAAG|GGT | 2 | 1 | 40.421 |
| 115634769 | GT-AG | 0 | 0.010129927824921 | 6446 | rna-XM_005152550.3 21436568 | 14 | 41058207 | 41064652 | Melopsittacus undulatus 13146 | AAG|GTATTTTTGT...ATTCTGTTAACT/GTTTGTTTCATA...TGTAG|GCC | 0 | 1 | 42.945 |
| 115634770 | GT-AG | 0 | 1.000000099473604e-05 | 3505 | rna-XM_005152550.3 21436568 | 15 | 41066569 | 41070073 | Melopsittacus undulatus 13146 | GAG|GTACTGTAAG...TATTCATTTTCT/AGCCTATTCATT...TTAAG|TCC | 2 | 1 | 81.021 |
| 115634771 | GT-AG | 0 | 1.000000099473604e-05 | 2258 | rna-XM_005152550.3 21436568 | 16 | 41070234 | 41072491 | Melopsittacus undulatus 13146 | CAG|GTAAAGCTGA...CATTTCCTATTT/TCCTATTTCAGT...TTCAG|ACC | 0 | 1 | 84.201 |
| 115634772 | GT-AG | 0 | 4.103177824681927e-05 | 121 | rna-XM_005152550.3 21436568 | 17 | 41072601 | 41072721 | Melopsittacus undulatus 13146 | AGG|GTAAGCAAAG...TGATCCTTAGTA/TTTCTATTAACT...CACAG|ATT | 1 | 1 | 86.367 |
| 115634773 | GT-AG | 0 | 0.0002337037561176 | 1930 | rna-XM_005152550.3 21436568 | 18 | 41072963 | 41074892 | Melopsittacus undulatus 13146 | GCA|GTAAGTATGG...TTTTTTTTATCA/TTTTTTATCATT...TTTAG|TGA | 2 | 1 | 91.157 |
| 115634774 | GT-AG | 0 | 1.000000099473604e-05 | 1309 | rna-XM_005152550.3 21436568 | 19 | 41074945 | 41076253 | Melopsittacus undulatus 13146 | AGA|GTGAGTAGAT...GGTGTCTTACTT/AGGTGTCTTACT...TTCAG|ACT | 0 | 1 | 92.19 |
| 115634775 | GT-AG | 0 | 2.954810205855715e-05 | 1381 | rna-XM_005152550.3 21436568 | 20 | 41076302 | 41077682 | Melopsittacus undulatus 13146 | TTG|GTAAGTATTT...TACCTTTTATCT/CATTTAATCATA...TATAG|ATG | 0 | 1 | 93.144 |
| 115634776 | GT-AG | 0 | 0.0004112668111329 | 1096 | rna-XM_005152550.3 21436568 | 21 | 41077742 | 41078837 | Melopsittacus undulatus 13146 | AGA|GTAAGTATGG...AGTTCCTTGATT/CTTTTTTTTAGT...TTCAG|TTT | 2 | 1 | 94.316 |
| 115634777 | GT-AG | 0 | 1.000000099473604e-05 | 2884 | rna-XM_005152550.3 21436568 | 22 | 41078975 | 41081858 | Melopsittacus undulatus 13146 | GTG|GTAAGTGTCA...ATTATCTTAAAT/CTGTAGTTGATT...TTTAG|CTT | 1 | 1 | 97.039 |
| 115634778 | GT-AG | 0 | 1.000000099473604e-05 | 338 | rna-XM_005152550.3 21436568 | 23 | 41081930 | 41082267 | Melopsittacus undulatus 13146 | AAG|GTAAGGAATT...GATCCCATAGTT/ATAGTTCTAATG...TGTAG|GCA | 0 | 1 | 98.45 |
| 115645513 | GT-AG | 0 | 3.426088160141214e-05 | 329 | rna-XM_005152550.3 21436568 | 1 | 41023670 | 41023998 | Melopsittacus undulatus 13146 | TGG|GTAAGTTGTA...GACATTTTGATA/GACATTTTGATA...TGCAG|ATC |  | 0 | 2.365 |
| 115645514 | GT-AG | 0 | 1.3306701168600724e-05 | 1770 | rna-XM_005152550.3 21436568 | 2 | 41024115 | 41025884 | Melopsittacus undulatus 13146 | AAG|GTAAATCTTT...TTTTTATTAATT/TTTTTATTAATT...TTCAG|TAT |  | 0 | 4.67 |
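The view above is sorted by id, which places the first two introns of the transcript (ordinal_index 1 and 2) at the bottom because their ids are higher. The following sketch shows how the same filter could be run with the rows in transcript order instead. It uses an in-memory SQLite database with a reduced, illustrative copy of the table (four rows and four columns copied from the view), not the real database file.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced, illustrative copy of the table: only the columns used here.
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, length INTEGER)"
)
# Four rows copied from the view above.
rows = [
    (115634758, 21436568, 3, 2469),
    (115634759, 21436568, 4, 267),
    (115645513, 21436568, 1, 329),
    (115645514, 21436568, 2, 1770),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", rows)

# Same filter as the view, but ordered by position within the transcript.
ordered = conn.execute(
    "SELECT ordinal_index, id, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (21436568,),
).fetchall()
for row in ordered:
    print(row)
```

Binding the transcript id as a parameter rather than formatting it into the SQL string keeps the query safe to reuse with other ids.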
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
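The index on transcript_id is what allows a filter such as transcript_id = 21436568 to avoid scanning the whole table. The following sketch asks SQLite for its query plan against a trimmed, illustrative copy of the schema (foreign keys and most columns omitted, no data loaded). The exact wording of the plan varies between SQLite versions, so the code only inspects the detail text.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Trimmed schema for illustration: foreign keys and unrelated columns omitted.
conn.executescript("""
CREATE TABLE introns (
  id INTEGER PRIMARY KEY,
  is_minor INTEGER,
  score REAL,
  transcript_id INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
CREATE INDEX idx_introns_is_minor ON introns (is_minor);
""")

# EXPLAIN QUERY PLAN returns one row per plan step; the last column
# holds the human-readable description of that step.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (21436568,),
).fetchall()
details = [row[-1] for row in plan]
print(details)
```

If the planner uses the index, the printed detail mentions idx_introns_transcript_id. On the full database the plan may differ, since the planner can draw on statistics that this empty sketch does not have.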