introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
29 rows where transcript_id = 21436584
This data as json, CSV (advanced)
Suggested facets: is_minor, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115635113 | GT-AG | 0 | 1.000000099473604e-05 | 6588 | rna-XM_031052854.2 21436584 | 1 | 44760177 | 44766764 | Melopsittacus undulatus 13146 | AAG|GTCGGTGCCG...ACTATTTTAACT/ACTATTTTAACT...TCCAG|CTC | 1 | 1 | 3.589 |
| 115635114 | GT-AG | 0 | 1.000000099473604e-05 | 2648 | rna-XM_031052854.2 21436584 | 2 | 44766960 | 44769607 | Melopsittacus undulatus 13146 | GTG|GTAAGTAATA...GTGTTTTTAGTT/TTTAGTTTAATA...GTCAG|ATC | 1 | 1 | 8.048 |
| 115635115 | GT-AG | 0 | 0.0408602683052801 | 3046 | rna-XM_031052854.2 21436584 | 3 | 44769692 | 44772737 | Melopsittacus undulatus 13146 | CAG|GTATTTTTTG...CCTATCTTATTG/AATATTCTAATT...TCCAG|CTA | 1 | 1 | 9.968 |
| 115635116 | GT-AG | 1 | 99.51184138320944 | 1388 | rna-XM_031052854.2 21436584 | 4 | 44772828 | 44774215 | Melopsittacus undulatus 13146 | CAT|GTATCCTTTG...ATGTCCTTAACG/AATGTCCTTAAC...TTTAG|GTA | 1 | 1 | 12.026 |
| 115635117 | GT-AG | 0 | 1.000000099473604e-05 | 609 | rna-XM_031052854.2 21436584 | 5 | 44774391 | 44774999 | Melopsittacus undulatus 13146 | AAG|GTGGGTCTTC...TTTTTTTTGTTG/GACAGGATAAAT...TTCAG|GGC | 2 | 1 | 16.027 |
| 115635118 | GT-AG | 0 | 7.33745736869447e-05 | 1115 | rna-XM_031052854.2 21436584 | 6 | 44775154 | 44776268 | Melopsittacus undulatus 13146 | CAG|GTATTACAGT...GTGTTCTTAAAA/AGTGTTCTTAAA...GGCAG|GAT | 0 | 1 | 19.547 |
| 115635119 | GT-AG | 0 | 0.0580847760163774 | 1285 | rna-XM_031052854.2 21436584 | 7 | 44776357 | 44777641 | Melopsittacus undulatus 13146 | AAG|GTATACTTAC...TTTGTTTCAATC/GTTATATTTACT...AACAG|ATG | 1 | 1 | 21.559 |
| 115635120 | GT-AG | 0 | 1.000000099473604e-05 | 968 | rna-XM_031052854.2 21436584 | 8 | 44777890 | 44778857 | Melopsittacus undulatus 13146 | CAG|GTCAGTTTAC...GTGATCTAATTT/AGTGATCTAATT...TGCAG|TTA | 0 | 1 | 27.229 |
| 115635121 | GT-AG | 0 | 6.489163262122643e-05 | 1322 | rna-XM_031052854.2 21436584 | 9 | 44778974 | 44780295 | Melopsittacus undulatus 13146 | GCT|GTAAGTAATG...ATAATTTTAATA/ATAATTTTAATA...TATAG|GTA | 2 | 1 | 29.881 |
| 115635122 | GT-AG | 0 | 1.000000099473604e-05 | 1261 | rna-XM_031052854.2 21436584 | 10 | 44780447 | 44781707 | Melopsittacus undulatus 13146 | CTG|GTTGGTACTA...AATTTGTTGAGA/ACAGTACTGACT...TGTAG|GAA | 0 | 1 | 33.333 |
| 115635123 | GT-AG | 0 | 0.0455339550830052 | 1323 | rna-XM_031052854.2 21436584 | 11 | 44781809 | 44783131 | Melopsittacus undulatus 13146 | CAA|GTATGTTGTT...CTTGCTTTGATA/CTTGCTTTGATA...TTTAG|GAA | 2 | 1 | 35.642 |
| 115635124 | GT-AG | 0 | 1.000000099473604e-05 | 950 | rna-XM_031052854.2 21436584 | 12 | 44783238 | 44784187 | Melopsittacus undulatus 13146 | GAG|GTAAAGCAGT...ATTCTCTTTTTT/TTTGTATTCAAC...AATAG|GAT | 0 | 1 | 38.066 |
| 115635125 | GT-AG | 0 | 1.3097307433139635e-05 | 1271 | rna-XM_031052854.2 21436584 | 13 | 44784320 | 44785590 | Melopsittacus undulatus 13146 | CAG|GTAAGCATTA...ATGGCATTATTT/AATCAACTTACT...TGCAG|AAA | 0 | 1 | 41.084 |
| 115635126 | GT-AG | 0 | 1.000000099473604e-05 | 1070 | rna-XM_031052854.2 21436584 | 14 | 44785816 | 44786885 | Melopsittacus undulatus 13146 | AAG|GTAAGTGACT...CTTTTCTTCTTT/ATGGACTTCATG...TTCAG|AAC | 0 | 1 | 46.228 |
| 115635127 | GT-AG | 0 | 1.000000099473604e-05 | 681 | rna-XM_031052854.2 21436584 | 15 | 44786995 | 44787675 | Melopsittacus undulatus 13146 | AAG|GTTAGAAAAT...ACTGTCTTACTC/ACTGTTCTCATC...ACTAG|GTG | 1 | 1 | 48.72 |
| 115635128 | GT-AG | 0 | 1.000000099473604e-05 | 181 | rna-XM_031052854.2 21436584 | 16 | 44787871 | 44788051 | Melopsittacus undulatus 13146 | AAG|GTAAAGTTAT...ATTTTCTTGCAC/TTTTCTTGCACT...AATAG|AAC | 1 | 1 | 53.178 |
| 115635129 | GT-AG | 0 | 1.000000099473604e-05 | 1071 | rna-XM_031052854.2 21436584 | 17 | 44788183 | 44789253 | Melopsittacus undulatus 13146 | CTG|GTAAGCAAAG...TCTGCCTGAGAG/TAGTATCTGAAA...TTCAG|GGA | 0 | 1 | 56.173 |
| 115635130 | GT-AG | 0 | 1.000000099473604e-05 | 622 | rna-XM_031052854.2 21436584 | 18 | 44789329 | 44789950 | Melopsittacus undulatus 13146 | GTG|GTAAGTAATG...ACTTTCTTCTTC/AAAAATTTAACA...CACAG|GCA | 0 | 1 | 57.888 |
| 115635131 | GT-AG | 0 | 1.000000099473604e-05 | 963 | rna-XM_031052854.2 21436584 | 19 | 44790078 | 44791040 | Melopsittacus undulatus 13146 | CAG|GTAAGTGTAT...AAATTTTTATAA/AAAATTTTTATA...TTTAG|GAA | 1 | 1 | 60.791 |
| 115635132 | GT-AG | 0 | 0.0002984922836078 | 924 | rna-XM_031052854.2 21436584 | 20 | 44791199 | 44792122 | Melopsittacus undulatus 13146 | GAG|GTATGTAGCT...TTTTTCTTTTCT/AATTAGCTCAGT...TACAG|GTG | 0 | 1 | 64.403 |
| 115635133 | GT-AG | 0 | 0.0006293913901043 | 592 | rna-XM_031052854.2 21436584 | 21 | 44792566 | 44793157 | Melopsittacus undulatus 13146 | GTG|GTAAGTTTTA...TTTTCCTTCTCC/TTTTTTTTCTTT...CCTAG|TCA | 2 | 1 | 74.531 |
| 115635134 | GT-AG | 0 | 0.0035647473855 | 1309 | rna-XM_031052854.2 21436584 | 22 | 44793306 | 44794614 | Melopsittacus undulatus 13146 | ACT|GTAAGTTCTT...TTGGTCTTGACA/TTGGTCTTGACA...CTTAG|AGT | 0 | 1 | 77.915 |
| 115635135 | GT-AG | 0 | 1.000000099473604e-05 | 850 | rna-XM_031052854.2 21436584 | 23 | 44794712 | 44795561 | Melopsittacus undulatus 13146 | AAG|GTGTGTATCA...TTTTTTTTTTTT/AAAAATATCATG...TCCAG|ATA | 1 | 1 | 80.133 |
| 115635136 | GT-AG | 0 | 2.654415999935801e-05 | 115 | rna-XM_031052854.2 21436584 | 24 | 44795631 | 44795745 | Melopsittacus undulatus 13146 | AAG|GTAGGTTCTC...AATTCCTTTTCT/TGACTTCTCATA...TTTAG|AAA | 1 | 1 | 81.71 |
| 115635137 | GT-AG | 0 | 1.000000099473604e-05 | 771 | rna-XM_031052854.2 21436584 | 25 | 44795787 | 44796557 | Melopsittacus undulatus 13146 | CAG|GTAGGTGAAC...GAATCCTTTTCT/CCTTTTCTGAAA...TGCAG|ATA | 0 | 1 | 82.647 |
| 115635138 | GT-AG | 0 | 0.0001687298392785 | 570 | rna-XM_031052854.2 21436584 | 26 | 44796744 | 44797313 | Melopsittacus undulatus 13146 | AAG|GTATGTATGG...AATGTCTTTGTA/CTTTGTATTATG...TGTAG|GCA | 0 | 1 | 86.9 |
| 115635139 | GT-AG | 0 | 7.354173575025268e-05 | 7169 | rna-XM_031052854.2 21436584 | 27 | 44797416 | 44804584 | Melopsittacus undulatus 13146 | CAG|GTAATCTAGT...AATATTTTAAAT/AATATTTTAAAT...TTCAG|GAA | 0 | 1 | 89.232 |
| 115635140 | GT-AG | 0 | 1.000000099473604e-05 | 1552 | rna-XM_031052854.2 21436584 | 28 | 44804731 | 44806282 | Melopsittacus undulatus 13146 | AAG|GTTAGTGCAA...TTTTGTTTATTA/ATTTTGTTTATT...TGCAG|CCT | 2 | 1 | 92.57 |
| 115635141 | GT-AG | 0 | 1.000000099473604e-05 | 1980 | rna-XM_031052854.2 21436584 | 29 | 44806373 | 44808352 | Melopsittacus undulatus 13146 | AAG|GTAATGTGTG...AATTTTGTAACC/AATTTTGTAACC...TTCAG|CCC | 2 | 1 | 94.627 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);