introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
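The `start`, `end`, and `length` columns can be cross-checked against each other. The sketch below assumes that `start` and `end` are inclusive coordinates; the schema does not say so, but the rows for transcript 22631654 are consistent with it. The helper name `intron_length` is hypothetical.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, ASSUMING start and end are both inclusive."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# (start, end, length) copied from three rows of transcript 22631654.
rows = [
    (9174557, 9176081, 1525),
    (9173420, 9173482, 63),
    (9168130, 9172929, 4800),
]

for start, end, length in rows:
    # The stored length matches the inclusive-coordinate formula.
    assert intron_length(start, end) == length
```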
26 rows where transcript_id = 22631654
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 122792666 | GT-AG | 0 | 1.000000099473604e-05 | 1525 | rna-XM_009408628.2 22631654 | 2 | 9174557 | 9176081 | Musa acuminata 4641 | CAG\|GTGGTGAATG...GTATCGTTAATT/GTATCGTTAATT...TGCAG\|GTA | 2 | 1 | 7.946 |
| 122792667 | GT-AG | 0 | 0.000607579871721 | 728 | rna-XM_009408628.2 22631654 | 3 | 9173645 | 9174372 | Musa acuminata 4641 | ACA\|GTAAGTTTCT...TGTTATTTGACA/CTAGTTTTCATT...TGTAG\|ATT | 0 | 1 | 13.167 |
| 122792668 | GT-AG | 0 | 1.000000099473604e-05 | 63 | rna-XM_009408628.2 22631654 | 4 | 9173420 | 9173482 | Musa acuminata 4641 | GAG\|GTAAGGGATC...TCCTTTTTAATC/TCCTTTTTAATC...TACAG\|AAG | 0 | 1 | 17.764 |
| 122792669 | GT-AG | 0 | 1.4900937506044006e-05 | 87 | rna-XM_009408628.2 22631654 | 5 | 9173095 | 9173181 | Musa acuminata 4641 | TAG\|GTAGGTTGTA...TGTCCCTTTTTT/TTGGATTTGATA...TTCAG\|CTG | 1 | 1 | 24.518 |
| 122792670 | GT-AG | 0 | 0.1192737782431889 | 81 | rna-XM_009408628.2 22631654 | 6 | 9172977 | 9173057 | Musa acuminata 4641 | AAG\|GTATTCTATG...CTGCTTTTAAAT/AATCCTTTTATT...TACAG\|ATT | 2 | 1 | 25.568 |
| 122792671 | GT-AG | 0 | 9.783090668277648e-05 | 4800 | rna-XM_009408628.2 22631654 | 7 | 9168130 | 9172929 | Musa acuminata 4641 | TCG\|GTATAAACAA...GTATCATTGATT/ATTATTATCATC...TGCAG\|GCT | 1 | 1 | 26.901 |
| 122792672 | GT-AG | 0 | 8.209570354141014e-05 | 209 | rna-XM_009408628.2 22631654 | 8 | 9167883 | 9168091 | Musa acuminata 4641 | AGG\|GTAAATTTCT...TGGTTCTTATGC/TATCATCTGATT...TGCAG\|CGC | 0 | 1 | 27.98 |
| 122792673 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-XM_009408628.2 22631654 | 9 | 9167755 | 9167860 | Musa acuminata 4641 | TTG\|GTAAGAAAAA...TTTGTTTTATTG/GTTTGTTTTATT...TACAG\|ATA | 1 | 1 | 28.604 |
| 122792674 | GT-AG | 0 | 0.0027542381786522 | 225 | rna-XM_009408628.2 22631654 | 10 | 9167317 | 9167541 | Musa acuminata 4641 | GTG\|GTATAATTTC...CATCTCTTTTCT/GATTTGTTCACG...TGCAG\|TTG | 1 | 1 | 34.648 |
| 122792675 | GT-AG | 0 | 0.0016967230170828 | 88 | rna-XM_009408628.2 22631654 | 11 | 9167072 | 9167159 | Musa acuminata 4641 | TTT\|GTAAGTTTGT...TTATTTTTGAAA/TTCTTGTTGATT...CACAG\|TAA | 2 | 1 | 39.103 |
| 122792676 | GT-AG | 0 | 1.000000099473604e-05 | 1292 | rna-XM_009408628.2 22631654 | 12 | 9165728 | 9167019 | Musa acuminata 4641 | CAG\|GTAATAACTT...CTGCTTTTAAGA/CTGCTTTTAAGA...TGCAG\|ATG | 0 | 1 | 40.579 |
| 122792677 | GT-AG | 0 | 1.000000099473604e-05 | 105 | rna-XM_009408628.2 22631654 | 13 | 9165551 | 9165655 | Musa acuminata 4641 | ACA\|GTTATATCCA...TTTTCCCTGATA/TATTATCTGATC...TATAG\|GTG | 0 | 1 | 42.622 |
| 122792678 | GT-AG | 0 | 0.030863751283527 | 121 | rna-XM_009408628.2 22631654 | 14 | 9165380 | 9165500 | Musa acuminata 4641 | TCA\|GTATGTCACA...GTCGTTTTAATT/GTCGTTTTAATT...TTTAG\|ATT | 2 | 1 | 44.041 |
| 122792679 | GT-AG | 0 | 0.0184885904400497 | 91 | rna-XM_009408628.2 22631654 | 15 | 9165171 | 9165261 | Musa acuminata 4641 | AAG\|GTACACTTTC...TGTTATTTGATA/TGTTATTTGATA...CTCAG\|AAT | 0 | 1 | 47.389 |
| 122792680 | GT-AG | 0 | 1.000000099473604e-05 | 78 | rna-XM_009408628.2 22631654 | 16 | 9165029 | 9165106 | Musa acuminata 4641 | AAG\|GTTATTTCAC...ATATTCTTATTT/AATATTCTTATT...CACAG\|AAA | 1 | 1 | 49.205 |
| 122792681 | GT-AG | 0 | 0.0875853441933092 | 848 | rna-XM_009408628.2 22631654 | 17 | 9164097 | 9164944 | Musa acuminata 4641 | TAG\|GTACCTATGT...TTTTTTTTAAAA/AAAATTTTGACT...TGCAG\|AAC | 1 | 1 | 51.589 |
| 122792682 | GT-AG | 0 | 0.0036611155361387 | 117 | rna-XM_009408628.2 22631654 | 18 | 9163854 | 9163970 | Musa acuminata 4641 | CTT\|GTATGTAGGT...TTCTCCATACTT/TTTAACCTCACT...AAAAG\|GGA | 1 | 1 | 55.165 |
| 122792683 | GT-AG | 0 | 0.85571828701557 | 80 | rna-XM_009408628.2 22631654 | 19 | 9163577 | 9163656 | Musa acuminata 4641 | CAG\|GTTTCCTCTG...TTTTTCTTACAT/ATTTTTCTTACA...TGCAG\|GAC | 0 | 1 | 60.755 |
| 122792684 | GT-AG | 0 | 0.0018371823488617 | 97 | rna-XM_009408628.2 22631654 | 20 | 9163291 | 9163387 | Musa acuminata 4641 | AAG\|GTTTCACTTG...TCTTCCTGAATG/CTTTTCCTCATT...GTAAG\|GCA | 0 | 1 | 66.118 |
| 122792685 | GT-AG | 0 | 0.0149319108374274 | 128 | rna-XM_009408628.2 22631654 | 21 | 9162911 | 9163038 | Musa acuminata 4641 | ATG\|GTATGTCTAA...TTGTTCTTAATA/CTTGTTCTTAAT...TGCAG\|TTG | 0 | 1 | 73.269 |
| 122792686 | GT-AG | 0 | 0.0405222787323439 | 120 | rna-XM_009408628.2 22631654 | 22 | 9162741 | 9162860 | Musa acuminata 4641 | AAG\|GTATTCTACT...ATTTCTTTTTCT/CTACTGTTCATG...TACAG\|AGC | 2 | 1 | 74.688 |
| 122792687 | GT-AG | 0 | 6.746743235119129e-05 | 1322 | rna-XM_009408628.2 22631654 | 23 | 9161401 | 9162722 | Musa acuminata 4641 | CAG\|GTACAATATC...CACCCTTTGATA/TGATACTTCACC...TGCAG\|CCC | 2 | 1 | 75.199 |
| 122792688 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_009408628.2 22631654 | 24 | 9161232 | 9161317 | Musa acuminata 4641 | ATG\|GTAAGATTTA...AAATTCTTATGT/CAAATTCTTATG...CGAAG\|TTG | 1 | 1 | 77.554 |
| 122792689 | GT-AG | 0 | 1.8775271687511897e-05 | 173 | rna-XM_009408628.2 22631654 | 25 | 9160860 | 9161032 | Musa acuminata 4641 | AAT\|GTAAGTGGCT...TTATTCTTGACT/TTATTCTTGACT...TATAG\|GCC | 2 | 1 | 83.201 |
| 122792690 | GT-AG | 0 | 0.0104809672619533 | 127 | rna-XM_009408628.2 22631654 | 26 | 9160695 | 9160821 | Musa acuminata 4641 | AAG\|GTATGCGTAC...TTTTCCTTGAAT/CCGGTGTTTACA...TTCAG\|GAT | 1 | 1 | 84.279 |
| 122802445 | GT-AG | 0 | 0.0050480216818804 | 421 | rna-XM_009408628.2 22631654 | 1 | 9176203 | 9176623 | Musa acuminata 4641 | AAG\|GTAAACTTCT...ATTTCTTTGATC/TTTGATCTAATT...GCCAG\|ATT |  | 0 | 5.675 |
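The table above is sorted by `id`, so the first intron (`ordinal_index` 1) appears last. A minimal sketch of retrieving the same rows in transcript order is shown below. It assumes an SQLite database, uses a reduced in-memory copy of the `introns` table populated with three rows from the table above, and introduces a hypothetical helper `introns_in_order`.

```python
import sqlite3

# Reduced, in-memory stand-in for the real table (only four columns kept).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        length INTEGER
    )
    """
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (122792666, 22631654, 2, 1525),
        (122792667, 22631654, 3, 728),
        (122802445, 22631654, 1, 421),
    ],
)


def introns_in_order(conn, transcript_id):
    """Return (ordinal_index, id, length) tuples sorted by position in the transcript."""
    cur = conn.execute(
        "SELECT ordinal_index, id, length FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


ordered = introns_in_order(conn, 22631654)
print(ordered)
```

Binding `transcript_id` as a parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with arbitrary input.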
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
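The indexes above mean that a filter on `transcript_id` does not need a full table scan. The sketch below illustrates this with `EXPLAIN QUERY PLAN`, assuming the table lives in SQLite (as the bracket-quoted DDL suggests). It recreates a reduced version of the schema in memory; the exact wording of the plan text varies between SQLite versions, so only the index name is checked.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced copy of the schema: a few columns and two of the indexes.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "transcript_id" INTEGER,
      "taxonomy_id" INTEGER,
      "score" REAL,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    CREATE INDEX [idx_introns_score]
        ON [introns] ([score]);
    """
)

# Ask SQLite how it would execute the lookup by transcript.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (22631654,),
).fetchall()

# The last column of each plan row holds the human-readable detail.
plan_text = " ".join(row[-1] for row in plan)
print(plan_text)
```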