introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
26 rows where transcript_id = 6061977
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 31222050 | GT-AG | 0 | 4.544147069647292e-05 | 8146 | rna-XM_008493003.2 6061977 | 3 | 13428110 | 13436255 | Calypte anna 9244 | AGA|GTAAGTATGT...TCTCTCTTATAT/AAATTATTTATT...TGCAG|ACC | 2 | 1 | 7.931 |
| 31222051 | GT-AG | 0 | 1.4731074631903578e-05 | 329 | rna-XM_008493003.2 6061977 | 4 | 13436368 | 13436696 | Calypte anna 9244 | GCA|GTAAGTGTCA...TGTAACTTATCT/CTGTAACTTATC...TCTAG|GAC | 0 | 1 | 9.809 |
| 31222052 | GT-AG | 0 | 1.000000099473604e-05 | 1647 | rna-XM_008493003.2 6061977 | 5 | 13436927 | 13438573 | Calypte anna 9244 | CAG|GTAATGAATA...AAAGTTTTACTT/GTTTTACTTACA...TTTAG|TGT | 2 | 1 | 13.665 |
| 31222053 | GT-AG | 0 | 1.000000099473604e-05 | 1043 | rna-XM_008493003.2 6061977 | 6 | 13438655 | 13439697 | Calypte anna 9244 | AAG|GTGAGTAGTT...ATGTGTTTGACA/ATGAGATTTATT...TTCAG|GAG | 2 | 1 | 15.023 |
| 31222054 | GT-AG | 0 | 1.000000099473604e-05 | 3926 | rna-XM_008493003.2 6061977 | 7 | 13439757 | 13443682 | Calypte anna 9244 | CAG|GTGAGTAAAC...TATTCTTTGTTT/CAGTATTTCATC...TAAAG|ATG | 1 | 1 | 16.013 |
| 31222055 | GT-AG | 0 | 1.000000099473604e-05 | 1281 | rna-XM_008493003.2 6061977 | 8 | 13443856 | 13445136 | Calypte anna 9244 | AAA|GTAAGTAAAT...GATTTTGTATCT/AATAATATAACT...TTCAG|GCA | 0 | 1 | 18.913 |
| 31222056 | GT-AG | 0 | 1.000000099473604e-05 | 1571 | rna-XM_008493003.2 6061977 | 9 | 13445176 | 13446746 | Calypte anna 9244 | AAG|GTAATGTGAT...CTTTTCTTACCT/CCTTTTCTTACC...TTTAG|GGT | 0 | 1 | 19.567 |
| 31222057 | GT-AG | 0 | 1.000000099473604e-05 | 1380 | rna-XM_008493003.2 6061977 | 10 | 13446960 | 13448339 | Calypte anna 9244 | GAG|GTAATGAGCT...AAAATGTTAATT/AAAATGTTAATT...TACAG|AGT | 0 | 1 | 23.139 |
| 31222058 | GT-AG | 0 | 0.0002038047978514 | 1093 | rna-XM_008493003.2 6061977 | 11 | 13448471 | 13449563 | Calypte anna 9244 | AAG|GTTTGTTTAT...GCATCTTTAAAA/GCATCTTTAAAA...TTCAG|GCC | 2 | 1 | 25.335 |
| 31222059 | GT-AG | 0 | 1.000000099473604e-05 | 192 | rna-XM_008493003.2 6061977 | 12 | 13449682 | 13449873 | Calypte anna 9244 | GAG|GTGAGCATGT...ATGATATTAATG/ATGATATTAATG...TATAG|GTG | 0 | 1 | 27.314 |
| 31222060 | GT-AG | 0 | 1.000000099473604e-05 | 1291 | rna-XM_008493003.2 6061977 | 13 | 13449984 | 13451274 | Calypte anna 9244 | TTG|GTAAGTATTT...CCTTCTTTCATT/CCTTCTTTCATT...TCTAG|TAA | 2 | 1 | 29.158 |
| 31222061 | GT-AG | 0 | 1.1569884534571225e-05 | 1725 | rna-XM_008493003.2 6061977 | 14 | 13451537 | 13453261 | Calypte anna 9244 | CAG|GTATTGATAC...GCTTCTGTAAAG/AAAGCAGTCATT...TCCAG|GAG | 0 | 1 | 33.551 |
| 31222062 | GT-AG | 0 | 2.8610722483436916e-05 | 1732 | rna-XM_008493003.2 6061977 | 15 | 13453361 | 13455092 | Calypte anna 9244 | ATG|GTAAGTTGGA...GGATCTTTATTA/TTATTATTAAAT...ATCAG|ACA | 0 | 1 | 35.211 |
| 31222063 | GT-AG | 0 | 1.000000099473604e-05 | 923 | rna-XM_008493003.2 6061977 | 16 | 13455258 | 13456180 | Calypte anna 9244 | CAG|GTAATAACGT...GGAGCTTTACAT/AACCTGTTTACT...ATCAG|GAA | 0 | 1 | 37.978 |
| 31222064 | GT-AG | 0 | 7.51354736632988e-05 | 434 | rna-XM_008493003.2 6061977 | 17 | 13456490 | 13456923 | Calypte anna 9244 | AAG|GTATGTCACT...CATTTTTTCTCT/AAGCCACTGATA...TTTAG|CAC | 0 | 1 | 43.159 |
| 31222065 | GT-AG | 0 | 1.000000099473604e-05 | 173 | rna-XM_008493003.2 6061977 | 18 | 13457179 | 13457351 | Calypte anna 9244 | AAG|GTAATGCTTC...CTGATTTGAACT/AAGTCACTGATT...TACAG|CGT | 0 | 1 | 47.435 |
| 31222066 | GT-AG | 0 | 1.000000099473604e-05 | 740 | rna-XM_008493003.2 6061977 | 19 | 13457497 | 13458236 | Calypte anna 9244 | CAG|GTGAAACTTT...CTAATTTTAATA/TTCTTTCTAATT...CACAG|ATT | 1 | 1 | 49.866 |
| 31222067 | GT-AG | 0 | 1.000000099473604e-05 | 131 | rna-XM_008493003.2 6061977 | 20 | 13458491 | 13458621 | Calypte anna 9244 | GAG|GTGAGATTCC...TTTTTCTTATTT/TTTTTTCTTATT...TTTAG|AAT | 0 | 1 | 54.125 |
| 31222068 | GT-AG | 0 | 1.000000099473604e-05 | 1425 | rna-XM_008493003.2 6061977 | 21 | 13458679 | 13460103 | Calypte anna 9244 | GAG|GTAAGGATAT...GATTATTTGACA/GATTATTTGACA...TACAG|GGA | 0 | 1 | 55.08 |
| 31222069 | GT-AG | 0 | 1.000000099473604e-05 | 2759 | rna-XM_008493003.2 6061977 | 22 | 13460668 | 13463426 | Calypte anna 9244 | AAG|GTTATAATTT...ACTTCCTTTCTA/TCCTTTCTAATG...CCTAG|GTA | 0 | 1 | 64.537 |
| 31222070 | GT-AG | 0 | 1.000000099473604e-05 | 1876 | rna-XM_008493003.2 6061977 | 23 | 13463729 | 13465604 | Calypte anna 9244 | GTG|GTAAGTCAGA...TCCCCTTTAATC/TTTAATCTAAAA...TCCAG|GAC | 2 | 1 | 69.601 |
| 31222071 | GT-AG | 0 | 1.000000099473604e-05 | 101 | rna-XM_008493003.2 6061977 | 24 | 13465694 | 13465794 | Calypte anna 9244 | GTG|GTGAGTTGCT...CAGTCCATAATC/ATCATTCTAATG...TGTAG|GGC | 1 | 1 | 71.093 |
| 31222072 | GT-AG | 0 | 1.000000099473604e-05 | 149 | rna-XM_008493003.2 6061977 | 25 | 13465915 | 13466063 | Calypte anna 9244 | AAG|GTCAGTAAGC...AAAGGCTTATTT/AAAAGGCTTATT...TGTAG|AAC | 1 | 1 | 73.105 |
| 31222073 | GT-AG | 0 | 1.000000099473604e-05 | 895 | rna-XM_008493003.2 6061977 | 26 | 13466156 | 13467050 | Calypte anna 9244 | AAG|GTAATGCAGC...AGTATCTGGAAT/TTGGTGCTGACT...TCCAG|CCC | 0 | 1 | 74.648 |
| 31238764 | GT-AG | 0 | 1.000000099473604e-05 | 7335 | rna-XM_008493003.2 6061977 | 1 | 13412300 | 13419634 | Calypte anna 9244 | GAG|GTGAGTCTTT...TTCTCTTTATTT/TTTCTCTTTATT...CCTAG|TCC | 0 | 5.785 | |
| 31238765 | GT-AG | 0 | 1.000000099473604e-05 | 8233 | rna-XM_008493003.2 6061977 | 2 | 13419692 | 13427924 | Calypte anna 9244 | GAG|GTAAGGAGAA...CTTTTTTTGTTC/AAACAACTCAAG...AACAG|GGT | 0 | 6.74 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);