introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
38 rows where transcript_id = 9059408
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48954138 | GT-AG | 0 | 1.000000099473604e-05 | 115161 | rna-XM_036577004.1 9059408 | 1 | 7627517 | 7742677 | Colossoma macropomum 42526 | AAG|GTAAGCAGGG...TATTACTAAATT/TTATTACTAAAT...TTCAG|GGG | 1 | 1 | 2.657 |
| 48954139 | GT-AG | 0 | 1.000000099473604e-05 | 15883 | rna-XM_036577004.1 9059408 | 2 | 7611562 | 7627444 | Colossoma macropomum 42526 | AAG|GTAATCACCC...GTGTCATTCTCC/TGAGGGTTGACC...TGCAG|GTG | 1 | 1 | 4.164 |
| 48954140 | GT-AG | 0 | 1.000000099473604e-05 | 1459 | rna-XM_036577004.1 9059408 | 3 | 7609983 | 7611441 | Colossoma macropomum 42526 | CAG|GTGAAAACTC...AATGTTTTATGT/CAATGTTTTATG...TGCAG|ACC | 1 | 1 | 6.675 |
| 48954141 | GT-AG | 0 | 1.000000099473604e-05 | 18026 | rna-XM_036577004.1 9059408 | 4 | 7591837 | 7609862 | Colossoma macropomum 42526 | CAG|GTACAGACAC...TCTGTCTCGTCC/CACACACTTACG...CACAG|ACA | 1 | 1 | 9.186 |
| 48954142 | GT-AG | 0 | 1.000000099473604e-05 | 5013 | rna-XM_036577004.1 9059408 | 5 | 7586581 | 7591593 | Colossoma macropomum 42526 | TCG|GTTAGTACCT...TTATTATTATTC/TTATTATTCAAC...CCCAG|ATA | 1 | 1 | 14.271 |
| 48954143 | GT-AG | 0 | 1.000000099473604e-05 | 12102 | rna-XM_036577004.1 9059408 | 6 | 7574344 | 7586445 | Colossoma macropomum 42526 | ACG|GTGAGTGGCC...CTCTCCCTCTCT/CTGTCTCTCACC...CAAAG|GTG | 1 | 1 | 17.096 |
| 48954144 | GT-AG | 0 | 2.507022987115377e-05 | 5217 | rna-XM_036577004.1 9059408 | 7 | 7569007 | 7574223 | Colossoma macropomum 42526 | ATG|GTAGGTTCAT...GTGTCTTTGTCT/TCTCTCTCCATC...TCCAG|TCC | 1 | 1 | 19.607 |
| 48954145 | GT-AG | 0 | 0.0005836305993677 | 2853 | rna-XM_036577004.1 9059408 | 8 | 7565956 | 7568808 | Colossoma macropomum 42526 | ATG|GTACGTTTAC...CTCCTCTTACTC/TGTGTACTTATT...TGCAG|GGA | 1 | 1 | 23.75 |
| 48954146 | GT-AG | 0 | 1.000000099473604e-05 | 1787 | rna-XM_036577004.1 9059408 | 9 | 7564077 | 7565863 | Colossoma macropomum 42526 | AGA|GTGAGATATC...AGATTTTGAGCT/TCCTGTCTGATT...CGCAG|GTC | 0 | 1 | 25.675 |
| 48954147 | GT-AG | 0 | 0.1280063142610631 | 3823 | rna-XM_036577004.1 9059408 | 10 | 7560143 | 7563965 | Colossoma macropomum 42526 | AAG|GTACCCTACC...TCTCCCTCACCC/CTCTCCCTCACC...TGCAG|GAC | 0 | 1 | 27.997 |
| 48954148 | GT-AG | 0 | 1.000000099473604e-05 | 895 | rna-XM_036577004.1 9059408 | 11 | 7559124 | 7560018 | Colossoma macropomum 42526 | AAG|GTCAGTGGGA...ATCATCTAAACT/CATCATCTAAAC...TACAG|GCT | 1 | 1 | 30.592 |
| 48954149 | GT-AG | 0 | 1.000000099473604e-05 | 164 | rna-XM_036577004.1 9059408 | 12 | 7558850 | 7559013 | Colossoma macropomum 42526 | GAG|GTCAGTACTA...CTCTCCCTCTCT/CTCTCTCTCTCT...TGCAG|TAT | 0 | 1 | 32.894 |
| 48954150 | GT-AG | 0 | 1.000000099473604e-05 | 1689 | rna-XM_036577004.1 9059408 | 13 | 7557058 | 7558746 | Colossoma macropomum 42526 | AGG|GTAAGTGTGT...TCCACTTTCTCT/GTGTGCATGATC...TGAAG|GTG | 1 | 1 | 35.049 |
| 48954151 | GT-AG | 0 | 1.000000099473604e-05 | 604 | rna-XM_036577004.1 9059408 | 14 | 7556389 | 7556992 | Colossoma macropomum 42526 | ACA|GTGAGTTCAG...GTGACCATGACA/ACAACACTAACC...TTCAG|AGT | 0 | 1 | 36.409 |
| 48954152 | GT-AG | 0 | 1.000000099473604e-05 | 374 | rna-XM_036577004.1 9059408 | 15 | 7555921 | 7556294 | Colossoma macropomum 42526 | AAG|GTGAGACATG...TGTGTTTTACTG/GTGTGTTTTACT...TTCAG|CCC | 1 | 1 | 38.376 |
| 48954153 | GT-AG | 0 | 1.000000099473604e-05 | 1013 | rna-XM_036577004.1 9059408 | 16 | 7554788 | 7555800 | Colossoma macropomum 42526 | CCT|GTGAGTACAC...TGTGTGTGTGTG/GTGTGTGTGTGT...TATAG|GGT | 1 | 1 | 40.887 |
| 48954154 | GT-AG | 0 | 1.000000099473604e-05 | 381 | rna-XM_036577004.1 9059408 | 17 | 7554292 | 7554672 | Colossoma macropomum 42526 | CAG|GTAAACGCAC...TGTGGTTTGTTT/ATACATGTCAGT...TTCAG|GCC | 2 | 1 | 43.294 |
| 48954155 | GT-AG | 0 | 1.000000099473604e-05 | 231 | rna-XM_036577004.1 9059408 | 18 | 7553921 | 7554151 | Colossoma macropomum 42526 | GAG|GTCAGTCTGT...ACTCTTGTGACA/AATATAATAACA...TGCAG|CGT | 1 | 1 | 46.223 |
| 48954156 | GT-AG | 0 | 1.000000099473604e-05 | 746 | rna-XM_036577004.1 9059408 | 19 | 7553038 | 7553783 | Colossoma macropomum 42526 | GAG|GTGAAACAAG...GCCTCTTTCTCC/CAGTGTGTGACG...TGTAG|ACA | 0 | 1 | 49.09 |
| 48954157 | GT-AG | 0 | 1.000000099473604e-05 | 295 | rna-XM_036577004.1 9059408 | 20 | 7552662 | 7552956 | Colossoma macropomum 42526 | CCG|GTCAGTTCAC...TGTTCTTTTGCC/TGTTGTGTCATT...TCCAG|GAC | 0 | 1 | 50.785 |
| 48954158 | GT-AG | 0 | 1.000000099473604e-05 | 623 | rna-XM_036577004.1 9059408 | 21 | 7551911 | 7552533 | Colossoma macropomum 42526 | AAG|GTGAGGGATA...GGAATAGTGATA/GGAATAGTGATA...TCCAG|GAT | 2 | 1 | 53.463 |
| 48954159 | GT-AG | 0 | 1.000000099473604e-05 | 1202 | rna-XM_036577004.1 9059408 | 22 | 7550594 | 7551795 | Colossoma macropomum 42526 | CAG|GTACTACATA...CCTGCTGTAGTT/TAGTTTGTCAAT...TCCAG|GCT | 0 | 1 | 55.869 |
| 48954160 | GT-AG | 0 | 0.0015683400592638 | 834 | rna-XM_036577004.1 9059408 | 23 | 7549558 | 7550391 | Colossoma macropomum 42526 | CAG|GTACTCCCTG...TCTCCATTAACA/TCTCCATTAACA...CCCAG|AGG | 1 | 1 | 60.096 |
| 48954161 | GT-AG | 0 | 1.000000099473604e-05 | 2494 | rna-XM_036577004.1 9059408 | 24 | 7546954 | 7549447 | Colossoma macropomum 42526 | AAG|GTACATGCTC...CTAACTGTAAGT/AACATGCTAACT...TACAG|TTT | 0 | 1 | 62.398 |
| 48954162 | GT-AG | 0 | 1.000000099473604e-05 | 1529 | rna-XM_036577004.1 9059408 | 25 | 7545344 | 7546872 | Colossoma macropomum 42526 | AAG|GTACAGTAGC...TGACGTGTGACG/TGACGTGTGACG...TGTAG|GGT | 0 | 1 | 64.093 |
| 48954163 | GT-AG | 0 | 1.000000099473604e-05 | 1128 | rna-XM_036577004.1 9059408 | 26 | 7544092 | 7545219 | Colossoma macropomum 42526 | CAG|GTAAGAACAA...GATATTTTATAT/AGATATTTTATA...TGCAG|GTA | 1 | 1 | 66.688 |
| 48954164 | GT-AG | 0 | 1.000000099473604e-05 | 430 | rna-XM_036577004.1 9059408 | 27 | 7543540 | 7543969 | Colossoma macropomum 42526 | GAG|GTAAACTCAT...TGGTCATTAAGT/TTGTTGGTCATT...TGTAG|GTC | 0 | 1 | 69.24 |
| 48954165 | GT-AG | 0 | 1.000000099473604e-05 | 630 | rna-XM_036577004.1 9059408 | 28 | 7542798 | 7543427 | Colossoma macropomum 42526 | CAG|GTACAGAGAC...ATCTCCTTTCTT/TGTAATTTAATC...TTTAG|ATG | 1 | 1 | 71.584 |
| 48954166 | GT-AG | 0 | 0.0023022113351997 | 389 | rna-XM_036577004.1 9059408 | 29 | 7542299 | 7542687 | Colossoma macropomum 42526 | AAG|GTATCGGATA...GTCTCTATAAAC/CATGTGGTGAAT...CTCAG|ATG | 0 | 1 | 73.886 |
| 48954167 | GT-AG | 0 | 1.000000099473604e-05 | 1752 | rna-XM_036577004.1 9059408 | 30 | 7540416 | 7542167 | Colossoma macropomum 42526 | CAG|GTCAGGATCA...TGTGTGTGGGTG/GTGTGTGTGTGT...TGCAG|TGT | 2 | 1 | 76.627 |
| 48954168 | GT-AG | 0 | 1.000000099473604e-05 | 1616 | rna-XM_036577004.1 9059408 | 31 | 7538631 | 7540246 | Colossoma macropomum 42526 | CGG|GTCAGTACAC...CTGGTTTTCTCC/AAAGCAGTAAAG...ATCAG|CTG | 0 | 1 | 80.163 |
| 48954169 | GT-AG | 0 | 1.000000099473604e-05 | 1220 | rna-XM_036577004.1 9059408 | 32 | 7537240 | 7538459 | Colossoma macropomum 42526 | CAG|GTGAGGGGGA...TGTGTTTTACCT/GTGTGTTTTACC...GTCAG|CCC | 0 | 1 | 83.741 |
| 48954170 | GT-AG | 0 | 0.000259444419912 | 2655 | rna-XM_036577004.1 9059408 | 33 | 7534555 | 7537209 | Colossoma macropomum 42526 | CAG|GTATAGTGGA...AGAATCTTATTT/AAGAATCTTATT...CTCAG|GGA | 0 | 1 | 84.369 |
| 48954171 | GT-AG | 0 | 0.0001130499668809 | 2291 | rna-XM_036577004.1 9059408 | 34 | 7532202 | 7534492 | Colossoma macropomum 42526 | CAG|GTACCGCAAA...TGTTTCTGAATG/TATTATCTGATT...TTTAG|AGT | 2 | 1 | 85.666 |
| 48954172 | GT-AG | 0 | 1.000000099473604e-05 | 1161 | rna-XM_036577004.1 9059408 | 35 | 7530983 | 7532143 | Colossoma macropomum 42526 | AAG|GTGAGAAAAC...GTGATCTTAACC/TAAATTCTGATC...TGCAG|ATC | 0 | 1 | 86.88 |
| 48954173 | GT-AG | 0 | 1.000000099473604e-05 | 235 | rna-XM_036577004.1 9059408 | 36 | 7530673 | 7530907 | Colossoma macropomum 42526 | GAG|GTGAGTATTT...TAGGCCTAAAAT/ACTCGTCTGATT...AACAG|ATC | 0 | 1 | 88.449 |
| 48954174 | GT-AG | 0 | 1.938490575245366e-05 | 114 | rna-XM_036577004.1 9059408 | 37 | 7530384 | 7530497 | Colossoma macropomum 42526 | GAG|GTAAACCAGC...AGCCCCTAAACA/CAGCCCCTAAAC...TCCAG|AGC | 1 | 1 | 92.111 |
| 48954175 | GT-AG | 0 | 1.000000099473604e-05 | 899 | rna-XM_036577004.1 9059408 | 38 | 7529361 | 7530259 | Colossoma macropomum 42526 | GTG|GTGAGAACAG...TGAGCTTTATTG/TCAGTGCTAATC...TCCAG|TGC | 2 | 1 | 94.706 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);