introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
45 rows where transcript_id = 9059372
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48952993 | GT-AG | 0 | 1.000000099473604e-05 | 553 | rna-XM_036568643.1 9059372 | 1 | 31738664 | 31739216 | Colossoma macropomum 42526 | AAG|GTGGGTTAAG...TTCATCTGGAAT/AAATATTTCATC...CACAG|ATG | 1 | 1 | 1.747 |
| 48952994 | GT-AG | 0 | 1.000000099473604e-05 | 173 | rna-XM_036568643.1 9059372 | 2 | 31738365 | 31738537 | Colossoma macropomum 42526 | AGG|GTGAGTTCAG...TTGTTCTTGTCA/CTATGTGTGACC...CTTAG|CTG | 1 | 1 | 3.439 |
| 48952995 | GT-AG | 0 | 1.000000099473604e-05 | 94 | rna-XM_036568643.1 9059372 | 3 | 31738133 | 31738226 | Colossoma macropomum 42526 | CAA|GTGGGTTGAC...TTGCTTTTATTC/CGTTTATTTACT...TGTAG|ACC | 1 | 1 | 5.294 |
| 48952996 | GT-AG | 0 | 1.000000099473604e-05 | 244 | rna-XM_036568643.1 9059372 | 4 | 31737757 | 31738000 | Colossoma macropomum 42526 | TTG|GTAAGATATC...CGGTCTTTCACT/CGGTCTTTCACT...ATTAG|CGG | 1 | 1 | 7.067 |
| 48952997 | GT-AG | 0 | 1.000000099473604e-05 | 345 | rna-XM_036568643.1 9059372 | 5 | 31737274 | 31737618 | Colossoma macropomum 42526 | GAA|GTAAGAAAAT...CCTTCTTTGACT/CCTTCTTTGACT...CTTAG|ACC | 1 | 1 | 8.921 |
| 48952998 | GT-AG | 0 | 1.000000099473604e-05 | 394 | rna-XM_036568643.1 9059372 | 6 | 31736721 | 31737114 | Colossoma macropomum 42526 | TAG|GTAAGAACTG...ATGCTCTTACAC/CATGCTCTTACA...TGCAG|GCA | 1 | 1 | 11.057 |
| 48952999 | GT-AG | 0 | 1.000000099473604e-05 | 244 | rna-XM_036568643.1 9059372 | 7 | 31736279 | 31736522 | Colossoma macropomum 42526 | GGG|GTCAGTTTCA...TTATTTTTACCT/ATTATTTTTACC...CATAG|AGG | 1 | 1 | 13.718 |
| 48953000 | GT-AG | 0 | 2.4981023162227104e-05 | 164 | rna-XM_036568643.1 9059372 | 8 | 31735932 | 31736095 | Colossoma macropomum 42526 | ATG|GTAAGCTCAC...TGGTTCTCATCC/TTGGTTCTCATC...TACAG|TTT | 1 | 1 | 16.176 |
| 48953001 | GT-AG | 0 | 1.000000099473604e-05 | 1320 | rna-XM_036568643.1 9059372 | 9 | 31734435 | 31735754 | Colossoma macropomum 42526 | CTG|GTGAGTGCTA...CTTGTCTTATCA/TCTTGTCTTATC...TTCAG|CTC | 1 | 1 | 18.554 |
| 48953002 | GT-AG | 0 | 1.000000099473604e-05 | 196 | rna-XM_036568643.1 9059372 | 10 | 31734083 | 31734278 | Colossoma macropomum 42526 | AAG|GTAAGGTCAC...TTGGTTTTATAG/TTTGGTTTTATA...TTTAG|ATC | 1 | 1 | 20.65 |
| 48953003 | GT-AG | 0 | 1.000000099473604e-05 | 264 | rna-XM_036568643.1 9059372 | 11 | 31733690 | 31733953 | Colossoma macropomum 42526 | TTG|GTGAGTTCAC...GCTAATTTAATG/ACAACACTCATT...TACAG|ACC | 1 | 1 | 22.383 |
| 48953004 | GT-AG | 0 | 1.000000099473604e-05 | 1626 | rna-XM_036568643.1 9059372 | 12 | 31731920 | 31733545 | Colossoma macropomum 42526 | CTG|GTGAGTGTAG...ATGACCTTAAAA/CATGACCTTAAA...GGCAG|GTG | 1 | 1 | 24.318 |
| 48953005 | GT-AG | 0 | 1.000000099473604e-05 | 109 | rna-XM_036568643.1 9059372 | 13 | 31731689 | 31731797 | Colossoma macropomum 42526 | CAG|GTCAGTATTT...TCTTCCTTCGTA/TGGGGGCTAACA...AAAAG|AAA | 0 | 1 | 25.957 |
| 48953006 | GT-AG | 0 | 1.000000099473604e-05 | 150 | rna-XM_036568643.1 9059372 | 14 | 31731370 | 31731519 | Colossoma macropomum 42526 | CCT|GTGAGTATTC...AATACCATAGCC/CCTGTATTGAAC...TTCAG|TGG | 1 | 1 | 28.228 |
| 48953007 | GT-AG | 0 | 1.000000099473604e-05 | 754 | rna-XM_036568643.1 9059372 | 15 | 31730430 | 31731183 | Colossoma macropomum 42526 | TGG|GTCAGTTACC...TATGGTTTAACT/TATGGTTTAACT...TGCAG|ACC | 1 | 1 | 30.727 |
| 48953008 | GT-AG | 0 | 1.000000099473604e-05 | 204 | rna-XM_036568643.1 9059372 | 16 | 31730097 | 31730300 | Colossoma macropomum 42526 | CGG|GTGTGTGTGT...TACTTTTTTGCT/TATTGTGTCAGA...TACAG|CTC | 1 | 1 | 32.46 |
| 48953009 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_036568643.1 9059372 | 17 | 31729919 | 31730006 | Colossoma macropomum 42526 | CAG|GTGTGCCCAA...AATGCACTGACG/AATGCACTGACG...TATAG|GCT | 1 | 1 | 33.669 |
| 48953010 | GT-AG | 0 | 0.0003714933675588 | 1009 | rna-XM_036568643.1 9059372 | 18 | 31728709 | 31729717 | Colossoma macropomum 42526 | CAG|GTATGCAGCT...TGTGTCTAACCA/CTGTGTCTAACC...CACAG|AGG | 1 | 1 | 36.37 |
| 48953011 | GT-AG | 0 | 1.000000099473604e-05 | 387 | rna-XM_036568643.1 9059372 | 19 | 31728043 | 31728429 | Colossoma macropomum 42526 | CCA|GTGAGTAGAA...TCCTCTTTATCT/TTTTTTCTCAGT...CATAG|AAC | 1 | 1 | 40.118 |
| 48953012 | GT-AG | 0 | 9.554150865882922e-05 | 742 | rna-XM_036568643.1 9059372 | 20 | 31727037 | 31727778 | Colossoma macropomum 42526 | CCT|GTGAGTTTCA...AAATCTTTAACA/AAATCTTTAACA...TTCAG|CCC | 1 | 1 | 43.665 |
| 48953013 | GT-AG | 0 | 0.1342879664064112 | 121 | rna-XM_036568643.1 9059372 | 21 | 31726821 | 31726941 | Colossoma macropomum 42526 | AAG|GTATTCTTCA...ACATTCTTCTCT/GCTTTGTTAAAG...CTTAG|TTG | 0 | 1 | 44.942 |
| 48953014 | GT-AG | 0 | 1.000000099473604e-05 | 145 | rna-XM_036568643.1 9059372 | 22 | 31726507 | 31726651 | Colossoma macropomum 42526 | CAC|GTAAGACTGC...TATCTCTTATTC/GTATCTCTTATT...TTTAG|CTC | 1 | 1 | 47.212 |
| 48953015 | GT-AG | 0 | 1.000000099473604e-05 | 603 | rna-XM_036568643.1 9059372 | 23 | 31725817 | 31726419 | Colossoma macropomum 42526 | CAG|GTCAGCAGTG...TTAGCCTTTTTG/CTCTGTCTCACT...CTCAG|ATA | 1 | 1 | 48.381 |
| 48953016 | GT-AG | 0 | 1.94434150281699e-05 | 713 | rna-XM_036568643.1 9059372 | 24 | 31724912 | 31725624 | Colossoma macropomum 42526 | CGG|GTAATTATTC...CTCTCCCTATCC/CTTGTCTTCATG...TTTAG|ATG | 1 | 1 | 50.961 |
| 48953017 | GT-AG | 0 | 1.000000099473604e-05 | 557 | rna-XM_036568643.1 9059372 | 25 | 31724082 | 31724638 | Colossoma macropomum 42526 | CAG|GTGAGTATGT...CTTTTCTTAACA/CTTTTCTTAACA...CTCAG|CTG | 1 | 1 | 54.629 |
| 48953018 | GT-AG | 0 | 0.0001422569796378 | 93 | rna-XM_036568643.1 9059372 | 26 | 31723797 | 31723889 | Colossoma macropomum 42526 | GCC|GTACGTGAGC...CTCTTCTTGCTG/TTCTTGCTGAGC...TGTAG|ACC | 1 | 1 | 57.208 |
| 48953019 | GT-AG | 0 | 1.000000099473604e-05 | 256 | rna-XM_036568643.1 9059372 | 27 | 31723451 | 31723706 | Colossoma macropomum 42526 | CAG|GTGAGTTCTT...TTTTTGTTACTT/ATTTTTGTTACT...TGCAG|TGC | 1 | 1 | 58.417 |
| 48953020 | GT-AG | 0 | 3.170682362787984e-05 | 265 | rna-XM_036568643.1 9059372 | 28 | 31722919 | 31723183 | Colossoma macropomum 42526 | CTA|GTAGGTTACA...TTCACTCTATCA/TATTATTTCACT...TCTAG|TTT | 1 | 1 | 62.005 |
| 48953021 | GT-AG | 0 | 1.000000099473604e-05 | 239 | rna-XM_036568643.1 9059372 | 29 | 31722563 | 31722801 | Colossoma macropomum 42526 | CAG|GTAGATTAAA...AAATCTCTATTA/CATTTGCTGACA...ATCAG|GTG | 1 | 1 | 63.577 |
| 48953022 | GT-AG | 0 | 6.682331141780796e-05 | 159 | rna-XM_036568643.1 9059372 | 30 | 31722239 | 31722397 | Colossoma macropomum 42526 | CAG|GTACACCTGC...ATAATCTGTGTG/AAATGAATAATC...AACAG|ATA | 1 | 1 | 65.793 |
| 48953023 | GT-AG | 0 | 1.000000099473604e-05 | 149 | rna-XM_036568643.1 9059372 | 31 | 31721934 | 31722082 | Colossoma macropomum 42526 | CAG|GTAAAAACTA...TATGCCATGATT/TGATTGTTTATT...TGCAG|ATC | 1 | 1 | 67.889 |
| 48953024 | GT-AG | 0 | 1.000000099473604e-05 | 184 | rna-XM_036568643.1 9059372 | 32 | 31721636 | 31721819 | Colossoma macropomum 42526 | CGA|GTGAGTAAAA...ACATCATTAACA/CTCCAATTCACC...CTCAG|CTG | 1 | 1 | 69.421 |
| 48953025 | GT-AG | 0 | 1.000000099473604e-05 | 221 | rna-XM_036568643.1 9059372 | 33 | 31721145 | 31721365 | Colossoma macropomum 42526 | CAG|GTACTGCAGG...TGTATCTTAATG/CCTCTTTTTATT...TCTAG|CTA | 1 | 1 | 73.049 |
| 48953026 | GT-AG | 0 | 1.000000099473604e-05 | 420 | rna-XM_036568643.1 9059372 | 34 | 31720534 | 31720953 | Colossoma macropomum 42526 | ATG|GTGAGTCTGA...CAGGTCTTAATC/TTCGCTCTCATT...TCTAG|CCT | 0 | 1 | 75.615 |
| 48953027 | GT-AG | 0 | 0.0007868315990528 | 107 | rna-XM_036568643.1 9059372 | 35 | 31720339 | 31720445 | Colossoma macropomum 42526 | ACA|GTAAGCTGTT...CATGCTTTTGTT/TTGTTTTGCACC...TGTAG|ATA | 1 | 1 | 76.797 |
| 48953028 | GT-AG | 0 | 0.0082042056253597 | 427 | rna-XM_036568643.1 9059372 | 36 | 31719732 | 31720158 | Colossoma macropomum 42526 | CTG|GTACTCTATA...ATATTTTTACTG/AATATTTTTACT...CATAG|GTC | 1 | 1 | 79.215 |
| 48953029 | GT-AG | 0 | 0.0112700302744067 | 141 | rna-XM_036568643.1 9059372 | 37 | 31719501 | 31719641 | Colossoma macropomum 42526 | CAG|GTAGCCACAC...TAAATCTTGAAT/TAAATCTTGAAT...TTTAG|CTA | 1 | 1 | 80.425 |
| 48953030 | GT-AG | 0 | 0.0001149114973132 | 301 | rna-XM_036568643.1 9059372 | 38 | 31719011 | 31719311 | Colossoma macropomum 42526 | CTG|GTACTGTTCC...TGTTTCTTTTTT/GCTTTGTTTATG...TTCAG|GTT | 1 | 1 | 82.964 |
| 48953031 | GT-AG | 0 | 1.000000099473604e-05 | 343 | rna-XM_036568643.1 9059372 | 39 | 31718578 | 31718920 | Colossoma macropomum 42526 | CGC|GTAAGAACCT...TTGCTTTTAACC/TTGCTTTTAACC...GCTAG|AAT | 1 | 1 | 84.173 |
| 48953032 | GT-AG | 0 | 1.000000099473604e-05 | 752 | rna-XM_036568643.1 9059372 | 40 | 31717374 | 31718125 | Colossoma macropomum 42526 | CAG|GTGAGCATAT...CTTTTTGTGACT/CTTTTTGTGACT...TACAG|ATG | 0 | 1 | 90.246 |
| 48953033 | GT-AG | 0 | 1.000000099473604e-05 | 143 | rna-XM_036568643.1 9059372 | 41 | 31717092 | 31717234 | Colossoma macropomum 42526 | CTG|GTAAAGCTCA...GGTGTCTTAATC/GGTGTCTTAATC...TCCAG|TTC | 1 | 1 | 92.113 |
| 48953034 | GT-AG | 0 | 1.000000099473604e-05 | 586 | rna-XM_036568643.1 9059372 | 42 | 31716338 | 31716923 | Colossoma macropomum 42526 | CCA|GTGAGTGACT...TTTTCCTCACTC/ATTTTCCTCACT...AACAG|AAT | 1 | 1 | 94.371 |
| 48953035 | GT-AG | 0 | 0.0045328865573538 | 963 | rna-XM_036568643.1 9059372 | 43 | 31715249 | 31716211 | Colossoma macropomum 42526 | CAC|GTAAGCTTCA...AGTACTTTGATG/CATGTTCTCAAA...TTTAG|ATG | 1 | 1 | 96.063 |
| 48953036 | GT-AG | 0 | 1.000000099473604e-05 | 280 | rna-XM_036568643.1 9059372 | 44 | 31714862 | 31715141 | Colossoma macropomum 42526 | CAG|GTAACACTCA...GATGCTGTCACT/GATGCTGTCACT...CTCAG|GGC | 0 | 1 | 97.501 |
| 48953037 | GT-AG | 0 | 1.000000099473604e-05 | 155 | rna-XM_036568643.1 9059372 | 45 | 31714590 | 31714744 | Colossoma macropomum 42526 | TTG|GTAAGTCTGA...GTGTTATTGAAC/GTGTTATTGAAC...CACAG|AAC | 0 | 1 | 99.073 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);