introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
35 rows where transcript_id = 5963396
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 30552447 | GT-AG | 0 | 0.0002481121159098 | 894 | rna-XM_042331990.1 5963396 | 1 | 136427286 | 136428179 | Callorhinchus milii 7868 | GAG|GTACAGTTTA...TGTTCCTGATCT/CTGTTCCTGATC...CGCAG|TCC | 0 | 1 | 1.673 |
| 30552448 | GT-CT | 0 | 1.000000099473604e-05 | 1111 | rna-XM_042331990.1 5963396 | 2 | 136428324 | 136429434 | Callorhinchus milii 7868 | CAG|GTCAGTGCCA...AAAACTTTAATC/TTATTATTTATT...TCTCT|CTT | 0 | 1 | 4.54 |
| 30552449 | GT-AG | 0 | 9.549162021451914e-05 | 4118 | rna-XM_042331990.1 5963396 | 3 | 136429526 | 136433643 | Callorhinchus milii 7868 | TCA|GTAAGTCAGC...TGTGCCTTATCC/CTGTGCCTTATC...TCAAG|GCA | 1 | 1 | 6.352 |
| 30552450 | GT-AG | 0 | 1.000000099473604e-05 | 630 | rna-XM_042331990.1 5963396 | 4 | 136433752 | 136434381 | Callorhinchus milii 7868 | TGG|GTAAGGAAAA...GGGCCTTTCTCT/CGAGTGCTGATG...CTCAG|GTG | 1 | 1 | 8.503 |
| 30552451 | GT-AG | 0 | 1.000000099473604e-05 | 934 | rna-XM_042331990.1 5963396 | 5 | 136434492 | 136435425 | Callorhinchus milii 7868 | CAC|GTAAGTCACT...GGATTATTATTG/AGGATTATTATT...TCCAG|AAG | 0 | 1 | 10.693 |
| 30552452 | GT-AG | 0 | 0.0002303317209957 | 328 | rna-XM_042331990.1 5963396 | 6 | 136435692 | 136436019 | Callorhinchus milii 7868 | CGG|GTATAGTGAC...TTGTTGTTGATT/TTGTTGTTGATT...CACAG|GTT | 2 | 1 | 15.99 |
| 30552453 | GT-AG | 0 | 1.000000099473604e-05 | 243 | rna-XM_042331990.1 5963396 | 7 | 136436102 | 136436344 | Callorhinchus milii 7868 | CTG|GTAAGAGACG...GTGTCCTCACTG/TGTGTCCTCACT...TGCAG|CAC | 0 | 1 | 17.622 |
| 30552454 | GT-AG | 0 | 1.000000099473604e-05 | 549 | rna-XM_042331990.1 5963396 | 8 | 136436486 | 136437034 | Callorhinchus milii 7868 | AAG|GTACAGGACA...TTGTTGTTGCTG/TGCTGTTGTACT...ATCAG|GTG | 0 | 1 | 20.43 |
| 30552455 | GT-AG | 0 | 1.000000099473604e-05 | 983 | rna-XM_042331990.1 5963396 | 9 | 136437158 | 136438140 | Callorhinchus milii 7868 | AAG|GTGGGTGTTC...ATGGTCTTCTCT/ATTATTATAATT...CCCAG|GTG | 0 | 1 | 22.879 |
| 30552456 | GT-AG | 0 | 1.000000099473604e-05 | 2564 | rna-XM_042331990.1 5963396 | 10 | 136438225 | 136440788 | Callorhinchus milii 7868 | AAG|GTACAGTTCT...ACGCTCTCACTC/CACGCTCTCACT...CTCAG|TAC | 0 | 1 | 24.552 |
| 30552457 | GT-AG | 0 | 1.000000099473604e-05 | 649 | rna-XM_042331990.1 5963396 | 11 | 136440936 | 136441584 | Callorhinchus milii 7868 | CAG|GTACTGCCCT...ATCTCTCTACCC/CATCTCTCTACC...TCCAG|TTC | 0 | 1 | 27.479 |
| 30552458 | GT-AG | 0 | 0.0006001249423243 | 628 | rna-XM_042331990.1 5963396 | 12 | 136441683 | 136442310 | Callorhinchus milii 7868 | CTC|GTAAGTATCT...TTTATCTTAACC/CTATTGTTTATT...CCCAG|GTT | 2 | 1 | 29.431 |
| 30552459 | GT-AG | 0 | 1.000000099473604e-05 | 1056 | rna-XM_042331990.1 5963396 | 13 | 136442398 | 136443453 | Callorhinchus milii 7868 | CAG|GTGACTACCT...CTATCTTTTTCT/CTCTCTCTCATT...ACCAG|GTG | 2 | 1 | 31.163 |
| 30552460 | GT-AG | 0 | 1.9213398511126405e-05 | 1233 | rna-XM_042331990.1 5963396 | 14 | 136443566 | 136444798 | Callorhinchus milii 7868 | AAG|GTAACGTGAC...TTGCATTTGATT/ATTTGATTGATT...GACAG|GAT | 0 | 1 | 33.393 |
| 30552461 | GT-AG | 0 | 1.000000099473604e-05 | 681 | rna-XM_042331990.1 5963396 | 15 | 136445011 | 136445691 | Callorhinchus milii 7868 | CAG|GTAAGGCTAT...TCTCTCTTGCCC/CACACGCTAAAC...CTCAG|GTT | 2 | 1 | 37.614 |
| 30552462 | GT-AG | 0 | 1.000000099473604e-05 | 509 | rna-XM_042331990.1 5963396 | 16 | 136445849 | 136446357 | Callorhinchus milii 7868 | GAG|GTGAGGGAGG...CTCTCTTTCTCT/CTGTCTCTCACA...CACAG|ATG | 0 | 1 | 40.741 |
| 30552463 | GT-AG | 0 | 1.000000099473604e-05 | 1047 | rna-XM_042331990.1 5963396 | 17 | 136446598 | 136447644 | Callorhinchus milii 7868 | CAG|GTGAGAGAGG...CACTCATTATCA/AGTGTTATCATT...CACAG|AAC | 0 | 1 | 45.52 |
| 30552464 | GT-AG | 0 | 1.000000099473604e-05 | 2177 | rna-XM_042331990.1 5963396 | 18 | 136447888 | 136450064 | Callorhinchus milii 7868 | CAG|GTGCGCGGCC...TCCGTACTGACG/TCCGTACTGACG...TGCAG|CGG | 0 | 1 | 50.358 |
| 30552465 | GT-AG | 0 | 1.000000099473604e-05 | 109 | rna-XM_042331990.1 5963396 | 19 | 136450159 | 136450267 | Callorhinchus milii 7868 | TAG|GTGAGCACAC...CGGGCTATAACG/CGGGCTATAACG...TGCAG|AAC | 1 | 1 | 52.23 |
| 30552466 | GT-AG | 0 | 0.0117177563408745 | 2925 | rna-XM_042331990.1 5963396 | 20 | 136450417 | 136453341 | Callorhinchus milii 7868 | AAG|GTAACCTGAC...TTCTCTTTGCCT/CGCTCGCTTATT...TGCAG|CAC | 0 | 1 | 55.197 |
| 30552467 | GT-AG | 0 | 1.000000099473604e-05 | 628 | rna-XM_042331990.1 5963396 | 21 | 136453462 | 136454089 | Callorhinchus milii 7868 | GAG|GTGAGAGACC...CTCCCCTTCCCC/CCCCCACCCACA...CGCAG|GAG | 0 | 1 | 57.587 |
| 30552468 | GT-AG | 0 | 1.9544888486289317e-05 | 643 | rna-XM_042331990.1 5963396 | 22 | 136454222 | 136454864 | Callorhinchus milii 7868 | CTG|GTAAACACAG...CTTGTCTTCTTT/AAATAAATAAAC...CGCAG|TCA | 0 | 1 | 60.215 |
| 30552469 | GT-AG | 0 | 1.000000099473604e-05 | 925 | rna-XM_042331990.1 5963396 | 23 | 136454931 | 136455855 | Callorhinchus milii 7868 | AAG|GTAAATAATA...TCTGCTGTATTT/CCCCCACTAACC...CTCAG|CGT | 0 | 1 | 61.529 |
| 30552470 | GT-AG | 0 | 0.0011108088111537 | 1045 | rna-XM_042331990.1 5963396 | 24 | 136456093 | 136457137 | Callorhinchus milii 7868 | ATG|GTAACTGTCT...CGAGTTTTGATT/CGAGTTTTGATT...TCCAG|GAA | 0 | 1 | 66.249 |
| 30552471 | GT-AG | 0 | 1.000000099473604e-05 | 725 | rna-XM_042331990.1 5963396 | 25 | 136457236 | 136457960 | Callorhinchus milii 7868 | CAG|GTAAGCAGGG...CTTCCCTTCTCT/CTGTCGCTCACC...TCCAG|CCA | 2 | 1 | 68.2 |
| 30552472 | GT-AG | 0 | 1.000000099473604e-05 | 412 | rna-XM_042331990.1 5963396 | 26 | 136458039 | 136458450 | Callorhinchus milii 7868 | CAG|GTCAAATCTA...ACTGCTCTGACC/CGGTGACTGACT...CACAG|GCT | 2 | 1 | 69.753 |
| 30552473 | GT-AG | 0 | 1.000000099473604e-05 | 458 | rna-XM_042331990.1 5963396 | 27 | 136458650 | 136459107 | Callorhinchus milii 7868 | CTG|GTAAGTCACC...TCATGCTCGATG/AGTGGAGTGACT...TTCAG|GAC | 0 | 1 | 73.716 |
| 30552474 | GT-AG | 0 | 1.000000099473604e-05 | 502 | rna-XM_042331990.1 5963396 | 28 | 136459202 | 136459703 | Callorhinchus milii 7868 | AAG|GTGAACAGAC...CATCTCTTTCTT/CCACATGTGACG...GCCAG|CCC | 1 | 1 | 75.587 |
| 30552475 | GT-AG | 0 | 0.0002282021298753 | 371 | rna-XM_042331990.1 5963396 | 29 | 136459842 | 136460212 | Callorhinchus milii 7868 | CAG|GTACACGTGT...TCCTCCTTCCCT/CCTTCCCTCACT...TCCAG|AGC | 1 | 1 | 78.335 |
| 30552476 | GT-AG | 0 | 1.000000099473604e-05 | 834 | rna-XM_042331990.1 5963396 | 30 | 136460365 | 136461198 | Callorhinchus milii 7868 | AAG|GTAAGTGAGC...TATGTCTTATCT/TCTTCTTTCATC...TCAAG|AAA | 0 | 1 | 81.362 |
| 30552477 | GT-AG | 0 | 1.000000099473604e-05 | 619 | rna-XM_042331990.1 5963396 | 31 | 136461289 | 136461907 | Callorhinchus milii 7868 | GAG|GTGAGGTTCC...CTCACCTTGGCT/ATTCCCCTCACC...CCCAG|GAT | 0 | 1 | 83.154 |
| 30552478 | GT-AG | 0 | 1.000000099473604e-05 | 607 | rna-XM_042331990.1 5963396 | 32 | 136462059 | 136462665 | Callorhinchus milii 7868 | TCG|GTAAGACCCT...TTTTCGTTAATT/GACATTCTTATT...TGAAG|TGT | 1 | 1 | 86.161 |
| 30552479 | GT-AG | 0 | 2.256276167607341e-05 | 559 | rna-XM_042331990.1 5963396 | 33 | 136462952 | 136463510 | Callorhinchus milii 7868 | CAG|GTACAATTCC...CGGTTCTGAAAG/ACGGTTCTGAAA...TGTAG|ATA | 2 | 1 | 91.856 |
| 30552480 | GT-AG | 0 | 1.000000099473604e-05 | 798 | rna-XM_042331990.1 5963396 | 34 | 136463686 | 136464483 | Callorhinchus milii 7868 | CAG|GTAGGTGGCC...TTATTATTAATC/TTATTATTAATC...AATAG|ATT | 0 | 1 | 95.341 |
| 30552481 | GT-AG | 0 | 1.000000099473604e-05 | 614 | rna-XM_042331990.1 5963396 | 35 | 136464565 | 136465178 | Callorhinchus milii 7868 | CAG|GTGAGTTCCC...TGTTATTTGACC/TGTTATTTGACC...AACAG|TCA | 0 | 1 | 96.953 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);