introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
26 rows where transcript_id = 24436942
This data as json, CSV (advanced)
Suggested facets: is_minor, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 133512354 | GT-AG | 0 | 1.95176064878523e-05 | 3702 | rna-XM_029779115.2 24436942 | 1 | 77970846 | 77974547 | Octopus sinensis 2607531 | CAG|GTAAATGTTT...ATTTTCTTAATT/TTTTATTTCATT...TTCAG|GAT | 1 | 1 | 10.474 | 
| 133512355 | GT-AG | 0 | 0.0677925309210502 | 4624 | rna-XM_029779115.2 24436942 | 2 | 77974594 | 77979217 | Octopus sinensis 2607531 | TAA|GTATGTTAAC...TTTTCTTTATTA/TTTTTCTTTATT...TTTAG|CCA | 2 | 1 | 11.508 | 
| 133512356 | GT-AG | 0 | 1.000000099473604e-05 | 332 | rna-XM_029779115.2 24436942 | 3 | 77979278 | 77979609 | Octopus sinensis 2607531 | AAT|GTAAGTACAA...GAATCATTGAAT/TGAATGTTGACT...TGCAG|AAT | 2 | 1 | 12.857 | 
| 133512357 | GT-AG | 0 | 8.683080263718357e-05 | 893 | rna-XM_029779115.2 24436942 | 4 | 77979722 | 77980614 | Octopus sinensis 2607531 | AAG|GTAATTTTTT...TCTTTTTTATTT/TTTTTATTTATT...TTTAG|GTG | 0 | 1 | 15.374 | 
| 133512358 | GT-AG | 0 | 1.000000099473604e-05 | 3491 | rna-XM_029779115.2 24436942 | 5 | 77980721 | 77984211 | Octopus sinensis 2607531 | GAG|GTGAGAGTCT...TTTTTTTTATAT/TTTTTTTTTATA...CACAG|GTA | 1 | 1 | 17.757 | 
| 133512359 | GT-AG | 1 | 99.9984657760266 | 752 | rna-XM_029779115.2 24436942 | 6 | 77984399 | 77985150 | Octopus sinensis 2607531 | TCA|GTATCCTTTT...TTTCCCTTAAAC/ATTTCCCTTAAA...TTTAG|ATC | 2 | 1 | 21.96 | 
| 133512360 | GT-AG | 0 | 1.000000099473604e-05 | 3612 | rna-XM_029779115.2 24436942 | 7 | 77985305 | 77988916 | Octopus sinensis 2607531 | AAG|GTTATTAAAT...AAATTTTTGATC/AAATTTTTGATC...TTTAG|GCT | 0 | 1 | 25.421 | 
| 133512361 | GT-AG | 0 | 1.000000099473604e-05 | 608 | rna-XM_029779115.2 24436942 | 8 | 77989132 | 77989739 | Octopus sinensis 2607531 | CAG|GTAAGAATTA...TATGTTTTAAAC/TATGTTTTAAAC...ACTAG|TAA | 2 | 1 | 30.254 | 
| 133512362 | GT-AG | 0 | 1.000000099473604e-05 | 11297 | rna-XM_029779115.2 24436942 | 9 | 77989842 | 78001138 | Octopus sinensis 2607531 | TGA|GTGAGTATAC...TGTTTCTTTTCT/TGTATACTAACC...ATCAG|TGT | 2 | 1 | 32.547 | 
| 133512363 | GT-AG | 0 | 0.0016472685013938 | 138 | rna-XM_029779115.2 24436942 | 10 | 78001235 | 78001372 | Octopus sinensis 2607531 | TAG|GTATATAACC...TGTATTTTAATA/ATATCTTTCATT...GACAG|ATT | 2 | 1 | 34.704 | 
| 133512364 | GT-AG | 0 | 1.7911461602449693e-05 | 1982 | rna-XM_029779115.2 24436942 | 11 | 78001479 | 78003460 | Octopus sinensis 2607531 | AAA|GTAAGTTAAT...AATATTTTAATA/AATATTTTAATA...CATAG|GTT | 0 | 1 | 37.087 | 
| 133512365 | GT-AG | 0 | 2.98612638412586e-05 | 699 | rna-XM_029779115.2 24436942 | 12 | 78003636 | 78004334 | Octopus sinensis 2607531 | CTG|GTAGGGTTTT...GTGTTCTTAAAT/TGTGTTCTTAAA...TGAAG|GAG | 1 | 1 | 41.02 | 
| 133512366 | GT-AG | 0 | 3.98091107681328e-05 | 126 | rna-XM_029779115.2 24436942 | 13 | 78004432 | 78004557 | Octopus sinensis 2607531 | CAC|GTAAGTAGTA...ATTTCTTTATCT/AATATATTCATT...TACAG|AAT | 2 | 1 | 43.201 | 
| 133512367 | GT-AG | 0 | 1.000000099473604e-05 | 1357 | rna-XM_029779115.2 24436942 | 14 | 78004653 | 78006009 | Octopus sinensis 2607531 | TAG|GTAAGTGTGA...TTTTTTTTAAAA/TTTTTTTTTAAA...TTTAG|TTT | 1 | 1 | 45.336 | 
| 133512368 | GT-AG | 0 | 1.000000099473604e-05 | 387 | rna-XM_029779115.2 24436942 | 15 | 78006128 | 78006514 | Octopus sinensis 2607531 | TAG|GTTAATATTT...ATACTTTTGATA/TATGTATTTATT...TTCAG|TAA | 2 | 1 | 47.988 | 
| 133512369 | GT-AG | 0 | 1.4711032813348937e-05 | 1444 | rna-XM_029779115.2 24436942 | 16 | 78006909 | 78008352 | Octopus sinensis 2607531 | AGT|GTAAGGCTTT...TATCTTTTATTA/TTATCTTTTATT...TTCAG|CAA | 0 | 1 | 56.844 | 
| 133512370 | GT-AG | 0 | 1.000000099473604e-05 | 3364 | rna-XM_029779115.2 24436942 | 17 | 78008649 | 78012012 | Octopus sinensis 2607531 | TGT|GTAAGTGAAA...TTTTCATTAATT/ATTATTTTCATT...TCTAG|GAG | 2 | 1 | 63.497 | 
| 133512371 | GT-AG | 0 | 1.000000099473604e-05 | 3728 | rna-XM_029779115.2 24436942 | 18 | 78012139 | 78015866 | Octopus sinensis 2607531 | TAG|GTAATGACAC...TGTTTTTCAACT/TACTTACTCACC...TGTAG|TGA | 2 | 1 | 66.33 | 
| 133512372 | GT-AG | 0 | 3.7855987875501504e-05 | 119 | rna-XM_029779115.2 24436942 | 19 | 78015946 | 78016064 | Octopus sinensis 2607531 | GAT|GTAAGTATGA...TTTTTTTTATAC/CTTTTTTTTATA...TGTAG|ATT | 0 | 1 | 68.105 | 
| 133512373 | GT-AG | 1 | 99.48738728968485 | 5537 | rna-XM_029779115.2 24436942 | 20 | 78016300 | 78021836 | Octopus sinensis 2607531 | AAC|GTATCCTTAA...TTTTCCTTAACT/TTTTCCTTAACT...TTCAG|ACC | 1 | 1 | 73.387 | 
| 133512374 | GT-AG | 0 | 1.000000099473604e-05 | 2378 | rna-XM_029779115.2 24436942 | 21 | 78021990 | 78024367 | Octopus sinensis 2607531 | CAA|GTGAGTATGT...TGTTTCTTAATT/TTGTTTCTTAAT...ATTAG|CCA | 1 | 1 | 76.826 | 
| 133512375 | GT-AG | 0 | 1.000000099473604e-05 | 506 | rna-XM_029779115.2 24436942 | 22 | 78024484 | 78024989 | Octopus sinensis 2607531 | AGA|GTGAGTATTT...ACTTCTTTACTT/AACTTCTTTACT...TTCAG|CAA | 0 | 1 | 79.434 | 
| 133512376 | GT-AG | 0 | 7.167056543445715e-05 | 1422 | rna-XM_029779115.2 24436942 | 23 | 78025156 | 78026577 | Octopus sinensis 2607531 | AAT|GTAAGTTGCA...ACATTTTTATTT/TGATTACTCATT...TGTAG|ACT | 1 | 1 | 83.165 | 
| 133512377 | GT-AG | 0 | 1.000000099473604e-05 | 452 | rna-XM_029779115.2 24436942 | 24 | 78026765 | 78027216 | Octopus sinensis 2607531 | AGG|GTAAGATATC...TGTTGCTTAGTG/GTGTTGCTTAGT...CTTAG|ACT | 2 | 1 | 87.368 | 
| 133512378 | GT-AG | 0 | 1.000000099473604e-05 | 225 | rna-XM_029779115.2 24436942 | 25 | 78027434 | 78027658 | Octopus sinensis 2607531 | CAG|GTGAGACTCA...TATTTTTTCATT/TATTTTTTCATT...TTTAG|GCC | 0 | 1 | 92.245 | 
| 133512379 | GT-AG | 0 | 0.0002297558325516 | 1655 | rna-XM_029779115.2 24436942 | 26 | 78027823 | 78029477 | Octopus sinensis 2607531 | AAA|GTAAGTTCTT...TCCTTTTTAATT/TCCTTTTTAATT...TACAG|GTA | 2 | 1 | 95.932 | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);