introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in coding sequence (0, 1, or 2; null for introns outside of coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
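As a quick sanity check on the column definitions above, the sketch below models one row from this page and tests two relationships that hold for the rows shown here, although the source does not state them as guarantees: `length` equals `end - start + 1` (the coordinates appear to be inclusive), and `dinucleotide_pair` matches the first and last two intron bases in `scored_motifs`. The `Intron` class and the `terminal_dinucleotides` helper are hypothetical names introduced for illustration, and reading `|` in `scored_motifs` as an exon/intron boundary is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Intron:
    """Hypothetical container for a subset of the introns columns."""
    id: int
    dinucleotide_pair: str
    length: int
    start: int
    end: int
    scored_motifs: str


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return e.g. 'GT-AG', assuming '|' marks the exon/intron boundaries."""
    first_bar = scored_motifs.index("|")
    last_bar = scored_motifs.rindex("|")
    intron_part = scored_motifs[first_bar + 1:last_bar]
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# Row 101755530, copied from the table on this page.
row = Intron(
    id=101755530,
    dinucleotide_pair="GT-AG",
    length=719,
    start=8792320,
    end=8793038,
    scored_motifs="AAG|GTAAATAATT...TTTTCTTTCTCT/TCTTGTTTTATA...TGAAG|GAG",
)

# Inclusive coordinates: length should equal end - start + 1.
length_matches = row.length == row.end - row.start + 1
# Terminal bases in scored_motifs should agree with dinucleotide_pair.
pair_matches = terminal_dinucleotides(row.scored_motifs) == row.dinucleotide_pair
print(length_matches, pair_matches)
```

Checks of this kind are only as reliable as the assumed reading of `scored_motifs`; the authoritative description of that field is the data source cited at the top of the page.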
23 rows where transcript_id = 19079872
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 101755530 | GT-AG | 0 | 1.000000099473604e-05 | 719 | rna-XM_042870635.1 19079872 | 4 | 8792320 | 8793038 | Lagopus leucura 30410 | AAG|GTAAATAATT...TTTTCTTTCTCT/TCTTGTTTTATA...TGAAG|GAG | 0 | 1 | 5.667 | 
| 101755531 | GT-AG | 0 | 1.000000099473604e-05 | 5350 | rna-XM_042870635.1 19079872 | 5 | 8786908 | 8792257 | Lagopus leucura 30410 | AAG|GTATTAGAAC...TCCACCATGCCT/CGAAAGCCCACC...CCTAG|GGA | 2 | 1 | 6.726 | 
| 101755532 | GT-AG | 0 | 1.000000099473604e-05 | 2159 | rna-XM_042870635.1 19079872 | 6 | 8784554 | 8786712 | Lagopus leucura 30410 | CCG|GTGAGTTTTC...TTCTTCTAAATT/TTGGATTTCATT...TGTAG|GCA | 2 | 1 | 10.055 | 
| 101755533 | GT-AG | 0 | 1.000000099473604e-05 | 962 | rna-XM_042870635.1 19079872 | 7 | 8781516 | 8782477 | Lagopus leucura 30410 | AAG|GTTGGTAGCT...TTTTTTTTAGAT/TTTAGATTCAAA...TGAAG|GTT | 2 | 1 | 45.493 | 
| 101755534 | GT-AG | 0 | 1.000000099473604e-05 | 910 | rna-XM_042870635.1 19079872 | 8 | 8779984 | 8780893 | Lagopus leucura 30410 | GGA|GTAAGTGTAC...CTTTTGTTAACA/CTTTTGTTAACA...CGTAG|GAA | 0 | 1 | 56.111 | 
| 101755535 | GT-AG | 0 | 1.000000099473604e-05 | 946 | rna-XM_042870635.1 19079872 | 9 | 8778968 | 8779913 | Lagopus leucura 30410 | AAG|GTGAGTGACT...GTACCATTATAT/TATATGCTAAAT...TCCAG|AAC | 1 | 1 | 57.306 | 
| 101755536 | GT-AG | 0 | 1.7316684398598397e-05 | 92 | rna-XM_042870635.1 19079872 | 10 | 8778721 | 8778812 | Lagopus leucura 30410 | CAG|GTAACAGTAG...CTACCATTAATA/CTACCATTAATA...TGCAG|ATA | 0 | 1 | 59.952 | 
| 101755537 | GT-AG | 0 | 1.000000099473604e-05 | 268 | rna-XM_042870635.1 19079872 | 11 | 8778392 | 8778659 | Lagopus leucura 30410 | AAG|GTATGAGATA...TTCTCTTTTACC/TTCTCTTTTACC...ATTAG|AGA | 1 | 1 | 60.994 | 
| 101755538 | GT-AG | 0 | 0.0002731876343915 | 858 | rna-XM_042870635.1 19079872 | 12 | 8777201 | 8778058 | Lagopus leucura 30410 | ATG|GTATGTGTGG...TGCATCTTACAG/TTGCATCTTACA...TTCAG|AAG | 1 | 1 | 66.678 | 
| 101755539 | GT-AG | 0 | 1.000000099473604e-05 | 359 | rna-XM_042870635.1 19079872 | 13 | 8776067 | 8776425 | Lagopus leucura 30410 | AAG|GTAAGATTGT...GCTACTTTATTT/TGCTACTTTATT...TACAG|TGT | 2 | 1 | 79.908 | 
| 101755540 | GT-AG | 0 | 0.0053561862069462 | 483 | rna-XM_042870635.1 19079872 | 14 | 8775528 | 8776010 | Lagopus leucura 30410 | CAG|GTATATCTTT...TGTTTCTTCACA/TGTTTCTTCACA...TACAG|AAG | 1 | 1 | 80.864 | 
| 101755541 | GT-AG | 0 | 1.954398465337167e-05 | 305 | rna-XM_042870635.1 19079872 | 15 | 8775077 | 8775381 | Lagopus leucura 30410 | GTT|GTAAGTAAGG...AGACCTTTAGTG/TTTAGTGTGATG...TCTAG|AGC | 0 | 1 | 83.356 | 
| 101755542 | GT-AG | 0 | 3.599770994824329e-05 | 1436 | rna-XM_042870635.1 19079872 | 16 | 8773506 | 8774941 | Lagopus leucura 30410 | AAG|GTACTTCAAT...AAAGCTTTAAAT/ACTGTTTTCAGT...GTCAG|ACC | 0 | 1 | 85.661 | 
| 101755543 | GT-AG | 0 | 1.000000099473604e-05 | 445 | rna-XM_042870635.1 19079872 | 17 | 8772944 | 8773388 | Lagopus leucura 30410 | AAG|GTTGGTTTGT...TAGTTCTTAAAT/TTAGTTCTTAAA...AACAG|AAC | 0 | 1 | 87.658 | 
| 101755544 | GC-AG | 0 | 1.000000099473604e-05 | 1636 | rna-XM_042870635.1 19079872 | 18 | 8771116 | 8772751 | Lagopus leucura 30410 | AAG|GCAAGTGTTT...TTTTTTTTTTTT/CATGTTATTACA...TGAAG|GTT | 0 | 1 | 90.935 | 
| 101755545 | GT-AG | 0 | 6.809131103518767e-05 | 743 | rna-XM_042870635.1 19079872 | 19 | 8770310 | 8771052 | Lagopus leucura 30410 | CAT|GTAAGTCTTT...CTCTCCTTGTTC/TTTTGTTTCACT...CACAG|ACT | 0 | 1 | 92.011 | 
| 101755546 | GT-AG | 0 | 1.000000099473604e-05 | 394 | rna-XM_042870635.1 19079872 | 20 | 8769796 | 8770189 | Lagopus leucura 30410 | CAG|GTAAGAGTAT...TAACTGTTAACT/TAACTGTTAACT...TCAAG|TCA | 0 | 1 | 94.059 | 
| 101755547 | GT-AG | 0 | 0.013120417588724 | 862 | rna-XM_042870635.1 19079872 | 21 | 8768862 | 8769723 | Lagopus leucura 30410 | ATG|GTAACTTCCC...TTTTTCTTCATA/TTTTTCTTCATA...TACAG|ATC | 0 | 1 | 95.288 | 
| 101755548 | GT-AG | 0 | 1.000000099473604e-05 | 563 | rna-XM_042870635.1 19079872 | 22 | 8768227 | 8768789 | Lagopus leucura 30410 | TGG|GTAATGAGCA...TTTTTTTTTTCT/CCCTGGGTGATT...ACCAG|TGG | 0 | 1 | 96.518 | 
| 101755549 | GT-AG | 0 | 0.004440561743594 | 998 | rna-XM_042870635.1 19079872 | 23 | 8767082 | 8768079 | Lagopus leucura 30410 | CAG|GTATTTTATT...AATCTTTTAGTT/ATTTTTTTCAGT...TCTAG|GAT | 0 | 1 | 99.027 | 
| 101760739 | GT-AG | 0 | 1.000000099473604e-05 | 524 | rna-XM_042870635.1 19079872 | 1 | 8799467 | 8799990 | Lagopus leucura 30410 | TGT|GTGAGTAGCG...TTTTCCTTTGTC/TCCTTTGTCATT...GGCAG|TGG | | 0 | 1.434 | 
| 101760740 | GT-AG | 0 | 1.000000099473604e-05 | 1621 | rna-XM_042870635.1 19079872 | 2 | 8797720 | 8799340 | Lagopus leucura 30410 | GAG|GTTTGGTGGA...TGATTCTAAATA/GTGATTCTAAAT...CACAG|AAA | | 0 | 3.585 | 
| 101760741 | GT-AG | 0 | 1.000000099473604e-05 | 4527 | rna-XM_042870635.1 19079872 | 3 | 8793125 | 8797651 | Lagopus leucura 30410 | CAG|GTGAGTATGT...TTGTTCTTGAAT/TTGTTCTTGAAT...TGCAG|GTA | | 0 | 4.746 | 
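The table above is sorted by `id`, so introns 1 to 3 appear after intron 23. The sketch below shows one way to reproduce the `transcript_id = 19079872` filter in transcript order using Python's built-in `sqlite3` module. It loads five rows copied from this page into a reduced, in-memory version of the table; the reduced column set is a simplification for illustration, and `phase` is taken as NULL for the three `in_cds = 0` rows, which is an assumption based on the column description.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced version of the introns table: only the columns used below.
# "end" is quoted because END is an SQL keyword.
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        start INTEGER,
        "end" INTEGER,
        phase INTEGER,
        in_cds INTEGER
    )
    """
)

# Five rows copied from the table on this page.
# Assumption: phase is NULL (None) where in_cds = 0.
rows = [
    (101755530, 19079872, 4, 8792320, 8793038, 0, 1),
    (101755531, 19079872, 5, 8786908, 8792257, 2, 1),
    (101760739, 19079872, 1, 8799467, 8799990, None, 0),
    (101760740, 19079872, 2, 8797720, 8799340, None, 0),
    (101760741, 19079872, 3, 8793125, 8797651, None, 0),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)", rows)

# Same filter as the page, but ordered by position within the transcript.
ordered = conn.execute(
    """
    SELECT ordinal_index, start, "end", phase, in_cds
    FROM introns
    WHERE transcript_id = ?
    ORDER BY ordinal_index
    """,
    (19079872,),
).fetchall()

for record in ordered:
    print(record)
```

In these rows `start` decreases as `ordinal_index` increases, which would be consistent with a transcript on the reverse strand; the table has no strand column, so that reading is an inference and not something the page states.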
CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
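To illustrate what the indexes above are for, the following sketch asks SQLite how it would execute the `transcript_id` filter used on this page. It recreates a reduced copy of the schema in memory (a subset of columns and only one of the indexes, chosen for brevity) and inspects the output of `EXPLAIN QUERY PLAN`. The exact wording of the plan text varies between SQLite versions, so the example only checks that the index name appears in it.

```python
import sqlite3

# Reduced copy of the schema above: a few columns and one index.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "in_cds" INTEGER,
  PRIMARY KEY ([id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Each plan row ends with a human-readable detail string.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (19079872,),
).fetchall()
plan_text = " ".join(str(row[-1]) for row in plan)

uses_index = "idx_introns_transcript_id" in plan_text
print(plan_text)
print(uses_index)
```

A plan that mentions the index indicates a lookup by `transcript_id` instead of a scan of the whole table, which matters for a table whose `id` values run into the hundreds of millions.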