introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
15 rows where transcript_id = 5879266
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 29939630 | GT-AG | 0 | 0.0001976275399959 | 3433 | rna-XM_014966678.1 5879266 | 1 | 121121 | 124553 | Calidris pugnax 198806 | TCG|GTGGCTATTC...GCTCTTTTGAAA/CAGAGGTTTATA...TGCAG|GAG | 1 | 1 | 28.101 | 
| 29939631 | GT-AG | 0 | 1.000000099473604e-05 | 241 | rna-XM_014966678.1 5879266 | 2 | 124690 | 124930 | Calidris pugnax 198806 | GGG|GTAAGTAGCA...TGTCTTTTAAAT/TTTAAATTCATT...TGCAG|CTA | 2 | 1 | 33.492 | 
| 29939632 | GT-AG | 0 | 1.000000099473604e-05 | 2087 | rna-XM_014966678.1 5879266 | 3 | 124983 | 127069 | Calidris pugnax 198806 | CAG|GTAAAACACG...TGTTCCTGGGTT/CCTGGGTTCAAT...AGCAG|ACG | 0 | 1 | 35.553 | 
| 29939633 | GT-AG | 0 | 1.000000099473604e-05 | 5542 | rna-XM_014966678.1 5879266 | 4 | 127191 | 132732 | Calidris pugnax 198806 | CCG|GTAAGAGCCC...ACTGCTCTGACG/ACTGCTCTGACG...TGTAG|GTT | 1 | 1 | 40.349 | 
| 29939634 | GT-AG | 0 | 6.001781906468753e-05 | 1308 | rna-XM_014966678.1 5879266 | 5 | 132920 | 134227 | Calidris pugnax 198806 | GAG|GTAACGCCGG...GTTTCCTTCTTG/TTTGCACCCATG...TCCAG|ACC | 2 | 1 | 47.761 | 
| 29939635 | GT-AG | 0 | 0.0001481199490192 | 1362 | rna-XM_014966678.1 5879266 | 6 | 134396 | 135757 | Calidris pugnax 198806 | CAG|GTAACGCTCC...TTTCCCTTGCTT/AATTGGCTCATC...TCCAG|CTA | 2 | 1 | 54.419 | 
| 29939636 | GT-AG | 0 | 1.000000099473604e-05 | 1416 | rna-XM_014966678.1 5879266 | 7 | 135973 | 137388 | Calidris pugnax 198806 | AGA|GTGAGTGTGC...TTTTTGTTACCA/TTTTTTGTTACC...CTCAG|GCT | 1 | 1 | 62.941 | 
| 29939637 | GT-AG | 0 | 1.000000099473604e-05 | 3640 | rna-XM_014966678.1 5879266 | 8 | 137570 | 141209 | Calidris pugnax 198806 | GGG|GTAAGAGCAA...AAGGCCTTCATT/AAGGCCTTCATT...CCCAG|CAG | 2 | 1 | 70.115 | 
| 29939638 | GT-AG | 0 | 1.6895242672782463e-05 | 612 | rna-XM_014966678.1 5879266 | 9 | 141248 | 141859 | Calidris pugnax 198806 | GTG|GTAAGTTTTT...ACCTTCTTATGG/CACCTTCTTATG...GGCAG|GAC | 1 | 1 | 71.621 | 
| 29939639 | GT-AG | 0 | 1.000000099473604e-05 | 739 | rna-XM_014966678.1 5879266 | 10 | 141893 | 142631 | Calidris pugnax 198806 | TAG|GTAATTCCCA...ACACCGTTAATT/CCGTTAATTATC...CCTAG|GCC | 1 | 1 | 72.929 | 
| 29939640 | GT-AG | 0 | 1.000000099473604e-05 | 585 | rna-XM_014966678.1 5879266 | 11 | 142751 | 143335 | Calidris pugnax 198806 | AAG|GTGAGAAGGA...ACGTTTTTGCTG/CTGGAGGTGACG...GCCAG|ATG | 0 | 1 | 77.646 | 
| 29939641 | GT-AG | 0 | 1.000000099473604e-05 | 983 | rna-XM_014966678.1 5879266 | 12 | 143466 | 144448 | Calidris pugnax 198806 | ATG|GTGAGTAGGT...GTGTCCCTCGTG/TGATGGGACACC...CCTAG|GGA | 1 | 1 | 82.798 | 
| 29939642 | GT-AG | 0 | 1.000000099473604e-05 | 649 | rna-XM_014966678.1 5879266 | 13 | 144572 | 145220 | Calidris pugnax 198806 | GAG|GTAACACACA...AGAGCTTTTCCT/CTGGGATTCACA...TGCAG|GTC | 1 | 1 | 87.673 | 
| 29939643 | GT-AG | 0 | 1.000000099473604e-05 | 506 | rna-XM_014966678.1 5879266 | 15 | 145399 | 145904 | Calidris pugnax 198806 | CCC|GTGAGTATCC...TTTTTTTTAACA/TTTTTTTTAACA...CCCAG|GGG | 1 | 1 | 94.689 | 
| 29939644 | GT-AG | 0 | 1.000000099473604e-05 | 333 | rna-XM_014966678.1 5879266 | 16 | 146003 | 146335 | Calidris pugnax 198806 | CAG|GTAGGTGTGG...CACCCCACACCT/CCAAGGTTCACC...CCCAG|GAG | 0 | 1 | 98.573 | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);