introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
26 rows where transcript_id = 6787108
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 35253171 | GT-AG | 0 | 1.3548353154192443e-05 | 116198 | rna-XM_041197339.1 6787108 | 1 | 76298999 | 76415196 | Carcharodon carcharias 13397 | AGC|GTAAGTCAGT...CATTTCTTATAT/CTTTCATTTACT...TGCAG|GTG | 0 | 1 | 10.433 |
| 35253172 | GT-AG | 0 | 1.000000099473604e-05 | 145396 | rna-XM_041197339.1 6787108 | 2 | 76153535 | 76298930 | Carcharodon carcharias 13397 | GAG|GTGAGTCACC...TGCTTTTTATTT/TTGCTTTTTATT...GACAG|GTC | 2 | 1 | 12.437 |
| 35253173 | GT-AG | 0 | 1.000000099473604e-05 | 217409 | rna-XM_041197339.1 6787108 | 3 | 75936026 | 76153434 | Carcharodon carcharias 13397 | AAG|GTAGGAGAAA...TTCTTTTTATTC/TTTCTTTTTATT...TTAAG|ATA | 0 | 1 | 15.385 |
| 35253174 | GT-AG | 0 | 1.000000099473604e-05 | 9035 | rna-XM_041197339.1 6787108 | 4 | 75926826 | 75935860 | Carcharodon carcharias 13397 | CGG|GTAAGTATGG...GCAATTTTACTG/ATTTTACTGAAA...TCCAG|CTA | 0 | 1 | 20.248 |
| 35253175 | GT-AG | 0 | 1.000000099473604e-05 | 27895 | rna-XM_041197339.1 6787108 | 5 | 75898854 | 75926748 | Carcharodon carcharias 13397 | TTG|GTGAGAATCA...AATTCCTCATTG/GAATTCCTCATT...TCCAG|GTC | 2 | 1 | 22.517 |
| 35253176 | GT-AG | 0 | 1.000000099473604e-05 | 3065 | rna-XM_041197339.1 6787108 | 6 | 75895724 | 75898788 | Carcharodon carcharias 13397 | GAG|GTAAAGATTG...ATTGTTTTACTT/TATTTATTTATT...TGCAG|GCT | 1 | 1 | 24.433 |
| 35253177 | GT-AG | 0 | 1.000000099473604e-05 | 3746 | rna-XM_041197339.1 6787108 | 7 | 75891859 | 75895604 | Carcharodon carcharias 13397 | CAG|GTAATTAATT...ACAATCTTGAAC/TAAGTATTGATT...TTCAG|GCA | 0 | 1 | 27.94 |
| 35253178 | GT-AG | 0 | 0.0003798051451524 | 12678 | rna-XM_041197339.1 6787108 | 8 | 75879091 | 75891768 | Carcharodon carcharias 13397 | AAG|GTAACTGCTT...TATTTTTTAATG/TATTTTTTAATG...TCTAG|GAC | 0 | 1 | 30.592 |
| 35253179 | GT-AG | 0 | 1.000000099473604e-05 | 13353 | rna-XM_041197339.1 6787108 | 9 | 75865558 | 75878910 | Carcharodon carcharias 13397 | GAG|GTAAATCCTT...GTTTCCTTTTTT/TTTTTTCTTCTT...TAAAG|GTG | 0 | 1 | 35.897 |
| 35253180 | GT-AG | 0 | 5.077536806755929e-05 | 26289 | rna-XM_041197339.1 6787108 | 10 | 75839122 | 75865410 | Carcharodon carcharias 13397 | CTA|GTAAGTGTCT...ATGGTCTTAAAT/ATGGTCTTAAAT...TGTAG|CCT | 0 | 1 | 40.23 |
| 35253181 | GT-AG | 0 | 1.000000099473604e-05 | 12616 | rna-XM_041197339.1 6787108 | 11 | 75826403 | 75839018 | Carcharodon carcharias 13397 | CAG|GTAAAATTAA...GACTTCTTGTCT/CTTTGTATAATA...CTTAG|GAA | 1 | 1 | 43.266 |
| 35253182 | GC-AG | 0 | 1.000000099473604e-05 | 2408 | rna-XM_041197339.1 6787108 | 12 | 75823918 | 75826325 | Carcharodon carcharias 13397 | CAG|GCAAGTAAAC...AGAGCATTAACA/TTTTTTGTCATT...TTCAG|GTA | 0 | 1 | 45.535 |
| 35253183 | GT-AG | 0 | 1.000000099473604e-05 | 1892 | rna-XM_041197339.1 6787108 | 13 | 75821934 | 75823825 | Carcharodon carcharias 13397 | AAA|GTAAGAAATT...TGTTTTTTGTTG/CAAAATATGATC...TCTAG|GTT | 2 | 1 | 48.246 |
| 35253184 | GT-AG | 0 | 1.000000099473604e-05 | 25182 | rna-XM_041197339.1 6787108 | 14 | 75796644 | 75821825 | Carcharodon carcharias 13397 | GAC|GTGAGTATCT...ACTGTTTTGAAA/ACTGTTTTGAAA...TCCAG|GAT | 2 | 1 | 51.429 |
| 35253185 | GT-AG | 0 | 0.0110942663638676 | 2638 | rna-XM_041197339.1 6787108 | 15 | 75793888 | 75796525 | Carcharodon carcharias 13397 | CAG|GTATCGTAGA...GATGCTATAATT/TATATACTGATT...CTTAG|GAT | 0 | 1 | 54.907 |
| 35253186 | GT-AG | 0 | 3.865936335683636e-05 | 1883 | rna-XM_041197339.1 6787108 | 16 | 75791871 | 75793753 | Carcharodon carcharias 13397 | GTG|GTAATTACTT...TTGTCTTTATTT/CTTTATTTGATT...TGCAG|TGA | 2 | 1 | 58.856 |
| 35253187 | GT-AG | 0 | 1.000000099473604e-05 | 9454 | rna-XM_041197339.1 6787108 | 17 | 75782303 | 75791756 | Carcharodon carcharias 13397 | TGG|GTAGGTACAG...AAGCTCTTGTCT/AGGATGCTAAAG...AACAG|ATA | 2 | 1 | 62.216 |
| 35253188 | GT-AG | 0 | 6.438454489335434e-05 | 16368 | rna-XM_041197339.1 6787108 | 18 | 75765763 | 75782130 | Carcharodon carcharias 13397 | CAG|GTAACATGAG...GTCTTTTTGAAC/GTCTTTTTGAAC...GTTAG|GGT | 0 | 1 | 67.286 |
| 35253189 | GT-AG | 0 | 0.0001828785379559 | 1011 | rna-XM_041197339.1 6787108 | 19 | 75764565 | 75765575 | Carcharodon carcharias 13397 | ACT|GTAAGTTGGT...GATGTTTTACTA/AGATGTTTTACT...CTCAG|CAC | 1 | 1 | 72.797 |
| 35253190 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_041197339.1 6787108 | 20 | 75764332 | 75764430 | Carcharodon carcharias 13397 | GAG|GTGCGTGTCA...ATGTTTTTGAAC/ATGTTTTTGAAC...TGCAG|CCT | 0 | 1 | 76.746 |
| 35253191 | GT-AG | 0 | 1.000000099473604e-05 | 1608 | rna-XM_041197339.1 6787108 | 21 | 75762600 | 75764207 | Carcharodon carcharias 13397 | TTG|GTAAGAAACT...AATGCATTGATG/AATGCATTGATG...TTCAG|ATC | 1 | 1 | 80.401 |
| 35253192 | GT-AG | 0 | 0.0008207390778718 | 10061 | rna-XM_041197339.1 6787108 | 22 | 75752426 | 75762486 | Carcharodon carcharias 13397 | GAG|GTATTTGCTA...GACACCTTGAAA/TTTTATCTGATG...TGCAG|ACC | 0 | 1 | 83.731 |
| 35253193 | GT-AG | 0 | 9.882257721349937e-05 | 4239 | rna-XM_041197339.1 6787108 | 23 | 75748061 | 75752299 | Carcharodon carcharias 13397 | AAG|GTACCAAATT...TGTGCTTTTCCC/TATAAACTGATA...CCCAG|GGA | 0 | 1 | 87.445 |
| 35253194 | GT-AG | 0 | 1.000000099473604e-05 | 1075 | rna-XM_041197339.1 6787108 | 24 | 75746886 | 75747960 | Carcharodon carcharias 13397 | CGG|GTAAGACCAT...TCAATCTCAATA/ATCAATCTCAAT...TTCAG|TTT | 1 | 1 | 90.392 |
| 35253195 | GT-AG | 0 | 1.000000099473604e-05 | 20297 | rna-XM_041197339.1 6787108 | 25 | 75726486 | 75746782 | Carcharodon carcharias 13397 | GAG|GTAAGTACTT...TCTCCCTTTGTG/TGCACACTGACA...GGCAG|GAA | 2 | 1 | 93.428 |
| 35253196 | GT-AG | 0 | 0.1574885735141567 | 6522 | rna-XM_041197339.1 6787108 | 26 | 75719806 | 75726327 | Carcharodon carcharias 13397 | GAG|GTAGCCTGTT...TGTCTTTTATTC/TTGTCTTTTATT...TACAG|GTG | 1 | 1 | 98.084 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);