introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
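The column descriptions do not say how `start`, `end`, and `length` relate. In the rows listed on this page the values are consistent with closed (inclusive) coordinates, so that `length = end - start + 1`. The sketch below checks that against three rows copied from the table; the helper name `span_length` is hypothetical, and the inclusive-coordinate convention is an inference from these rows, not something the schema states.

```python
# (id, start, end, length) copied from rows of transcript 11478638 on this page.
rows = [
    (63112197, 1445788, 1445854, 67),
    (63112198, 1444930, 1445696, 767),
    (63112218, 1432068, 1433374, 1307),
]


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed inclusive on both ends)."""
    return end - start + 1


# Collect any row whose stored length disagrees with the inclusive-interval rule.
mismatches = [
    intron_id
    for intron_id, start, end, length in rows
    if span_length(start, end) != length
]
print(mismatches)  # an empty list means every sampled row fits the rule
```

If other rows in the database break this rule, the convention assumed here would need revisiting.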
24 rows where transcript_id = 11478638
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 63112197 | GT-AG | 0 | 1.000000099473604e-05 | 67 | rna-DGYR_LOCUS365 11478638 | 1 | 1445788 | 1445854 | Dimorphilus gyrociliatus 2664684 | TAT\|GTCAGTATAA...ATTAATTTAATT/ATTAATTTAATT...TATAG\|GTA | 2 | 1 | 1.345 |
| 63112198 | GT-AG | 0 | 0.0018930405741932 | 767 | rna-DGYR_LOCUS365 11478638 | 2 | 1444930 | 1445696 | Dimorphilus gyrociliatus 2664684 | GAG\|GTATATATTC...CAAACATTATTT/ACATTATTTATA...TACAG\|TTA | 0 | 1 | 2.12 |
| 63112199 | GT-AG | 0 | 1.6231473901663484e-05 | 283 | rna-DGYR_LOCUS365 11478638 | 3 | 1444542 | 1444824 | Dimorphilus gyrociliatus 2664684 | AAG\|GTAACAATCA...GATGTTTTGTCA/TTTAATGTAATT...TTCAG\|GGA | 0 | 1 | 3.014 |
| 63112200 | GT-AG | 0 | 0.0985191125799871 | 67 | rna-DGYR_LOCUS365 11478638 | 4 | 1444396 | 1444462 | Dimorphilus gyrociliatus 2664684 | ATG\|GTATTTTTTA...TTTTTTTTATAA/ATTTTTTTTATA...TACAG\|TGC | 1 | 1 | 3.687 |
| 63112201 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-DGYR_LOCUS365 11478638 | 5 | 1443560 | 1443621 | Dimorphilus gyrociliatus 2664684 | TTG\|GTAAGTAAAT...ATAGTTTTAATC/ATAGTTTTAATC...TCAAG\|TGA | 1 | 1 | 10.277 |
| 63112202 | GT-AG | 0 | 1.3957701799636209e-05 | 58 | rna-DGYR_LOCUS365 11478638 | 6 | 1441925 | 1441982 | Dimorphilus gyrociliatus 2664684 | AAG\|GTAGACCCTA...ATATTTTTCATT/ATATTTTTCATT...TATAG\|GTC | 0 | 1 | 23.704 |
| 63112203 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-DGYR_LOCUS365 11478638 | 7 | 1441442 | 1441498 | Dimorphilus gyrociliatus 2664684 | AAG\|GTAAATAACG...TCAACTTTATTA/ACTTTATTAAAT...CACAG\|GTA | 0 | 1 | 27.331 |
| 63112204 | GT-AG | 0 | 0.0001266737285516 | 437 | rna-DGYR_LOCUS365 11478638 | 8 | 1440885 | 1441321 | Dimorphilus gyrociliatus 2664684 | GAT\|GTAAGTATAA...ATATCTTTGATT/CTTTGATTTATT...AATAG\|GCC | 0 | 1 | 28.352 |
| 63112205 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-DGYR_LOCUS365 11478638 | 9 | 1440489 | 1440564 | Dimorphilus gyrociliatus 2664684 | AAG\|GTTTGTAAAT...ATTATTTTACAT/CATTATTTTACA...ATTAG\|ATA | 2 | 1 | 31.077 |
| 63112206 | GT-AG | 0 | 1.000000099473604e-05 | 512 | rna-DGYR_LOCUS365 11478638 | 10 | 1438794 | 1439305 | Dimorphilus gyrociliatus 2664684 | GAG\|GTACGTAATA...GTTCTCTTCTCT/CTCTTTTTCATC...AATAG\|TGG | 0 | 1 | 41.149 |
| 63112207 | GT-AG | 0 | 1.000000099473604e-05 | 64 | rna-DGYR_LOCUS365 11478638 | 11 | 1438505 | 1438568 | Dimorphilus gyrociliatus 2664684 | TCG\|GTAGGGTATC...ACTATCTTATAT/CTTAATTTCACT...AATAG\|CCT | 0 | 1 | 43.065 |
| 63112208 | GT-AG | 0 | 0.0001527157895864 | 53 | rna-DGYR_LOCUS365 11478638 | 12 | 1437951 | 1438003 | Dimorphilus gyrociliatus 2664684 | GAG\|GTACAATTCT...ATAATCTTATTA/CATAATCTTATT...TATAG\|CAA | 0 | 1 | 47.331 |
| 63112209 | GT-AG | 0 | 1.7023883481123437e-05 | 57 | rna-DGYR_LOCUS365 11478638 | 13 | 1437246 | 1437302 | Dimorphilus gyrociliatus 2664684 | AGG\|GTTTGTAAAA...AGTGTCTTATCA/GAGTGTCTTATC...AATAG\|AGT | 0 | 1 | 52.848 |
| 63112210 | GT-AG | 0 | 1.000000099473604e-05 | 68 | rna-DGYR_LOCUS365 11478638 | 14 | 1437038 | 1437105 | Dimorphilus gyrociliatus 2664684 | CAG\|GTAAAACTTT...TATACTGTAATT/TATACTGTAATT...TATAG\|TCG | 2 | 1 | 54.04 |
| 63112211 | GT-AG | 0 | 1.000000099473604e-05 | 72 | rna-DGYR_LOCUS365 11478638 | 15 | 1436825 | 1436896 | Dimorphilus gyrociliatus 2664684 | TGA\|GTAAGATCAA...TGTACATTATAT/AGTATTATAATT...CTTAG\|GAT | 2 | 1 | 55.241 |
| 63112212 | GT-AG | 0 | 3.987089493962501e-05 | 65 | rna-DGYR_LOCUS365 11478638 | 16 | 1436300 | 1436364 | Dimorphilus gyrociliatus 2664684 | AAG\|GTAATCATCC...GATGTATTAACA/AAACTATTCATA...TTTAG\|AAT | 0 | 1 | 59.157 |
| 63112213 | GT-AG | 0 | 1.000000099473604e-05 | 82 | rna-DGYR_LOCUS365 11478638 | 17 | 1436056 | 1436137 | Dimorphilus gyrociliatus 2664684 | GAG\|GTAAGATTCA...TATTACTTAATT/TGATTATTTATT...ATCAG\|AGC | 0 | 1 | 60.536 |
| 63112214 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-DGYR_LOCUS365 11478638 | 18 | 1434524 | 1434583 | Dimorphilus gyrociliatus 2664684 | TAG\|GTAGATAAAC...TTAATATTAATA/TAATAACTGATT...TTTAG\|GGA | 2 | 1 | 73.069 |
| 63112215 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-DGYR_LOCUS365 11478638 | 19 | 1434204 | 1434258 | Dimorphilus gyrociliatus 2664684 | AAG\|GTAAAAGAGA...GTTCCTTTTATG/TGATATTTCAAC...TTCAG\|CTT | 0 | 1 | 75.326 |
| 63112216 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-DGYR_LOCUS365 11478638 | 20 | 1433995 | 1434070 | Dimorphilus gyrociliatus 2664684 | AAG\|GTACGTGAAA...TTCTTTTTAAAA/AATAATCTAATA...TTAAG\|GTG | 1 | 1 | 76.458 |
| 63112217 | GT-AG | 0 | 1.810611809687285e-05 | 61 | rna-DGYR_LOCUS365 11478638 | 21 | 1433741 | 1433801 | Dimorphilus gyrociliatus 2664684 | AAG\|GTTTGTAAAA...TTATCTTTAAAA/TATGATTTAATT...AACAG\|ACC | 2 | 1 | 78.101 |
| 63112218 | GT-AG | 0 | 1.000000099473604e-05 | 1307 | rna-DGYR_LOCUS365 11478638 | 22 | 1432068 | 1433374 | Dimorphilus gyrociliatus 2664684 | CAG\|GTAAGTGTAT...TTGCTCTTTACG/ATTATTCTAATA...TAAAG\|GGC | 2 | 1 | 81.218 |
| 63112219 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-DGYR_LOCUS365 11478638 | 23 | 1431850 | 1431908 | Dimorphilus gyrociliatus 2664684 | CAA\|GTAAGGAACT...CAACTTTTATAT/TCTATTCTAAGT...TGTAG\|ACA | 2 | 1 | 82.571 |
| 63112220 | GT-AG | 0 | 0.0004575534927744 | 64 | rna-DGYR_LOCUS365 11478638 | 24 | 1431270 | 1431333 | Dimorphilus gyrociliatus 2664684 | TAG\|GTTTGTTTCC...TTCCTCTTACAG/AAATTAATCATT...AACAG\|ATT | 2 | 1 | 86.965 |
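The filter behind this listing (`transcript_id = 11478638`) and a per-phase tally of its rows can be reproduced locally. The following is a minimal sketch that assumes a SQLite backend and uses an in-memory table with only a subset of the columns; the `(ordinal_index, length, phase)` tuples are copied from the 24 rows above, and the trimmed table definition is an illustration, not the full schema.

```python
import sqlite3

TRANSCRIPT_ID = 11478638

# (ordinal_index, length, phase) for each of the 24 introns listed above.
ROWS = [
    (1, 67, 2), (2, 767, 0), (3, 283, 0), (4, 67, 1), (5, 62, 1), (6, 58, 0),
    (7, 57, 0), (8, 437, 0), (9, 76, 2), (10, 512, 0), (11, 64, 0), (12, 53, 0),
    (13, 57, 0), (14, 68, 2), (15, 72, 2), (16, 65, 0), (17, 82, 0), (18, 60, 2),
    (19, 55, 0), (20, 76, 1), (21, 61, 2), (22, 1307, 2), (23, 59, 2), (24, 64, 2),
]

conn = sqlite3.connect(":memory:")
# Trimmed, illustrative version of the introns table (subset of columns only).
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, length INTEGER, phase INTEGER)"
)
# In the listing, ids run consecutively from 63112197 (ordinal 1) to 63112220 (ordinal 24).
conn.executemany(
    "INSERT INTO introns (id, transcript_id, ordinal_index, length, phase) "
    "VALUES (?, ?, ?, ?, ?)",
    [(63112196 + n, TRANSCRIPT_ID, n, length, phase) for n, length, phase in ROWS],
)

# Same filter as the page, grouped by intron phase.
phase_counts = dict(
    conn.execute(
        "SELECT phase, COUNT(*) FROM introns "
        "WHERE transcript_id = ? GROUP BY phase ORDER BY phase",
        (TRANSCRIPT_ID,),
    ).fetchall()
)
print(phase_counts)  # {0: 12, 1: 3, 2: 9}
```

Against the real database, the same `SELECT` would run unchanged, since it only touches columns that exist in the published schema.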
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
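The index on `transcript_id` is what lets a per-transcript lookup like the one on this page avoid a full table scan. As a rough check, the sketch below recreates the table and that single index in an in-memory SQLite database and inspects the query plan. It assumes SQLite (the bracket-quoted identifiers suggest it, but the page does not say so), and the exact wording of the plan text varies between SQLite versions, so only the index name is examined.

```python
import sqlite3

# DDL copied from this page; only one of the indexes is recreated here.
# SQLite accepts the foreign-key clauses even though the parent tables are absent.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Each plan row is (id, parent, notused, detail); the detail column names the index used.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (11478638,),
).fetchall()
plan_detail = " ".join(row[3] for row in plan)
print(plan_detail)
```

If the printed plan mentions `idx_introns_transcript_id`, the lookup is served by the index; a plan that reports a scan of `introns` would indicate the index is not being used.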