introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability that the intron is minor, expressed as a percentage (0-100)
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier of the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
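As a rough illustration of how `start`, `end`, `length`, and `phase` relate, the sketch below assumes coordinates are 1-based and inclusive. That is an inference from the rows on this page (where `end - start + 1` equals `length`), not a documented guarantee. The helper names `intron_length` and `describe_phase` are hypothetical, and the phase labels follow the usual convention for intron phase rather than anything stated by the database.

```python
from typing import Optional


def intron_length(start: int, end: int) -> int:
    """Length in base pairs, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def describe_phase(phase: Optional[int]) -> str:
    """Readable label for the phase column (None models SQL NULL)."""
    if phase is None:
        return "outside the coding sequence"
    labels = {
        0: "between two codons",
        1: "after the first base of a codon",
        2: "after the second base of a codon",
    }
    return labels[phase]


# First intron listed for transcript 27300298 on this page.
first_length = intron_length(10351543, 10362455)
print(first_length, "|", describe_phase(1))
```

If the assumption about coordinates did not hold for other rows, comparing `intron_length(start, end)` against the stored `length` would expose it.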
32 rows where transcript_id = 27300298
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 151909875 | GT-AG | 0 | 0.000481173163608 | 10913 | rna-XM_031600387.1 27300298 | 1 | 10351543 | 10362455 | Phasianus colchicus 9054 | GCC\|GTAACGGCGC...TAATCTTTCGCT/CTTTCGCTGATC...CGCAG\|TGC | 1 | 1 | 1.876 |
| 151909876 | GT-AG | 0 | 1.000000099473604e-05 | 3422 | rna-XM_031600387.1 27300298 | 2 | 10347963 | 10351384 | Phasianus colchicus 9054 | GAG\|GTTGGTTTGC...CATCTTTTATTC/TCATCTTTTATT...TCTAG\|GCT | 0 | 1 | 4.84 |
| 151909877 | GT-AG | 0 | 3.7040641551669006e-05 | 912 | rna-XM_031600387.1 27300298 | 3 | 10346915 | 10347826 | Phasianus colchicus 9054 | ATG\|GTAAGTTTTT...GTGCTCTTCACT/TTCATTTTCATT...TTCAG\|GCG | 1 | 1 | 7.391 |
| 151909878 | GT-AG | 0 | 0.00131778466396 | 765 | rna-XM_031600387.1 27300298 | 4 | 10346076 | 10346840 | Phasianus colchicus 9054 | AAG\|GTATTATATG...GACTTCTTGAAA/AGATGTCTAATC...TGCAG\|ACA | 0 | 1 | 8.779 |
| 151909879 | GT-AG | 0 | 1.000000099473604e-05 | 477 | rna-XM_031600387.1 27300298 | 5 | 10345410 | 10345886 | Phasianus colchicus 9054 | GAG\|GTACTGCCAA...GCTGTTTTAATC/TTTAATCTGATT...TGCAG\|GTT | 0 | 1 | 12.324 |
| 151909880 | GT-AG | 0 | 1.000000099473604e-05 | 495 | rna-XM_031600387.1 27300298 | 6 | 10344851 | 10345345 | Phasianus colchicus 9054 | AAG\|GTAAGAACAA...CATTTTTTAGTC/TCATTTTTTAGT...TCCAG\|AGC | 1 | 1 | 13.525 |
| 151909881 | GT-AG | 0 | 1.000000099473604e-05 | 1165 | rna-XM_031600387.1 27300298 | 7 | 10343471 | 10344635 | Phasianus colchicus 9054 | ATG\|GTGCGTCATT...ATCTCCATATCT/TCTCTTCTCATC...TGCAG\|GTG | 0 | 1 | 17.558 |
| 151909882 | GT-AG | 0 | 0.0009737417083359 | 1164 | rna-XM_031600387.1 27300298 | 8 | 10342186 | 10343349 | Phasianus colchicus 9054 | AAC\|GTAACTCAAA...GTGGTGTTAATT/GTGGTGTTAATT...TGCAG\|GCT | 1 | 1 | 19.827 |
| 151909883 | GT-AG | 0 | 7.039112821481014e-05 | 1081 | rna-XM_031600387.1 27300298 | 9 | 10340916 | 10341996 | Phasianus colchicus 9054 | TCC\|GTAAGATTCC...TATATCTAAACG/CTATATCTAAAC...TGTAG\|CTT | 1 | 1 | 23.373 |
| 151909884 | GT-AG | 0 | 0.0003282073930713 | 617 | rna-XM_031600387.1 27300298 | 10 | 10340119 | 10340735 | Phasianus colchicus 9054 | AGC\|GTATGTAAAT...CTACCATCAGCA/CAACCACACATT...CACAG\|CCT | 1 | 1 | 26.749 |
| 151909885 | GT-AG | 0 | 0.0019773219034758 | 1021 | rna-XM_031600387.1 27300298 | 11 | 10338988 | 10340008 | Phasianus colchicus 9054 | GTT\|GTACGTGTCT...AAGACCTTACTT/AAAGACCTTACT...CCCAG\|GAG | 0 | 1 | 28.813 |
| 151909886 | GT-AG | 0 | 1.8072507381603385e-05 | 1030 | rna-XM_031600387.1 27300298 | 12 | 10337878 | 10338907 | Phasianus colchicus 9054 | CTT\|GTAAGTTAAC...CTCCCCATACCA/GCTACACTCACG...CCTAG\|GTG | 2 | 1 | 30.313 |
| 151909887 | GT-AG | 0 | 0.0003680879898408 | 1714 | rna-XM_031600387.1 27300298 | 13 | 10336013 | 10337726 | Phasianus colchicus 9054 | TTG\|GTATGTAGTT...TGTAACTTGACT/CTGTTTGTAACT...TGCAG\|GGC | 0 | 1 | 33.146 |
| 151909888 | GT-AG | 0 | 5.358870786713353e-05 | 1419 | rna-XM_031600387.1 27300298 | 14 | 10334408 | 10335826 | Phasianus colchicus 9054 | GAG\|GTATTGGTGA...TGCTTCCTAATA/TGCTTCCTAATA...CATAG\|ACC | 0 | 1 | 36.635 |
| 151909889 | GT-AG | 0 | 1.0297551494974262e-05 | 463 | rna-XM_031600387.1 27300298 | 15 | 10333823 | 10334285 | Phasianus colchicus 9054 | GAG\|GTCTGTCTAT...CTGGCTGTGATA/CTGGCTGTGATA...ACCAG\|GAT | 2 | 1 | 38.923 |
| 151909890 | GT-AG | 0 | 1.000000099473604e-05 | 1017 | rna-XM_031600387.1 27300298 | 16 | 10332685 | 10333701 | Phasianus colchicus 9054 | TCT\|GTGAGTAATG...TTCCTCTTAACG/CTGGTTTTCAGC...CAAAG\|CTT | 0 | 1 | 41.193 |
| 151909891 | GT-AG | 0 | 1.000000099473604e-05 | 1898 | rna-XM_031600387.1 27300298 | 17 | 10330609 | 10332506 | Phasianus colchicus 9054 | TCG\|GTAAGGCCTG...TGGTTTTTAACC/TGGTTTTTAACC...TGCAG\|CAT | 1 | 1 | 44.532 |
| 151909892 | GT-AG | 0 | 1.000000099473604e-05 | 1109 | rna-XM_031600387.1 27300298 | 18 | 10329356 | 10330464 | Phasianus colchicus 9054 | ACC\|GTGAGTACTC...ACAGCTTTGCTT/GGCTGGGTGATG...TCCAG\|CGT | 1 | 1 | 47.233 |
| 151909893 | GT-AG | 0 | 1.000000099473604e-05 | 1010 | rna-XM_031600387.1 27300298 | 19 | 10328114 | 10329123 | Phasianus colchicus 9054 | AAG\|GTGAGGGGAT...ATATTTTTGACT/ATATTTTTGACT...GACAG\|GTG | 2 | 1 | 51.585 |
| 151909894 | GT-AG | 0 | 1.000000099473604e-05 | 780 | rna-XM_031600387.1 27300298 | 20 | 10327176 | 10327955 | Phasianus colchicus 9054 | CAG\|GTAAGGAGCA...ATCCACTTGAAT/TGAATTCTCATT...GACAG\|GCA | 1 | 1 | 54.549 |
| 151909895 | GT-AG | 0 | 1.000000099473604e-05 | 1676 | rna-XM_031600387.1 27300298 | 21 | 10325275 | 10326950 | Phasianus colchicus 9054 | GAA\|GTAAGTGGGC...TTGTCCTTCTCT/GTTTGAATCACC...TTCAG\|TCT | 1 | 1 | 58.769 |
| 151909896 | GT-AG | 0 | 1.000000099473604e-05 | 1196 | rna-XM_031600387.1 27300298 | 22 | 10323858 | 10325053 | Phasianus colchicus 9054 | CAG\|GTAAGGAAAT...TAAATCTTGAAC/CAGTTGTTAAAT...TCCAG\|CTT | 0 | 1 | 62.915 |
| 151909897 | GT-AG | 0 | 4.553372113219954e-05 | 1115 | rna-XM_031600387.1 27300298 | 23 | 10322646 | 10323760 | Phasianus colchicus 9054 | TTT\|GTAAGTATTA...CTTGCTTTCCCT/TTGCATATAACT...TGCAG\|CAT | 1 | 1 | 64.735 |
| 151909898 | GT-AG | 0 | 1.000000099473604e-05 | 473 | rna-XM_031600387.1 27300298 | 24 | 10321797 | 10322269 | Phasianus colchicus 9054 | TCA\|GTAAGGGCTC...TTCTGCATAACT/TTCAGTCTGACT...TGTAG\|ACA | 2 | 1 | 71.788 |
| 151909899 | GT-AG | 0 | 1.000000099473604e-05 | 2996 | rna-XM_031600387.1 27300298 | 25 | 10318640 | 10321635 | Phasianus colchicus 9054 | AAG\|GTAAGGCAGC...CTATTTCTATTG/TTGGTAGTCAAT...TACAG\|AGG | 1 | 1 | 74.808 |
| 151909900 | GT-AG | 0 | 1.929498639013358e-05 | 852 | rna-XM_031600387.1 27300298 | 26 | 10317582 | 10318433 | Phasianus colchicus 9054 | AAG\|GTAGGTGTTG...GAAGTCTTGACA/GTGTATCTCAGA...TGCAG\|ATC | 0 | 1 | 78.672 |
| 151909901 | GT-AG | 0 | 0.0007660834158333 | 1747 | rna-XM_031600387.1 27300298 | 27 | 10315670 | 10317416 | Phasianus colchicus 9054 | CAA\|GTATGTGTAT...CTCACCTTGCCA/CAAGTAGTGAAT...CGTAG\|TAT | 0 | 1 | 81.767 |
| 151909902 | GT-AG | 0 | 1.000000099473604e-05 | 1807 | rna-XM_031600387.1 27300298 | 28 | 10313709 | 10315515 | Phasianus colchicus 9054 | CAG\|GTAAATATGC...TCAAGCTCAATT/CTCAAGCTCAAT...TGCAG\|ATG | 1 | 1 | 84.656 |
| 151909903 | GT-AG | 0 | 1.000000099473604e-05 | 601 | rna-XM_031600387.1 27300298 | 29 | 10312900 | 10313500 | Phasianus colchicus 9054 | CGA\|GTGAGTAAGC...CTATTTTTACAT/TCTATTTTTACA...TTTAG\|CAA | 2 | 1 | 88.557 |
| 151909904 | GT-AG | 0 | 0.0522958633877742 | 223 | rna-XM_031600387.1 27300298 | 30 | 10312538 | 10312760 | Phasianus colchicus 9054 | CAG\|GTATCTTGCT...CAGCCTCCAACT/CAGCCTCCAACT...TCCAG\|GCT | 0 | 1 | 91.165 |
| 151909905 | GT-AG | 0 | 0.0006348863967888 | 1017 | rna-XM_031600387.1 27300298 | 31 | 10311344 | 10312360 | Phasianus colchicus 9054 | AAC\|GTAAGCCTTT...TGCTCCTAAACC/GGTTAATTAACA...TACAG\|GAA | 0 | 1 | 94.485 |
| 151909906 | GT-AG | 0 | 1.000000099473604e-05 | 1210 | rna-XM_031600387.1 27300298 | 32 | 10309980 | 10311189 | Phasianus colchicus 9054 | CAG\|GTGATGTTAT...GTGACTTTACTC/ATTTTGCTTACA...TACAG\|ATT | 1 | 1 | 97.374 |
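The filtered view above corresponds to a simple `WHERE transcript_id = ...` query. The sketch below reproduces it against an in-memory SQLite database, assuming a reduced version of the `introns` table (only six of its columns) loaded with three rows copied from the table above. The real database file and its location are not given on this page, so nothing here connects to it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced, illustrative subset of the introns columns (not the full schema).
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
    """
)

# Three of the 32 rows shown above, copied verbatim.
sample_rows = [
    (151909875, 0, 0.000481173163608, 10913, 27300298, 1),
    (151909876, 0, 1.000000099473604e-05, 3422, 27300298, 2),
    (151909904, 0, 0.0522958633877742, 223, 27300298, 30),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample_rows)

# Same filter as the page, ordered by id like the default view.
rows = conn.execute(
    "SELECT id, ordinal_index, length, score "
    "FROM introns WHERE transcript_id = ? ORDER BY id",
    (27300298,),
).fetchall()

# Highest-scoring intron among the loaded rows.
top = max(rows, key=lambda r: r[3])
print(len(rows), "rows; highest score at ordinal_index", top[1])
```

Run against the full table, the same query would be expected to return all 32 rows listed here.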
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
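To see what the indexes are for, the sketch below loads the table definition and two of the indexes into an empty in-memory SQLite database and asks the query planner how it would run a lookup by `transcript_id`. The referenced `transcripts` and `genomes` tables are not created, which SQLite tolerates at table-creation time. The exact wording of the plan varies between SQLite versions, so the code only checks whether the index name appears in it.

```python
import sqlite3

# Schema as listed above, with only two of the seven indexes for brevity.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would execute the lookup; no data is needed for this.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (27300298,),
).fetchall()

# The human-readable description is the last column of each plan row.
plan_details = [row[-1] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in plan_details)
print(plan_details, uses_index)
```

A lookup on a column without an index, such as `length`, would instead be reported as a full table scan.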