introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
36 rows where transcript_id = 25387373
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 140015065 | GT-AG | 0 | 1.000000099473604e-05 | 12356 | rna-XM_040241761.1 25387373 | 1 | 108097019 | 108109374 | Oryx dammah 59534 | CAG|GTGCGGCCCG...CTCTCTTTTGTT/CTGCTGCTCAGA...CCTAG|TGG | 2 | 1 | 1.001 |
| 140015066 | GT-AG | 0 | 1.000000099473604e-05 | 3363 | rna-XM_040241761.1 25387373 | 2 | 108093588 | 108096950 | Oryx dammah 59534 | GTG|GTAAGTACAC...TCTGTTTTGCTT/CCTGTTGTGACA...TATAG|TGC | 1 | 1 | 2.216 |
| 140015067 | GT-AG | 0 | 1.000000099473604e-05 | 15871 | rna-XM_040241761.1 25387373 | 3 | 108077649 | 108093519 | Oryx dammah 59534 | AAC|GTAAGTAGAG...CTTTCCATAACC/TGTGTGCTTATT...TCTAG|CAA | 0 | 1 | 3.432 |
| 140015068 | GT-AG | 0 | 1.000000099473604e-05 | 3814 | rna-XM_040241761.1 25387373 | 4 | 108073724 | 108077537 | Oryx dammah 59534 | ACC|GTGAGTGCCC...TGTTTCTTTTCT/TCTCTTCTGATC...CGAAG|AAC | 0 | 1 | 5.416 |
| 140015069 | GT-AG | 0 | 1.000000099473604e-05 | 6354 | rna-XM_040241761.1 25387373 | 5 | 108067293 | 108073646 | Oryx dammah 59534 | CTC|GTGAGTACCT...TGCATTTTACGC/GTGCATTTTACG...CACAG|CAA | 2 | 1 | 6.792 |
| 140015070 | GC-AG | 0 | 1.000000099473604e-05 | 3089 | rna-XM_040241761.1 25387373 | 6 | 108064080 | 108067168 | Oryx dammah 59534 | GAG|GCGAGTCTGC...ATGCATTTAATT/ATGCATTTAATT...TCTAG|GAT | 0 | 1 | 9.008 |
| 140015071 | GT-AG | 0 | 1.000000099473604e-05 | 4922 | rna-XM_040241761.1 25387373 | 7 | 108059003 | 108063924 | Oryx dammah 59534 | CAG|GTGGGCTCGC...TAATATTTAAAT/TTAGAATTAATA...TTCAG|ACT | 2 | 1 | 11.778 |
| 140015072 | GT-AG | 0 | 1.000000099473604e-05 | 1123 | rna-XM_040241761.1 25387373 | 8 | 108057717 | 108058839 | Oryx dammah 59534 | GAG|GTGAGGCATG...GGTTCCTTCCAG/CAGGCACTAATG...CTTAG|GCT | 0 | 1 | 14.692 |
| 140015073 | GT-AG | 0 | 0.000372662620894 | 845 | rna-XM_040241761.1 25387373 | 9 | 108056818 | 108057662 | Oryx dammah 59534 | CAG|GTATGTTGAT...TGTTTCTAATCT/CTGTTTCTAATC...TTTAG|GCA | 0 | 1 | 15.657 |
| 140015074 | GT-AG | 0 | 0.0046714672684914 | 2731 | rna-XM_040241761.1 25387373 | 10 | 108053967 | 108056697 | Oryx dammah 59534 | GAG|GTATGTCGGA...TCTTCCTTATCT/GTCTTCCTTATC...CCCAG|ATT | 0 | 1 | 17.802 |
| 140015075 | GT-AG | 0 | 1.000000099473604e-05 | 7203 | rna-XM_040241761.1 25387373 | 11 | 108046671 | 108053873 | Oryx dammah 59534 | CAG|GTAAGTGTTG...TGAGCTGTGACA/TGTTCACTGAGC...TCTAG|GGG | 0 | 1 | 19.464 |
| 140015076 | GT-AG | 0 | 1.000000099473604e-05 | 22531 | rna-XM_040241761.1 25387373 | 12 | 108023921 | 108046451 | Oryx dammah 59534 | CAC|GTGAGTGCTG...GTTACCTTGAGA/GTTACCTTGAGA...GGTAG|GAC | 0 | 1 | 23.378 |
| 140015077 | GT-AG | 0 | 0.0155987026215333 | 1389 | rna-XM_040241761.1 25387373 | 13 | 108022361 | 108023749 | Oryx dammah 59534 | CAG|GTACCACGCT...TTTTCCTTGATG/TGAAAACTCACA...CTTAG|AAA | 0 | 1 | 26.434 |
| 140015078 | GT-AG | 0 | 1.000000099473604e-05 | 9474 | rna-XM_040241761.1 25387373 | 14 | 108012746 | 108022219 | Oryx dammah 59534 | GAG|GTGAGCCCGC...CTCCTCTTAATG/TCTCCTCTTAAT...TTCAG|GAA | 0 | 1 | 28.954 |
| 140015079 | GT-AG | 0 | 1.000000099473604e-05 | 5655 | rna-XM_040241761.1 25387373 | 15 | 108006993 | 108012647 | Oryx dammah 59534 | CAG|GTAAGGGTTT...AGATCTTTTATG/TCTTTTATGAAT...CAAAG|TGT | 2 | 1 | 30.706 |
| 140015080 | GT-AG | 0 | 0.0004797354343244 | 1692 | rna-XM_040241761.1 25387373 | 16 | 108005170 | 108006861 | Oryx dammah 59534 | AAG|GTACTCGCTC...TTTCTCTTTTTT/TTCTTTCTCTTT...TGCAG|AAG | 1 | 1 | 33.047 |
| 140015081 | GT-AG | 0 | 2.0319657066907448e-05 | 2074 | rna-XM_040241761.1 25387373 | 17 | 108002986 | 108005059 | Oryx dammah 59534 | AAG|GTATTGACTT...CTGCTCTGATCT/CCTGCTCTGATC...CCCAG|GTC | 0 | 1 | 35.013 |
| 140015082 | GT-AG | 0 | 1.000000099473604e-05 | 6088 | rna-XM_040241761.1 25387373 | 18 | 107996772 | 108002859 | Oryx dammah 59534 | GAG|GTAAGAAGGA...TGCGTTTTCACA/TGCGTTTTCACA...TATAG|CTT | 0 | 1 | 37.265 |
| 140015083 | GT-AG | 0 | 1.000000099473604e-05 | 1789 | rna-XM_040241761.1 25387373 | 19 | 107994887 | 107996675 | Oryx dammah 59534 | AAG|GTAGGAACAC...GATACTTTGATT/CATTATCTGACC...TGCAG|ACT | 0 | 1 | 38.981 |
| 140015084 | GT-AG | 0 | 2.355082219880009e-05 | 11481 | rna-XM_040241761.1 25387373 | 20 | 107983218 | 107994698 | Oryx dammah 59534 | CAG|GTAGACACGG...GCTTCCTTTCCC/CAGAAACTGAGT...TGGAG|AAA | 2 | 1 | 42.341 |
| 140015085 | GT-AG | 0 | 3.893586198443968e-05 | 5111 | rna-XM_040241761.1 25387373 | 21 | 107977662 | 107982772 | Oryx dammah 59534 | AAA|GTAAGCTCTC...TGGTCATTGTTC/AGGGTGGTCATT...TTCAG|GAT | 0 | 1 | 50.295 |
| 140015086 | GT-AG | 0 | 1.000000099473604e-05 | 6587 | rna-XM_040241761.1 25387373 | 22 | 107970843 | 107977429 | Oryx dammah 59534 | AAG|GTAAGCACCA...CGTGGTTTGATT/CGTGGTTTGATT...TCCAG|GCT | 1 | 1 | 54.441 |
| 140015087 | GT-AG | 0 | 6.496060136379424e-05 | 2003 | rna-XM_040241761.1 25387373 | 23 | 107968278 | 107970280 | Oryx dammah 59534 | CAG|GTACATCCCA...GTTTCCTTTTCC/TCAGTTCTAAAA...ATCAG|GTA | 2 | 1 | 64.486 |
| 140015088 | GT-AG | 0 | 1.000000099473604e-05 | 6339 | rna-XM_040241761.1 25387373 | 24 | 107961706 | 107968044 | Oryx dammah 59534 | AGG|GTGAGTGAGG...TTTTCCTTCTCG/TCTCTTCTAATG...TCCAG|GAA | 1 | 1 | 68.651 |
| 140015089 | GT-AG | 0 | 1.000000099473604e-05 | 2529 | rna-XM_040241761.1 25387373 | 25 | 107959128 | 107961656 | Oryx dammah 59534 | GAG|GTGAGTGTGC...GGGGTCTTAGTG/TTCTCTTTCAAT...TCAAG|GAC | 2 | 1 | 69.526 |
| 140015090 | GT-AG | 0 | 0.0001195836979599 | 3973 | rna-XM_040241761.1 25387373 | 26 | 107954982 | 107958954 | Oryx dammah 59534 | AAG|GTAAGCTGTG...CAGACTTTAAAT/TTTAAATTAAAA...TGTAG|ACT | 1 | 1 | 72.618 |
| 140015091 | GT-AG | 0 | 1.000000099473604e-05 | 6154 | rna-XM_040241761.1 25387373 | 27 | 107948708 | 107954861 | Oryx dammah 59534 | ACG|GTAGGGCGAA...CCATCCATAAAT/TAAATTGTGACT...TATAG|GTT | 1 | 1 | 74.763 |
| 140015092 | GT-AG | 0 | 1.000000099473604e-05 | 2440 | rna-XM_040241761.1 25387373 | 28 | 107946151 | 107948590 | Oryx dammah 59534 | GAG|GTAAAAACTA...ATATGCGTAATG/ATGAGGTTGAGA...TGCAG|ATA | 1 | 1 | 76.854 |
| 140015093 | GT-AG | 0 | 1.000000099473604e-05 | 882 | rna-XM_040241761.1 25387373 | 29 | 107945079 | 107945960 | Oryx dammah 59534 | CAG|GTAGGGCCAG...CTCCCCCTGCCG/GTGAGACTAACC...TCCAG|GCT | 2 | 1 | 80.25 |
| 140015094 | GT-AG | 0 | 1.000000099473604e-05 | 1051 | rna-XM_040241761.1 25387373 | 30 | 107943906 | 107944956 | Oryx dammah 59534 | AAG|GTACTGGGCA...TTTCCCTTTGCT/CCTTTGCTCATG...CGCAG|AGG | 1 | 1 | 82.431 |
| 140015095 | GT-AG | 0 | 0.0007231875630709 | 2472 | rna-XM_040241761.1 25387373 | 31 | 107941185 | 107943656 | Oryx dammah 59534 | CCG|GTACTGTGCT...TTTCCCTTATTC/ATTTCCCTTATT...TGCAG|ATG | 1 | 1 | 86.881 |
| 140015096 | GT-AG | 0 | 1.000000099473604e-05 | 1115 | rna-XM_040241761.1 25387373 | 32 | 107939992 | 107941106 | Oryx dammah 59534 | CAG|GTACAAAGCT...GAGATCTCACCT/GGAGATCTCACC...TGCAG|CTT | 1 | 1 | 88.275 |
| 140015097 | GT-AG | 0 | 1.000000099473604e-05 | 2286 | rna-XM_040241761.1 25387373 | 33 | 107937440 | 107939725 | Oryx dammah 59534 | AAG|GTGACTTCAG...TGATCTTTCTCC/CTGCATCTAACA...CGCAG|GGT | 0 | 1 | 93.029 |
| 140015098 | GT-AG | 0 | 1.000000099473604e-05 | 9384 | rna-XM_040241761.1 25387373 | 34 | 107927912 | 107937295 | Oryx dammah 59534 | CAG|GTGGGTGGCG...TTTTTCTAAATC/TTTTTTCTAAAT...TTTAG|ACG | 0 | 1 | 95.603 |
| 140015099 | GT-AG | 0 | 1.000000099473604e-05 | 2772 | rna-XM_040241761.1 25387373 | 35 | 107925013 | 107927784 | Oryx dammah 59534 | AGT|GTGAGTGGCG...TCTTTCTTCTCT/CCATGTCTGATC...CTTAG|TGG | 1 | 1 | 97.873 |
| 140015100 | GT-AG | 0 | 1.000000099473604e-05 | 414 | rna-XM_040241761.1 25387373 | 36 | 107924552 | 107924965 | Oryx dammah 59534 | GAG|GTAATGTCGC...AAATTCTCATTT/AAAATTCTCATT...CTCAG|GCC | 0 | 1 | 98.713 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);