introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
26 rows where transcript_id = 34243593
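The filter behind this view can be reproduced against a local copy of the data. The following is a minimal sketch, assuming the table is available as a SQLite database; here an in-memory table with only a subset of the columns stands in for it, and the helper name `introns_for_transcript` is hypothetical. The first two rows are copied from the table below; the third is an invented row from another transcript, included only to show that the filter excludes it.

```python
import sqlite3

# In-memory stand-in for the real database; only a subset of columns is created.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "transcript_id" INTEGER, '
    '"ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER)'
)
rows = [
    (191722248, 34243593, 1, 336792, 354736),
    (191722249, 34243593, 2, 354825, 355981),
    (1, 99, 1, 10, 20),  # hypothetical row belonging to a different transcript
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", rows)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, start, end) for one transcript, ordered by id."""
    query = (
        'SELECT "id", "ordinal_index", "start", "end" FROM introns '
        'WHERE "transcript_id" = ? ORDER BY "id"'
    )
    return conn.execute(query, (transcript_id,)).fetchall()


result = introns_for_transcript(conn, 34243593)
for row in result:
    print(row)
```

Passing the transcript id as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with other ids.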
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 191722248 | GT-AG | 0 | 1.000000099473604e-05 | 17945 | rna-XM_009671306.1 34243593 | 1 | 336792 | 354736 | Struthio camelus 8801 | AAG\|GTGAGCAGGA...GTGTCTTTCTCA/GTCTTTCTCATG...TGCAG\|GTC | 0 | 1 | 7.1 |
| 191722249 | GT-AG | 0 | 1.000000099473604e-05 | 1157 | rna-XM_009671306.1 34243593 | 2 | 354825 | 355981 | Struthio camelus 8801 | TAT\|GTAAGTGCTT...TCCACCTTCCTC/ATGTGACCAACC...CCCAG\|GCA | 1 | 1 | 9.609 |
| 191722250 | GT-AG | 0 | 1.000000099473604e-05 | 4948 | rna-XM_009671306.1 34243593 | 3 | 356166 | 361113 | Struthio camelus 8801 | TGG\|GTAAGTGAGA...TCTTTCTCAATC/TTCTTTCTCAAT...TGCAG\|CAC | 2 | 1 | 14.856 |
| 191722251 | GT-AG | 0 | 1.000000099473604e-05 | 9066 | rna-XM_009671306.1 34243593 | 4 | 361176 | 370241 | Struthio camelus 8801 | ATG\|GTAAGTGCTT...GTTGCTCTAACA/GTTGCTCTAACA...TCAAG\|ATC | 1 | 1 | 16.624 |
| 191722252 | GT-AG | 0 | 1.000000099473604e-05 | 592 | rna-XM_009671306.1 34243593 | 5 | 370543 | 371134 | Struthio camelus 8801 | AGA\|GTAAGGCCTT...TCAGCCTTTGCT/TGCTAGCTCAGC...TGCAG\|CCT | 2 | 1 | 25.207 |
| 191722253 | GT-AG | 0 | 0.0001635327749848 | 351 | rna-XM_009671306.1 34243593 | 6 | 371308 | 371658 | Struthio camelus 8801 | CAG\|GTATGTGCTT...ATTTCCTTCTCA/AGTGATTTAAGT...CGCAG\|GCC | 1 | 1 | 30.14 |
| 191722254 | GT-AG | 0 | 1.000000099473604e-05 | 9857 | rna-XM_009671306.1 34243593 | 8 | 371773 | 381629 | Struthio camelus 8801 | ACT\|GTGAGTCATT...AGGGCTTTGGTC/TTTGGTCTAATG...CACAG\|CCC | 2 | 1 | 33.333 |
| 191722255 | GC-AG | 0 | 1.000000099473604e-05 | 893 | rna-XM_009671306.1 34243593 | 9 | 381826 | 382718 | Struthio camelus 8801 | AAG\|GCAAGCAGGA...AAGGTCCTGACT/AAGGTCCTGACT...TCCAG\|CTA | 0 | 1 | 38.922 |
| 191722256 | GT-AG | 0 | 1.000000099473604e-05 | 303 | rna-XM_009671306.1 34243593 | 10 | 382796 | 383098 | Struthio camelus 8801 | CAC\|GTGAGTATTA...GTGTCCTCACTC/AGTGTCCTCACT...CCCAG\|GGT | 2 | 1 | 41.118 |
| 191722257 | GT-AG | 0 | 1.000000099473604e-05 | 353 | rna-XM_009671306.1 34243593 | 11 | 383210 | 383562 | Struthio camelus 8801 | GAG\|GTGAGGAGTG...GCATCCTGATCT/GGCATCCTGATC...GTCAG\|GTA | 2 | 1 | 44.283 |
| 191722258 | GT-AG | 0 | 1.000000099473604e-05 | 1820 | rna-XM_009671306.1 34243593 | 12 | 383654 | 385473 | Struthio camelus 8801 | AGA\|GTAAGTCACC...GTTTTGTTACTG/CCTGTTTTCATG...TACAG\|GCC | 0 | 1 | 46.878 |
| 191722259 | GT-AG | 0 | 1.000000099473604e-05 | 720 | rna-XM_009671306.1 34243593 | 13 | 385675 | 386394 | Struthio camelus 8801 | GAG\|GTAAGAGCTG...CAACTCTCAATT/TCTCAATTTATG...CCCAG\|GTG | 0 | 1 | 52.609 |
| 191722260 | GT-AG | 0 | 1.000000099473604e-05 | 681 | rna-XM_009671306.1 34243593 | 14 | 386504 | 387184 | Struthio camelus 8801 | CAG\|GTAGGGCCCC...CCTGCCTAGACT/ACTACTGTGACC...CCCAG\|GTG | 1 | 1 | 55.717 |
| 191722261 | GT-AG | 0 | 1.000000099473604e-05 | 1568 | rna-XM_009671306.1 34243593 | 15 | 387295 | 388862 | Struthio camelus 8801 | AAG\|GTCAGTGGCA...GGCCCCTCAGAC/TCAGACTTCAGA...GGCAG\|GAG | 0 | 1 | 58.854 |
| 191722262 | GT-AG | 0 | 1.000000099473604e-05 | 568 | rna-XM_009671306.1 34243593 | 16 | 389024 | 389591 | Struthio camelus 8801 | CAG\|GTACAGACAT...TCCTCCTTTCTC/CTCAGCATAATT...TGCAG\|CCA | 2 | 1 | 63.445 |
| 191722263 | GT-AG | 0 | 1.000000099473604e-05 | 329 | rna-XM_009671306.1 34243593 | 17 | 389665 | 389993 | Struthio camelus 8801 | AAG\|GTAAGGAACC...GCACCCTGAGCA/GGCACCCTGAGC...TGCAG\|TCA | 0 | 1 | 65.526 |
| 191722264 | GT-AG | 0 | 1.000000099473604e-05 | 903 | rna-XM_009671306.1 34243593 | 18 | 390083 | 390985 | Struthio camelus 8801 | CAG\|GTAAGACTGC...CAGCCTATAATT/CAGCCTATAATT...TCCAG\|TGT | 2 | 1 | 68.064 |
| 191722265 | GT-AG | 0 | 1.000000099473604e-05 | 674 | rna-XM_009671306.1 34243593 | 19 | 391055 | 391728 | Struthio camelus 8801 | TGG\|GTAGGTAGCT...TCTCCCTTCACC/TCTCCCTTCACC...TGTAG\|CTC | 2 | 1 | 70.031 |
| 191722266 | GT-AG | 0 | 0.0005590148270323 | 3399 | rna-XM_009671306.1 34243593 | 20 | 391810 | 395208 | Struthio camelus 8801 | ACG\|GTATGTTACT...GAGGTTTTCACA/GAGGTTTTCACA...TTTAG\|GTC | 2 | 1 | 72.341 |
| 191722267 | GT-AG | 0 | 1.000000099473604e-05 | 233 | rna-XM_009671306.1 34243593 | 21 | 395295 | 395527 | Struthio camelus 8801 | AAG\|GTAGGTCACA...CCTCCCTTGGCT/CCTTGGCTTACT...TGCAG\|TCA | 1 | 1 | 74.793 |
| 191722268 | GT-AG | 0 | 1.000000099473604e-05 | 881 | rna-XM_009671306.1 34243593 | 22 | 395690 | 396570 | Struthio camelus 8801 | CAG\|GTAAGTACCA...CTCTCTTTAAGT/CTTTAAGTCACC...CACAG\|ACC | 1 | 1 | 79.413 |
| 191722269 | GT-AG | 0 | 1.000000099473604e-05 | 1083 | rna-XM_009671306.1 34243593 | 23 | 396865 | 397947 | Struthio camelus 8801 | ATG\|GTAGGTGGGC...CTCCTCTTAAAT/GTCTGTCTCACA...CACAG\|GGC | 1 | 1 | 87.796 |
| 191722270 | GT-AG | 0 | 1.000000099473604e-05 | 384 | rna-XM_009671306.1 34243593 | 24 | 398036 | 398419 | Struthio camelus 8801 | CAG\|GTGAGCGAGG...TGCTCTCTCTCT/CAAAGTCTAATG...CTCAG\|GAC | 2 | 1 | 90.305 |
| 191722271 | GT-AG | 0 | 1.000000099473604e-05 | 501 | rna-XM_009671306.1 34243593 | 25 | 398495 | 398995 | Struthio camelus 8801 | GAA\|GTGAGTGAAT...CCTTCCATGATC/CATGATCTCACT...TCTAG\|CAC | 2 | 1 | 92.444 |
| 191722272 | GT-AG | 0 | 1.000000099473604e-05 | 1121 | rna-XM_009671306.1 34243593 | 26 | 399099 | 400219 | Struthio camelus 8801 | CAG\|GTAGGGAGGT...GGTCCCCTAATG/ACTAGTCTCATC...CCCAG\|GAA | 0 | 1 | 95.381 |
| 191722273 | GT-AG | 0 | 1.000000099473604e-05 | 410 | rna-XM_009671306.1 34243593 | 27 | 400295 | 400704 | Struthio camelus 8801 | CCT\|GTGAGTACAA...TTTGCTTTATCA/TTTTGCTTTATC...CATAG\|GGT | 0 | 1 | 97.519 |
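The rows above can be sanity-checked with a short script. The sketch below uses the first eight rows of the table and two hypothetical helpers, `length_mismatches` and `missing_ordinals`. It assumes that start and end are inclusive coordinates, so that length = end - start + 1; that convention is inferred from the values shown here, not stated in the column descriptions.

```python
# (ordinal_index, start, end, length) copied from the first eight rows of the table.
rows = [
    (1, 336792, 354736, 17945),
    (2, 354825, 355981, 1157),
    (3, 356166, 361113, 4948),
    (4, 361176, 370241, 9066),
    (5, 370543, 371134, 592),
    (6, 371308, 371658, 351),
    (8, 371773, 381629, 9857),
    (9, 381826, 382718, 893),
]


def length_mismatches(rows):
    """Ordinals whose length differs from end - start + 1 (inclusive coordinates assumed)."""
    return [ordinal for ordinal, start, end, length in rows if end - start + 1 != length]


def missing_ordinals(rows):
    """Ordinal positions absent between the smallest and largest ordinal_index seen."""
    seen = {ordinal for ordinal, *_ in rows}
    return [i for i in range(min(seen), max(seen) + 1) if i not in seen]


print("length mismatches:", length_mismatches(rows))
print("missing ordinals:", missing_ordinals(rows))
```

For these rows every length agrees with the inclusive-coordinate assumption, and the only gap is ordinal 7: the ordinal_index values in this view jump from 6 to 8, which is why 26 rows end at ordinal 27.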
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
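To see how one of these indexes serves a filter such as transcript_id = 34243593, SQLite's EXPLAIN QUERY PLAN can be run against the schema. The following is a minimal sketch using an empty in-memory database with a trimmed copy of the table (three columns and one index only, foreign keys omitted); the planner's choice on the real, populated database may differ.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Trimmed copy of the schema above: three columns and a single index.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "transcript_id" INTEGER,
      "taxonomy_id" INTEGER,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
      ON [introns] ([transcript_id]);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (34243593,),
).fetchall()

# Each plan row is (id, parent, notused, detail); the detail text names the index used.
plan_details = [row[3] for row in plan]
for detail in plan_details:
    print(detail)
```

The exact wording of the detail text varies between SQLite versions, so it is safer to look for the index name in it than to match the whole string.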