introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability, expressed as a percentage (0-100), that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
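As an illustration of how two of these columns relate, the sketch below extracts the terminal dinucleotides from a scored_motifs value and formats them like dinucleotide_pair. The layout it assumes (upstream flank, then the intron motifs, then downstream flank, separated by `|`) is inferred from the rows on this page and is not documented here; the function name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the intron's first and last two bases, e.g. 'GT-AG'.

    ASSUMPTION (inferred from the rows on this page, not from documentation):
    the value has the shape  <flank>|<intron motifs>|<flank>  and the middle
    segment begins at the intron's first base and ends at its last base.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected motif layout: {scored_motifs!r}")
    intron_part = parts[1]
    # First two and last two characters of the intron segment.
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# scored_motifs value copied from the first row (id 182504549).
example = "CTG|GTAAGTCTTT...TTTGCATTATCA/TATCATTTAATT...TATAG|GTA"
print(terminal_dinucleotides(example))  # prints GT-AG
```

For that row the result agrees with the stored dinucleotide_pair value, GT-AG.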
29 rows where transcript_id = 32672031
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182504549 | GT-AG | 0 | 1.000000099473604e-05 | 1731 | rna-XM_009085910.3 32672031 | 1 | 20529672 | 20531402 | Serinus canaria 9135 | CTG\|GTAAGTCTTT...TTTGCATTATCA/TATCATTTAATT...TATAG\|GTA | 1 | 1 | 1.406 |
| 182504550 | GT-AG | 0 | 1.000000099473604e-05 | 1239 | rna-XM_009085910.3 32672031 | 2 | 20528034 | 20529272 | Serinus canaria 9135 | AGG\|GTGAGTACTG...TGAGTATTGATC/TGAGTATTGATC...CATAG\|ATA | 1 | 1 | 10.604 |
| 182504551 | GT-AG | 0 | 0.0024405790728289 | 892 | rna-XM_009085910.3 32672031 | 3 | 20526968 | 20527859 | Serinus canaria 9135 | AAG\|GTAACTTATT...ATTATCTTGTCA/GCTGGTATCATC...TATAG\|ACA | 1 | 1 | 14.615 |
| 182504552 | GT-AG | 0 | 1.000000099473604e-05 | 300 | rna-XM_009085910.3 32672031 | 4 | 20526503 | 20526802 | Serinus canaria 9135 | TAG\|GTAAAGTAAA...TTTTTTTTGGCC/ACATGACTGATT...TTTAG\|GGC | 1 | 1 | 18.419 |
| 182504553 | GT-AG | 0 | 1.000000099473604e-05 | 238 | rna-XM_009085910.3 32672031 | 5 | 20526148 | 20526385 | Serinus canaria 9135 | CAG\|GTAAGTGCAC...TCACTGTTAACA/AAAATTTTCACT...TAAAG\|GAA | 1 | 1 | 21.116 |
| 182504554 | GT-AG | 0 | 0.0003755309901913 | 1390 | rna-XM_009085910.3 32672031 | 6 | 20524614 | 20526003 | Serinus canaria 9135 | CAG\|GTACACTGAT...AACTTCTCAATA/TCAATATTAACA...TTTAG\|GTG | 1 | 1 | 24.435 |
| 182504555 | GT-AG | 0 | 1.000000099473604e-05 | 190 | rna-XM_009085910.3 32672031 | 7 | 20524241 | 20524430 | Serinus canaria 9135 | ACA\|GTGAGTATTC...ATATCCATACCT/CTAAAATTAACT...CCCAG\|AGC | 1 | 1 | 28.654 |
| 182504556 | GT-AG | 0 | 1.000000099473604e-05 | 791 | rna-XM_009085910.3 32672031 | 8 | 20523295 | 20524085 | Serinus canaria 9135 | GAG\|GTAAAAATCC...TTCTTCTTACAA/TTTCTTCTTACA...CTTAG\|GAT | 0 | 1 | 32.227 |
| 182504557 | GT-AG | 0 | 1.000000099473604e-05 | 685 | rna-XM_009085910.3 32672031 | 9 | 20522496 | 20523180 | Serinus canaria 9135 | AAA\|GTAAGTGTAA...TGCTTCTGCACT/TGCAATCTGACA...GACAG\|GGC | 0 | 1 | 34.855 |
| 182504558 | GT-AG | 0 | 1.000000099473604e-05 | 800 | rna-XM_009085910.3 32672031 | 10 | 20521580 | 20522379 | Serinus canaria 9135 | CAG\|GTATGGAAAC...TGGTGCTTACTC/TTGGTGCTTACT...CACAG\|ATA | 2 | 1 | 37.529 |
| 182504559 | GT-AG | 0 | 1.000000099473604e-05 | 426 | rna-XM_009085910.3 32672031 | 11 | 20521005 | 20521430 | Serinus canaria 9135 | CTG\|GTAAGTGCCA...TAAGATTTAACA/TAAGATTTAACA...TGAAG\|GAG | 1 | 1 | 40.964 |
| 182504560 | GT-AG | 0 | 2.1480334306777395e-05 | 381 | rna-XM_009085910.3 32672031 | 12 | 20520421 | 20520801 | Serinus canaria 9135 | AAA\|GTAGGTGTTT...ATATCTGTAATA/CAGTATTTAATT...TGTAG\|ATT | 0 | 1 | 45.643 |
| 182504561 | GT-AG | 0 | 0.0002725109863475 | 752 | rna-XM_009085910.3 32672031 | 13 | 20519547 | 20520298 | Serinus canaria 9135 | ACT\|GTAAGTATTT...TTCTCTCTAATT/GTGTGTCTGATT...TTCAG\|AGA | 2 | 1 | 48.456 |
| 182504562 | GT-AG | 0 | 1.000000099473604e-05 | 681 | rna-XM_009085910.3 32672031 | 14 | 20518778 | 20519458 | Serinus canaria 9135 | CCA\|GTGAGTAAAT...TTACTTTTAAAA/TGATTGTTTACT...TACAG\|GTA | 0 | 1 | 50.484 |
| 182504563 | GT-AG | 0 | 1.000000099473604e-05 | 457 | rna-XM_009085910.3 32672031 | 15 | 20518176 | 20518632 | Serinus canaria 9135 | AAG\|GTAAGAGAAC...CAAATATTGATT/CAAATATTGATT...TGCAG\|GAA | 1 | 1 | 53.827 |
| 182504564 | GT-AG | 0 | 4.4832011529388176e-05 | 88 | rna-XM_009085910.3 32672031 | 16 | 20518052 | 20518139 | Serinus canaria 9135 | TCA\|GTAAGTCTTG...AATGTTTTCATT/AATGTTTTCATT...TGCAG\|ATT | 1 | 1 | 54.657 |
| 182504565 | GT-AG | 0 | 1.000000099473604e-05 | 868 | rna-XM_009085910.3 32672031 | 17 | 20517014 | 20517881 | Serinus canaria 9135 | TAT\|GTAAGAAACA...TCTGTTTTATTA/GTCTGTTTTATT...TTTAG\|ATT | 0 | 1 | 58.575 |
| 182504566 | GT-AG | 0 | 1.000000099473604e-05 | 1652 | rna-XM_009085910.3 32672031 | 18 | 20515294 | 20516945 | Serinus canaria 9135 | CCG\|GTAAGCAAAA...ATTCCCTCAAAT/TATTAACTAACT...TTCAG\|GTG | 2 | 1 | 60.143 |
| 182504567 | GT-AG | 0 | 1.000000099473604e-05 | 886 | rna-XM_009085910.3 32672031 | 19 | 20514307 | 20515192 | Serinus canaria 9135 | CAG\|GTGGGTGCTT...TTTGTTATAATT/TTTGTTATAATT...TTCAG\|GTA | 1 | 1 | 62.471 |
| 182504568 | GT-AG | 0 | 1.000000099473604e-05 | 937 | rna-XM_009085910.3 32672031 | 20 | 20513227 | 20514163 | Serinus canaria 9135 | AAG\|GTGCAGTATG...AGCATCTAGATT/TCTAGATTCAGG...TTCAG\|TGT | 0 | 1 | 65.768 |
| 182504569 | GT-AG | 0 | 1.000000099473604e-05 | 1242 | rna-XM_009085910.3 32672031 | 21 | 20511870 | 20513111 | Serinus canaria 9135 | AAG\|GTAAAACATA...CCTTCCTTACAC/TCCTTCCTTACA...TTCAG\|CTT | 1 | 1 | 68.419 |
| 182504570 | GT-AG | 0 | 0.0002867740948861 | 1574 | rna-XM_009085910.3 32672031 | 22 | 20510147 | 20511720 | Serinus canaria 9135 | ATG\|GTAAGCATCA...TTTTTCTTAATC/CTTAATCTCACT...GGTAG\|GAT | 0 | 1 | 71.853 |
| 182504571 | GT-AG | 0 | 1.000000099473604e-05 | 752 | rna-XM_009085910.3 32672031 | 23 | 20509292 | 20510043 | Serinus canaria 9135 | CTG\|GTGAGTTTTC...TGTTCTTTTGCT/ATAAGTCTGAGC...TACAG\|ACC | 1 | 1 | 74.228 |
| 182504572 | GT-AG | 0 | 1.000000099473604e-05 | 1462 | rna-XM_009085910.3 32672031 | 24 | 20507597 | 20509058 | Serinus canaria 9135 | ATA\|GTGAGTCCAA...ATAATCATAATT/ATAATTGTGATT...TTTAG\|ACT | 0 | 1 | 79.599 |
| 182504573 | GT-AG | 0 | 2.8098292072285077e-05 | 996 | rna-XM_009085910.3 32672031 | 25 | 20506435 | 20507430 | Serinus canaria 9135 | ATG\|GTAAGTTCCA...TTTCCCTTAGGT/TAGGTTTTAAAA...AATAG\|AGT | 1 | 1 | 83.426 |
| 182504574 | GT-AG | 0 | 1.000000099473604e-05 | 274 | rna-XM_009085910.3 32672031 | 26 | 20506008 | 20506281 | Serinus canaria 9135 | TGG\|GTAAGTGCCC...TAATCTTTAATT/CTTTAATTGATA...TTTAG\|GTG | 1 | 1 | 86.953 |
| 182504575 | GT-AG | 0 | 1.000000099473604e-05 | 1432 | rna-XM_009085910.3 32672031 | 27 | 20504459 | 20505890 | Serinus canaria 9135 | ATG\|GTAATTATTT...ACTTCATTCACT/CATTCACTCACT...TTTAG\|GAG | 1 | 1 | 89.65 |
| 182504576 | GT-AG | 0 | 0.0034628543459176 | 1940 | rna-XM_009085910.3 32672031 | 28 | 20502351 | 20504290 | Serinus canaria 9135 | AAA\|GTAAGTTTTG...TATTTCTTAATT/TATTTCTTAATT...TGAAG\|TTG | 1 | 1 | 93.522 |
| 182504577 | GT-AG | 0 | 1.000000099473604e-05 | 403 | rna-XM_009085910.3 32672031 | 29 | 20501906 | 20502308 | Serinus canaria 9135 | ATG\|GTAAGTTGGT...TGTGTTTTGTCT/TGGGCATTTAGC...GACAG\|CAT | 1 | 1 | 94.491 |
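The coordinates in the rows above appear to be inclusive on both ends, so that length = end - start + 1. The following sketch checks that relation for the first three rows; the helper name `inclusive_length` is hypothetical, and the inclusive-coordinate reading is an inference from the displayed values rather than a documented convention.

```python
# (ordinal_index, start, end, length) copied from the first three rows.
rows = [
    (1, 20529672, 20531402, 1731),
    (2, 20528034, 20529272, 1239),
    (3, 20526968, 20527859, 892),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included."""
    return end - start + 1


# Collect any row whose stored length disagrees with the computed one.
mismatches = [
    (ordinal, length, inclusive_length(start, end))
    for ordinal, start, end, length in rows
    if inclusive_length(start, end) != length
]
print(mismatches)  # prints []

# Start positions fall as ordinal_index rises in these rows, which would be
# consistent with a transcript on the reverse strand (an interpretation,
# not something the page states).
starts = [start for _, start, _, _ in rows]
print(starts == sorted(starts, reverse=True))  # prints True
```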
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
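A minimal sketch of how the transcript_id index supports the filter shown on this page (transcript_id = 32672031), using Python's built-in sqlite3 module with an in-memory database. The table here is a trimmed copy of the schema with only a few columns, the first two inserted rows are copied from the table above, and the third row is made up purely so the filter has something to exclude.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Trimmed version of the schema above: a subset of columns, no foreign keys.
conn.executescript(
    """
    CREATE TABLE "introns" (
        "id" INTEGER PRIMARY KEY,
        "dinucleotide_pair" TEXT,
        "length" INTEGER,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        "start" INTEGER,
        "end" INTEGER
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)",
    [
        (182504549, "GT-AG", 1731, 32672031, 1, 20529672, 20531402),
        (182504550, "GT-AG", 1239, 32672031, 2, 20528034, 20529272),
        # Hypothetical row for a different transcript (not real data).
        (1, "GT-AG", 100, 999, 1, 1, 100),
    ],
)

# Same filter as the page: all introns of one transcript, in order.
rows = conn.execute(
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (32672031,),
).fetchall()
print(rows)

# Ask SQLite how it would run the filter; the last field of each plan row
# is a human-readable description that names any index used.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (32672031,),
).fetchall()
plan_text = " ".join(str(step[-1]) for step in plan)
print(plan_text)
```

The exact wording of the plan text varies between SQLite versions, so it is best read for the index name rather than matched literally.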