introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
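As a rough illustration of how these columns relate to one another, the sketch below models a subset of them in Python. The `Intron` class and its method names are hypothetical helpers, not part of the database. The relation `length = end - start + 1` is an assumption (inclusive coordinates) that happens to hold for the first row shown on this page; it is not something the column descriptions state.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Intron:
    """Subset of the introns columns (hypothetical helper, not part of the database)."""
    id: int
    dinucleotide_pair: str
    is_minor: int
    score: float
    length: int
    start: int
    end: int
    phase: Optional[int]  # None stands in for SQL NULL (intron outside the CDS)
    in_cds: int

    def span(self) -> int:
        # Assumption: start and end are both inclusive genome coordinates.
        return self.end - self.start + 1

    def terminal_dinucleotides(self) -> Tuple[str, str]:
        # "GT-AG" -> ("GT", "AG"): 5' end first, 3' end second.
        five_prime, three_prime = self.dinucleotide_pair.split("-")
        return five_prime, three_prime


# Values copied from the first row of the result table on this page.
first = Intron(
    id=101756222,
    dinucleotide_pair="GT-AG",
    is_minor=0,
    score=1.000000099473604e-05,
    length=15947,
    start=33678511,
    end=33694457,
    phase=2,
    in_cds=1,
)

print(first.span() == first.length)    # True
print(first.terminal_dinucleotides())  # ('GT', 'AG')
```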
35 rows where transcript_id = 19079898
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 101756222 | GT-AG | 0 | 1.000000099473604e-05 | 15947 | rna-XM_042872593.1 19079898 | 1 | 33678511 | 33694457 | Lagopus leucura 30410 | GAG\|GTGAGTGTGG...CATTTCTTTTCT/CGACTACTGAAT...CTTAG\|GCT | 2 | 1 | 3.839 |
| 101756223 | GT-AG | 0 | 1.000000099473604e-05 | 4052 | rna-XM_042872593.1 19079898 | 2 | 33694565 | 33698616 | Lagopus leucura 30410 | AAG\|GTAAAAGCAA...TTACTTTTGATT/ATCCTTCTCATT...ACTAG\|ATG | 1 | 1 | 6.488 |
| 101756224 | GT-AG | 0 | 8.722276489371117e-05 | 471 | rna-XM_042872593.1 19079898 | 3 | 33698718 | 33699188 | Lagopus leucura 30410 | GAG\|GTAATCATGT...AGCTTTTTGAAA/TGTAGGCTTACA...TTCAG\|CTC | 0 | 1 | 8.99 |
| 101756225 | GT-AG | 0 | 1.1556695537179956e-05 | 475 | rna-XM_042872593.1 19079898 | 4 | 33699411 | 33699885 | Lagopus leucura 30410 | CAG\|GTATTAAAAA...TCTGTTTTATTC/ATTCTTTTCATC...GTTAG\|GTA | 0 | 1 | 14.487 |
| 101756226 | GT-AG | 0 | 1.000000099473604e-05 | 1332 | rna-XM_042872593.1 19079898 | 5 | 33700009 | 33701340 | Lagopus leucura 30410 | GAA\|GTAAGTCAGT...GAAAACTGAGCT/CGTGAGCTGATG...TTCAG\|GTG | 0 | 1 | 17.533 |
| 101756227 | GT-AG | 0 | 1.000000099473604e-05 | 3108 | rna-XM_042872593.1 19079898 | 6 | 33701422 | 33704529 | Lagopus leucura 30410 | AAG\|GTAGGTTAAA...TTTGTTTTACAT/GTTTGTTTTACA...AACAG\|ATC | 0 | 1 | 19.539 |
| 101756228 | GT-AG | 0 | 1.9709820419195664e-05 | 3722 | rna-XM_042872593.1 19079898 | 7 | 33704695 | 33708416 | Lagopus leucura 30410 | CAG\|GTATGGGAAA...TTAACCTTAATA/GCTTATTTTACT...CACAG\|CTT | 0 | 1 | 23.626 |
| 101756229 | GT-AG | 0 | 1.000000099473604e-05 | 1305 | rna-XM_042872593.1 19079898 | 8 | 33708521 | 33709825 | Lagopus leucura 30410 | TGG\|GTAAGATACA...AGCAGCTTGAAG/ATTGGGATAATA...TTTAG\|GGG | 2 | 1 | 26.201 |
| 101756230 | GT-AG | 0 | 1.000000099473604e-05 | 2257 | rna-XM_042872593.1 19079898 | 9 | 33709873 | 33712129 | Lagopus leucura 30410 | CAG\|GTGAGGATAG...AAGTTGTTAGTG/AAAACTCTGATT...CTTAG\|GTT | 1 | 1 | 27.365 |
| 101756231 | GT-AG | 0 | 2.62814665379387e-05 | 146 | rna-XM_042872593.1 19079898 | 10 | 33712185 | 33712330 | Lagopus leucura 30410 | AAA\|GTAAGTTAAA...TGTTTCTTATAT/TTGTTTCTTATA...TTTAG\|GTG | 2 | 1 | 28.727 |
| 101756232 | GT-AG | 0 | 8.339157859488216e-05 | 637 | rna-XM_042872593.1 19079898 | 11 | 33712416 | 33713052 | Lagopus leucura 30410 | AAA\|GTAAATATTC...CTATCTTTATAC/CCTATCTTTATA...GAAAG\|GTT | 0 | 1 | 30.832 |
| 101756233 | GT-AG | 0 | 1.1800066794993274e-05 | 542 | rna-XM_042872593.1 19079898 | 12 | 33713111 | 33713652 | Lagopus leucura 30410 | ATG\|GTAAGTCTTC...GACTACTTAATG/TGACTACTTAAT...TGCAG\|TGA | 1 | 1 | 32.268 |
| 101756234 | GT-AG | 0 | 1.000000099473604e-05 | 832 | rna-XM_042872593.1 19079898 | 13 | 33713758 | 33714589 | Lagopus leucura 30410 | GAG\|GTAAGGTGGC...CAGTTCATAAAC/AAACTGCTAAGT...TGAAG\|GTC | 1 | 1 | 34.869 |
| 101756235 | GT-AG | 0 | 1.000000099473604e-05 | 379 | rna-XM_042872593.1 19079898 | 14 | 33714688 | 33715066 | Lagopus leucura 30410 | CTG\|GTAAGAACTT...ATAACGTTGTCT/GTAAGAATAACG...TTCAG\|ACT | 0 | 1 | 37.296 |
| 101756236 | GT-AG | 0 | 0.0001248797928131 | 633 | rna-XM_042872593.1 19079898 | 15 | 33715199 | 33715831 | Lagopus leucura 30410 | GAG\|GTAGGCATTT...ACTTACTTGATC/CTTAGACTTACT...TGTAG\|CTA | 0 | 1 | 40.565 |
| 101756237 | GT-AG | 0 | 1.000000099473604e-05 | 80 | rna-XM_042872593.1 19079898 | 16 | 33715995 | 33716074 | Lagopus leucura 30410 | TAG\|GTAAAAGAAC...TTCATTTTGATT/AAGGTTTTCATT...TGTAG\|CTG | 1 | 1 | 44.601 |
| 101756238 | GT-AG | 0 | 4.95996519176138e-05 | 509 | rna-XM_042872593.1 19079898 | 17 | 33716139 | 33716647 | Lagopus leucura 30410 | CAG\|GTATTATAAT...AAATCCTTTTTT/ATGTTATTGAAA...GCCAG\|GAG | 2 | 1 | 46.186 |
| 101756239 | GT-AG | 0 | 1.000000099473604e-05 | 1681 | rna-XM_042872593.1 19079898 | 18 | 33716712 | 33718392 | Lagopus leucura 30410 | AAG\|GTAAGTGTTT...TTGTTATTATTA/TTTGTTATTATT...CTTAG\|GCA | 0 | 1 | 47.771 |
| 101756240 | GT-AG | 0 | 1.000000099473604e-05 | 848 | rna-XM_042872593.1 19079898 | 19 | 33718438 | 33719285 | Lagopus leucura 30410 | AAG\|GTTAGTTTAA...TTTTCTATATTA/TTACTTTTCATT...TCCAG\|GAT | 0 | 1 | 48.886 |
| 101756241 | GT-AG | 0 | 1.000000099473604e-05 | 420 | rna-XM_042872593.1 19079898 | 20 | 33719340 | 33719759 | Lagopus leucura 30410 | GAG\|GTGAGCTAAA...TTGGATTTGATC/TTGGATTTGATC...TTCAG\|GTG | 0 | 1 | 50.223 |
| 101756242 | GT-AG | 0 | 1.000000099473604e-05 | 953 | rna-XM_042872593.1 19079898 | 21 | 33719877 | 33720829 | Lagopus leucura 30410 | CAG\|GTAATTATTT...TAAGCTGTAATT/AATATTATAATA...TTCAG\|GTT | 0 | 1 | 53.12 |
| 101756243 | GT-AG | 0 | 0.010195192222338 | 1010 | rna-XM_042872593.1 19079898 | 22 | 33720938 | 33721947 | Lagopus leucura 30410 | TTG\|GTATGTTATG...CTTAACTTGACT/CTTAACTTGACT...AACAG\|CTG | 0 | 1 | 55.795 |
| 101756244 | GT-AG | 0 | 1.000000099473604e-05 | 835 | rna-XM_042872593.1 19079898 | 23 | 33722049 | 33722883 | Lagopus leucura 30410 | AAG\|GTAAGCACAA...GGATTTTTATTT/TGGATTTTTATT...TAAAG\|GGC | 2 | 1 | 58.296 |
| 101756245 | GT-AG | 0 | 0.0286668129533314 | 539 | rna-XM_042872593.1 19079898 | 24 | 33723058 | 33723596 | Lagopus leucura 30410 | AAG\|GTAACCATGT...AATGTTTTAATT/TAAATTTTCACT...TTCAG\|AGT | 2 | 1 | 62.605 |
| 101756246 | GT-AG | 0 | 1.000000099473604e-05 | 1213 | rna-XM_042872593.1 19079898 | 25 | 33723709 | 33724921 | Lagopus leucura 30410 | GAG\|GTAGGAGTGT...ATTGTCTAATTT/TATTGTCTAATT...CTCAG\|ATA | 0 | 1 | 65.379 |
| 101756247 | GT-AG | 0 | 1.1013669557003792e-05 | 567 | rna-XM_042872593.1 19079898 | 26 | 33725135 | 33725701 | Lagopus leucura 30410 | GAG\|GTACAGTAGT...TGGTTCTTTTTT/ACTCCACTGATG...GTCAG\|GTT | 0 | 1 | 70.654 |
| 101756248 | GT-AG | 0 | 1.000000099473604e-05 | 1732 | rna-XM_042872593.1 19079898 | 27 | 33725792 | 33727523 | Lagopus leucura 30410 | CAG\|GTAGGGATTT...TTGCTCTTGATG/ATCGTGCTCATC...TTTAG\|GCT | 0 | 1 | 72.883 |
| 101756249 | GT-AG | 0 | 0.0027067237477466 | 428 | rna-XM_042872593.1 19079898 | 28 | 33727785 | 33728212 | Lagopus leucura 30410 | CAA\|GTATGTATTC...CATGTTGTAATT/TAATATTTCATG...TGTAG\|CAA | 0 | 1 | 79.346 |
| 101756250 | GT-AG | 0 | 1.000000099473604e-05 | 1052 | rna-XM_042872593.1 19079898 | 29 | 33728357 | 33729408 | Lagopus leucura 30410 | AAG\|GTAATGCAAG...CATTCCTTGTCT/TGCTGGCTAACT...CAAAG\|GTG | 0 | 1 | 82.912 |
| 101756251 | GT-AG | 0 | 3.5369378312742454e-05 | 407 | rna-XM_042872593.1 19079898 | 30 | 33729480 | 33729886 | Lagopus leucura 30410 | CAG\|GTATGATAAC...AAATACTTATTT/AAAATACTTATT...TCCAG\|AAA | 2 | 1 | 84.671 |
| 101756252 | GT-AG | 0 | 1.000000099473604e-05 | 155 | rna-XM_042872593.1 19079898 | 31 | 33729997 | 33730151 | Lagopus leucura 30410 | ATG\|GTAATGCTTA...GTACTCTTCACG/AACTTCTTTATG...TATAG\|GAT | 1 | 1 | 87.395 |
| 101756253 | GT-AG | 0 | 0.0096508080566165 | 644 | rna-XM_042872593.1 19079898 | 32 | 33730249 | 33730892 | Lagopus leucura 30410 | TAG\|GTATTCACGT...GTTTCCTAATCT/AGTTTCCTAATC...TGCAG\|GGT | 2 | 1 | 89.797 |
| 101756254 | GT-AG | 0 | 1.000000099473604e-05 | 1312 | rna-XM_042872593.1 19079898 | 33 | 33731032 | 33732343 | Lagopus leucura 30410 | GAG\|GTAAAAATTC...ATTTTTTTAATA/ATTTTTTTAATA...TACAG\|ATA | 0 | 1 | 93.239 |
| 101756255 | GT-AG | 0 | 0.0003660087009463 | 3275 | rna-XM_042872593.1 19079898 | 34 | 33732454 | 33735728 | Lagopus leucura 30410 | ACG\|GTAATCATTT...TTATATTTAATT/TTATATTTAATT...TTCAG\|TTT | 2 | 1 | 95.963 |
| 101756256 | GT-AG | 0 | 1.000000099473604e-05 | 1252 | rna-XM_042872593.1 19079898 | 35 | 33735796 | 33737047 | Lagopus leucura 30410 | CAA\|GTGAGTGTAG...GAATTCTAACTT/TGAATTCTAACT...ATTAG\|GTG | 0 | 1 | 97.623 |
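The phase values in the table can be cross-checked against the coordinates. The sketch below is only an illustration and rests on several assumptions that the page does not state: the transcript lies on the forward strand, start and end are inclusive, and the exons between these introns are entirely coding. Under those assumptions the exon between two consecutive introns has length `next.start - prev.end - 1`, and the next intron's phase should equal `(previous phase + exon length) mod 3`. The function names are hypothetical.

```python
# (start, end, phase) copied from the first four rows of the table above.
introns = [
    (33678511, 33694457, 2),
    (33694565, 33698616, 1),
    (33698718, 33699188, 0),
    (33699411, 33699885, 0),
]


def exon_lengths(rows):
    """Length of the exon lying between each pair of consecutive introns."""
    return [nxt[0] - prev[1] - 1 for prev, nxt in zip(rows, rows[1:])]


def predicted_phases(rows):
    """Phase expected for each following intron: (previous phase + exon length) mod 3."""
    return [
        (prev[2] + (nxt[0] - prev[1] - 1)) % 3
        for prev, nxt in zip(rows, rows[1:])
    ]


observed = [phase for _, _, phase in introns[1:]]

print(exon_lengths(introns))                  # [107, 101, 222]
print(predicted_phases(introns))              # [1, 0, 0]
print(predicted_phases(introns) == observed)  # True
```

For these four rows the predicted phases agree with the phase column, which is consistent with the assumptions above but does not prove them for the rest of the table.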
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
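To show how the schema and the `transcript_id` index might be used, here is a minimal sketch that builds a trimmed copy of the table in an in-memory SQLite database and repeats this page's filter (`transcript_id = 19079898`). It is not the database's own tooling. The trimmed column set and the omission of the foreign keys (the `transcripts` and `genomes` tables are not shown here) are simplifications, and the row with id 1 is made up purely so that the filter has something to exclude.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Trimmed copy of the schema: a subset of columns, no foreign keys.
conn.executescript("""
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "phase" INTEGER,
  PRIMARY KEY ("id")
);
CREATE INDEX "idx_introns_transcript_id" ON "introns" ("transcript_id");
""")

sample_rows = [
    # First three rows from the table on this page.
    (101756222, "GT-AG", 0, 1.000000099473604e-05, 15947, 19079898, 1, 33678511, 33694457, 2),
    (101756223, "GT-AG", 0, 1.000000099473604e-05, 4052, 19079898, 2, 33694565, 33698616, 1),
    (101756224, "GT-AG", 0, 8.722276489371117e-05, 471, 19079898, 3, 33698718, 33699188, 0),
    # Hypothetical row for a different transcript (invented for this example).
    (1, "GT-AG", 0, 0.0, 100, 1, 1, 1000, 1099, 0),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?)", sample_rows)

# Same filter as this page, using a bound parameter for the transcript id.
rows = conn.execute(
    'SELECT "id", "ordinal_index", "length" FROM "introns" '
    'WHERE "transcript_id" = ? ORDER BY "id"',
    (19079898,),
).fetchall()

for row in rows:
    print(row)
# (101756222, 1, 15947)
# (101756223, 2, 4052)
# (101756224, 3, 471)
```

Passing the transcript id as a bound parameter rather than formatting it into the SQL string keeps the query reusable for other transcripts.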