introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
13 rows where transcript_id = 26701845
This data as json, CSV (advanced)
Suggested facets: score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
148094846 | GT-AG | 0 | 1.000000099473604e-05 | 14581 | rna-XM_009492168.1 26701845 | 1 | 29310 | 43890 | Pelecanus crispus 36300 | TAT|GTGAGTGATA...CTTTTCTTGTTT/ATTTTTTTCATT...TGTAG|AAG | 2 | 1 | 3.161 |
148094847 | GT-AG | 0 | 1.000000099473604e-05 | 15344 | rna-XM_009492168.1 26701845 | 2 | 44135 | 59478 | Pelecanus crispus 36300 | ATG|GTGAGTTTCA...ATCCTTCTGACA/CTATGTTTCACA...TGCAG|GAG | 0 | 1 | 13.177 |
148094848 | GT-AG | 0 | 0.0009596111607603 | 41702 | rna-XM_009492168.1 26701845 | 3 | 59690 | 101391 | Pelecanus crispus 36300 | AGG|GTATGTGCTG...TTTTCCTTATGT/ATATATTTGATA...AATAG|ATT | 1 | 1 | 21.839 |
148094849 | GT-AG | 0 | 0.0029303845908356 | 28053 | rna-XM_009492168.1 26701845 | 4 | 101482 | 129534 | Pelecanus crispus 36300 | AAG|GTACCTCACT...TATCTCATATTT/GTCTATCTCATA...TTTAG|GTA | 1 | 1 | 25.534 |
148094850 | GT-AG | 0 | 1.000000099473604e-05 | 3564 | rna-XM_009492168.1 26701845 | 5 | 129708 | 133271 | Pelecanus crispus 36300 | CAG|GTGAGGCTCC...ACTGCTTTAATT/ACTGCTTTAATT...AATAG|ATT | 0 | 1 | 32.635 |
148094851 | GT-AG | 0 | 1.000000099473604e-05 | 45053 | rna-XM_009492168.1 26701845 | 6 | 133341 | 178393 | Pelecanus crispus 36300 | CAG|GTCAGTGTGT...TATTTCTGATCA/CTATTTCTGATC...AGCAG|GTT | 0 | 1 | 35.468 |
148094852 | GT-AG | 0 | 1.000000099473604e-05 | 8677 | rna-XM_009492168.1 26701845 | 7 | 178515 | 187191 | Pelecanus crispus 36300 | CAG|GTAAGCTGAC...ATTGTCTGAACC/TCTCTGCTCATT...GACAG|GTG | 1 | 1 | 40.435 |
148094853 | GT-AG | 0 | 9.702352432511884e-05 | 24232 | rna-XM_009492168.1 26701845 | 8 | 187272 | 211503 | Pelecanus crispus 36300 | CAG|GTATGTGCCA...ATCTTCTTGTCT/TCTTTTTTTAGG...TCCAG|CAA | 0 | 1 | 43.719 |
148094854 | GT-AG | 0 | 0.0002168098002145 | 2912 | rna-XM_009492168.1 26701845 | 10 | 215728 | 218639 | Pelecanus crispus 36300 | TAG|GTAACTGATC...ATTACCTAAATA/CAGATTCTAATT...TATAG|ATA | 1 | 1 | 56.691 |
148094855 | GT-AG | 0 | 1.000000099473604e-05 | 27227 | rna-XM_009492168.1 26701845 | 11 | 218831 | 246057 | Pelecanus crispus 36300 | AAG|GTAATGTCTT...GGAGCATTAAGA/TGAGAACTGACC...TCCAG|GAA | 0 | 1 | 64.532 |
148094856 | GT-AG | 0 | 1.000000099473604e-05 | 16996 | rna-XM_009492168.1 26701845 | 12 | 246167 | 263162 | Pelecanus crispus 36300 | AAG|GTGAGTTGCA...CCTGCCCTGACA/CCTGCCCTGACA...TACAG|GAA | 1 | 1 | 69.007 |
148094857 | GT-AG | 0 | 1.000000099473604e-05 | 2508 | rna-XM_009492168.1 26701845 | 13 | 263397 | 265904 | Pelecanus crispus 36300 | TAG|GTAAGTGCAA...CCTGCTTTACTT/TGAGCACTCATT...AATAG|GAT | 1 | 1 | 78.612 |
148094858 | GT-AG | 0 | 1.000000099473604e-05 | 7928 | rna-XM_009492168.1 26701845 | 14 | 266122 | 274049 | Pelecanus crispus 36300 | AGG|GTGAGTACTA...CCTCCTTTGAAC/CCTCCTTTGAAC...TCCAG|ACA | 2 | 1 | 87.521 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);