introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
21 rows where transcript_id = 26701895
This data as json, CSV (advanced)
Suggested facets: score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
148095278 | GT-AG | 0 | 1.000000099473604e-05 | 3359 | rna-XM_009482089.1 26701895 | 1 | 112769 | 116127 | Pelecanus crispus 36300 | CCG|GTGAGTGTTA...TAGTTCTCATTT/ATAGTTCTCATT...TGCAG|AAA | 0 | 1 | 5.978 |
148095279 | GT-AG | 0 | 1.094975254904561e-05 | 624 | rna-XM_009482089.1 26701895 | 2 | 112057 | 112680 | Pelecanus crispus 36300 | AAG|GTATGTGGCC...ATTTGCTTGTTT/AAGTATCTGATG...TTTAG|CTG | 1 | 1 | 9.964 |
148095280 | GT-AG | 0 | 1.000000099473604e-05 | 1872 | rna-XM_009482089.1 26701895 | 3 | 110043 | 111914 | Pelecanus crispus 36300 | CAA|GTAAGTGAAG...GTTTTCTTTCCT/TCGTACATCACC...CACAG|GAC | 2 | 1 | 16.395 |
148095281 | GT-AG | 0 | 1.000000099473604e-05 | 1110 | rna-XM_009482089.1 26701895 | 4 | 108840 | 109949 | Pelecanus crispus 36300 | TGT|GTAAGTAGCT...ATATTTTTGTTC/GCAATGGTGATA...TGAAG|GAT | 2 | 1 | 20.607 |
148095282 | GT-AG | 0 | 1.804312475243123e-05 | 5162 | rna-XM_009482089.1 26701895 | 5 | 103578 | 108739 | Pelecanus crispus 36300 | CAT|GTAAGATTTC...TGTGCGTTAGCT/GTGACATTGATA...TTTAG|GCC | 0 | 1 | 25.136 |
148095283 | GT-AG | 0 | 1.000000099473604e-05 | 2155 | rna-XM_009482089.1 26701895 | 6 | 101332 | 103486 | Pelecanus crispus 36300 | GAG|GTCAGTGTTG...GTGCTTTTATTT/TTATTTCTGACT...TTCAG|GGG | 1 | 1 | 29.257 |
148095284 | GT-AG | 0 | 1.000000099473604e-05 | 3190 | rna-XM_009482089.1 26701895 | 7 | 98064 | 101253 | Pelecanus crispus 36300 | TTG|GTGAGTGGAA...TGAGCCTCAGTC/CTGAGCCTCAGT...TTCAG|TCG | 1 | 1 | 32.79 |
148095285 | GT-AG | 0 | 1.000000099473604e-05 | 1245 | rna-XM_009482089.1 26701895 | 8 | 96736 | 97980 | Pelecanus crispus 36300 | GGC|GTAAGTGTGT...AGTTACTCACCT/TAGTTACTCACC...CCTAG|TTT | 0 | 1 | 36.549 |
148095286 | GT-AG | 0 | 1.000000099473604e-05 | 331 | rna-XM_009482089.1 26701895 | 9 | 96318 | 96648 | Pelecanus crispus 36300 | TCC|GTAAGGGATG...CCCATCTTACCT/TGCCTTTTCACC...TATAG|AAA | 0 | 1 | 40.489 |
148095287 | GT-AG | 0 | 1.000000099473604e-05 | 346 | rna-XM_009482089.1 26701895 | 10 | 95882 | 96227 | Pelecanus crispus 36300 | CAG|GTACAGTACA...AGGCCCATAGAA/GAAGGCTTCACA...CTTAG|GGA | 0 | 1 | 44.565 |
148095288 | GT-AG | 0 | 2.381395516117332e-05 | 2240 | rna-XM_009482089.1 26701895 | 11 | 93597 | 95836 | Pelecanus crispus 36300 | TTT|GTAAGTACTT...GCACTCTTACAT/TTATGAATCACT...TTCAG|GGA | 0 | 1 | 46.603 |
148095289 | GT-AG | 0 | 1.000000099473604e-05 | 1083 | rna-XM_009482089.1 26701895 | 12 | 92481 | 93563 | Pelecanus crispus 36300 | CAG|GTAAAAGTTA...TATTTCTTTGTA/TTTCTAGTCACT...TGCAG|TTC | 0 | 1 | 48.098 |
148095290 | GT-AG | 0 | 0.0001848849299819 | 2011 | rna-XM_009482089.1 26701895 | 13 | 90381 | 92391 | Pelecanus crispus 36300 | CAC|GTAAGTACTT...TTTTTCTTAATG/TTTTTTCTTAAT...CATAG|ACC | 2 | 1 | 52.129 |
148095291 | GT-AG | 0 | 1.2231815646614984e-05 | 1951 | rna-XM_009482089.1 26701895 | 14 | 88348 | 90298 | Pelecanus crispus 36300 | AAT|GTAAGTATAG...AGCCTTTTACTT/AAGCCTTTTACT...ATTAG|GAA | 0 | 1 | 55.842 |
148095292 | GT-AG | 0 | 1.000000099473604e-05 | 793 | rna-XM_009482089.1 26701895 | 15 | 87429 | 88221 | Pelecanus crispus 36300 | CAG|GTAAGCATAT...CCTTTCTCACCT/TCCTTTCTCACC...CGCAG|AAC | 0 | 1 | 61.549 |
148095293 | GT-AG | 0 | 1.000000099473604e-05 | 2650 | rna-XM_009482089.1 26701895 | 16 | 84614 | 87263 | Pelecanus crispus 36300 | TTA|GTGAGTGTGG...TTTTCCTTCCCC/ATATGTTTGATT...TTTAG|ATT | 0 | 1 | 69.022 |
148095294 | GT-AG | 0 | 1.000000099473604e-05 | 1637 | rna-XM_009482089.1 26701895 | 17 | 82930 | 84566 | Pelecanus crispus 36300 | TAG|GTAGGAATGG...TTCTCCTGCCCT/CCCTGTCTGACA...TGCAG|ATT | 2 | 1 | 71.15 |
148095295 | GT-AG | 0 | 0.0001990140809708 | 1401 | rna-XM_009482089.1 26701895 | 18 | 81418 | 82818 | Pelecanus crispus 36300 | CAA|GTAAGTTTCC...TGCTCTTTGTCT/CTGTACTTCATC...TCTAG|ATA | 2 | 1 | 76.178 |
148095296 | GT-AG | 0 | 2.4210062620606763e-05 | 2460 | rna-XM_009482089.1 26701895 | 19 | 78861 | 81320 | Pelecanus crispus 36300 | AAT|GTAAGTGTCC...GTACCTTTATTT/AAGGATCTGATC...GGTAG|GTT | 0 | 1 | 80.571 |
148095297 | GT-AG | 0 | 0.001260735811518 | 1086 | rna-XM_009482089.1 26701895 | 20 | 77628 | 78713 | Pelecanus crispus 36300 | CAG|GTACTCTGCT...AGGCCCTTCTCC/TACCCACTGACC...TTCAG|TCT | 0 | 1 | 87.228 |
148095298 | GT-AG | 0 | 2.315500811005712e-05 | 871 | rna-XM_009482089.1 26701895 | 21 | 76652 | 77522 | Pelecanus crispus 36300 | CAG|GTACGGTTCA...GGTCCCCTGACC/TCTGGACTGAGA...CCTAG|TGG | 0 | 1 | 91.984 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);