introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
13 rows where transcript_id = 26701860
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
148094962 | GT-AG | 0 | 1.000000099473604e-05 | 1045 | rna-XM_009477762.1 26701860 | 1 | 38031 | 39075 | Pelecanus crispus 36300 | GAG|GTAAGAGGTC...TTCTTTTTGTCT/GATGTACATATT...TGTAG|GTT | 0 | 1 | 3.953 |
148094963 | GT-AG | 0 | 1.000000099473604e-05 | 478 | rna-XM_009477762.1 26701860 | 2 | 37467 | 37944 | Pelecanus crispus 36300 | CCT|GTGAGTCTGC...TCTGCTTTCTCT/TATCCAGTGATT...TCTAG|AAT | 2 | 1 | 9.527 |
148094964 | GT-AG | 0 | 1.000000099473604e-05 | 4955 | rna-XM_009477762.1 26701860 | 3 | 32418 | 37372 | Pelecanus crispus 36300 | TTG|GTTAACCTGA...TTTTTTTTCATT/TTTTTTTTCATT...CATAG|ATT | 0 | 1 | 15.619 |
148094965 | GT-AG | 0 | 1.000000099473604e-05 | 1899 | rna-XM_009477762.1 26701860 | 4 | 30374 | 32272 | Pelecanus crispus 36300 | GTG|GTCAGCATAG...AATTCATTAATA/AAATAATTCATT...TTCAG|ACA | 1 | 1 | 25.016 |
148094966 | GT-AG | 0 | 0.0016867140739651 | 3390 | rna-XM_009477762.1 26701860 | 5 | 26833 | 30222 | Pelecanus crispus 36300 | AAG|GTACTTTTCT...TTTCTCTTGCTT/TTTAATATGACT...TTCAG|CCA | 2 | 1 | 34.802 |
148094967 | TA-TT | 0 | 1.000000099473604e-05 | 1376 | rna-XM_009477762.1 26701860 | 6 | 25340 | 26715 | Pelecanus crispus 36300 | GAA|TAGGGAAAGT...TTAATTTAAACT/GAATAATTAATT...TATTT|CAG | 2 | 1 | 42.385 |
148094968 | GT-AG | 0 | 1.000000099473604e-05 | 1751 | rna-XM_009477762.1 26701860 | 7 | 23435 | 25185 | Pelecanus crispus 36300 | AAG|GTCAGCTCTC...GTGTCCTAAAAT/GTCTGGTTTACT...TAAAG|GAT | 0 | 1 | 52.366 |
148094969 | GT-AG | 0 | 1.000000099473604e-05 | 390 | rna-XM_009477762.1 26701860 | 8 | 22989 | 23378 | Pelecanus crispus 36300 | AAG|GTAAAGATGA...TTTCTTTTGAAT/TTTCTTTTGAAT...TTTAG|AAA | 2 | 1 | 55.995 |
148094970 | GT-AG | 0 | 0.0001523292571669 | 2066 | rna-XM_009477762.1 26701860 | 9 | 20862 | 22927 | Pelecanus crispus 36300 | CAG|GTAATCTGTA...TGCCCCTTCATT/GTGTATCTTATG...TTCAG|GCA | 0 | 1 | 59.948 |
148094971 | GT-AG | 0 | 1.000000099473604e-05 | 5617 | rna-XM_009477762.1 26701860 | 10 | 15117 | 20733 | Pelecanus crispus 36300 | TGG|GTGAGAAGTA...AGGTTATTGACT/AGGTTATTGACT...ATGAG|AGG | 2 | 1 | 68.244 |
148094972 | GT-AG | 0 | 1.000000099473604e-05 | 1417 | rna-XM_009477762.1 26701860 | 11 | 13592 | 15008 | Pelecanus crispus 36300 | CAT|GTAAGAACTT...GTGTGCTAAATG/TGTGTGCTAAAT...TGCAG|AAC | 2 | 1 | 75.243 |
148094973 | GT-AG | 0 | 1.000000099473604e-05 | 1106 | rna-XM_009477762.1 26701860 | 13 | 12362 | 13467 | Pelecanus crispus 36300 | AAG|GTGAGTATGC...TTTTTTTTAACG/TTTTTTTTAACG...TGCAG|TGA | 2 | 1 | 83.215 |
148094974 | GT-AG | 0 | 1.000000099473604e-05 | 830 | rna-XM_009477762.1 26701860 | 14 | 11405 | 12234 | Pelecanus crispus 36300 | ACG|GTGAGTAGCT...TTATTTTTGTTT/CTTTGTTTCATT...AACAG|GTG | 0 | 1 | 91.445 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);