introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
11 rows where transcript_id = 6831598
This data as json, CSV (advanced)
Suggested facets: score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
35540778 | GT-AG | 0 | 0.0016218654637301 | 1455 | rna-XM_009709734.1 6831598 | 1 | 4595 | 6049 | Cariama cristata 54380 | GGT|GTAAGTTTAC...GCTTTTTTATCC/TCCTTTTTTATC...CCTAG|ATG | 1 | 1 | 11.672 |
35540779 | GT-AG | 0 | 0.0004575885151024 | 115 | rna-XM_009709734.1 6831598 | 2 | 6151 | 6265 | Cariama cristata 54380 | GAG|GTATTTAATA...CGTTTCTTGTTA/TTTCTTGTTACT...TTTAG|TTG | 0 | 1 | 17.34 |
35540780 | GT-AG | 0 | 1.000000099473604e-05 | 3162 | rna-XM_009709734.1 6831598 | 3 | 6350 | 9511 | Cariama cristata 54380 | GTG|GTAAGGTTGT...TCTTTCTTTTTT/ACAATGGTAATA...TCCAG|GAG | 0 | 1 | 22.054 |
35540781 | GT-AG | 0 | 1.000000099473604e-05 | 676 | rna-XM_009709734.1 6831598 | 4 | 9677 | 10352 | Cariama cristata 54380 | GAG|GTGAGTAAAG...TCGATTTTAATT/TCGATTTTAATT...TTAAG|TTT | 0 | 1 | 31.313 |
35540782 | GT-AG | 0 | 1.000000099473604e-05 | 1578 | rna-XM_009709734.1 6831598 | 5 | 10462 | 12039 | Cariama cristata 54380 | TAG|GTAAGTGTTA...TAGTTTTTAATC/TAGTTTTTAATC...TACAG|AAG | 1 | 1 | 37.43 |
35540783 | GT-AG | 0 | 4.253382343568423e-05 | 1090 | rna-XM_009709734.1 6831598 | 6 | 12415 | 13504 | Cariama cristata 54380 | ATG|GTAAAGTTTT...TTTCTTTTGAAC/TTTCTTTTGAAC...TGTAG|ACA | 1 | 1 | 58.474 |
35540784 | GT-AG | 0 | 1.000000099473604e-05 | 2912 | rna-XM_009709734.1 6831598 | 7 | 13635 | 16546 | Cariama cristata 54380 | CAA|GTAGGTTATG...GTTTGTTTACAA/TGTTTGTTTACA...TATAG|GAC | 2 | 1 | 65.769 |
35540785 | GT-AG | 0 | 0.0075733716218085 | 930 | rna-XM_009709734.1 6831598 | 8 | 16705 | 17634 | Cariama cristata 54380 | AAC|GTAATCCTGG...CCACCCTTAAAT/AAAAAATTTATT...ACCAG|CCA | 1 | 1 | 74.635 |
35540786 | GT-AG | 0 | 1.000000099473604e-05 | 2763 | rna-XM_009709734.1 6831598 | 9 | 17754 | 20516 | Cariama cristata 54380 | AAG|GTAATGCCTT...CTTTTTTTGAAG/TGTTCTCTGATA...GATAG|GAT | 0 | 1 | 81.313 |
35540787 | GT-AG | 0 | 1.000000099473604e-05 | 2267 | rna-XM_009709734.1 6831598 | 10 | 20677 | 22943 | Cariama cristata 54380 | CTG|GTAAGACTTC...GTAATGTTAATT/CTTATATTCATG...TCAAG|GTG | 1 | 1 | 90.292 |
35540788 | GT-AG | 0 | 7.633397916384083e-05 | 4969 | rna-XM_009709734.1 6831598 | 11 | 23036 | 28004 | Cariama cristata 54380 | AAG|GTAATCTGCA...TTTCCTCTAATA/TTTCCTCTAATA...TGCAG|TTG | 0 | 1 | 95.455 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);