introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
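The relationship between `scored_motifs` and `dinucleotide_pair` can be sketched in a few lines of Python. This is an illustration only: it assumes the `upstream|intron...intron|downstream` layout seen in the sample rows on this page, which the column descriptions do not formally specify, and the helper name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a 'GT-AG'-style pair from a scored_motifs string.

    Assumed layout (inferred from the sample rows, not documented):
    upstream_exon|intron_start...intron_end|downstream_exon
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    # First two and last two characters of the intronic portion.
    return f"{intron[:2]}-{intron[-2:]}"


# Value copied from the first intron of transcript 23988675.
example = "AAG|GTGCGTGGCG...AATTCCTAACCT/CTCTGTTTCACA...GACAG|ACA"
print(terminal_dinucleotides(example))  # GT-AG
```

For the sample rows, the derived pair matches the stored `dinucleotide_pair` value; whether that holds across the whole table is not something this page confirms.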
23 rows where transcript_id = 23988675
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 130512922 | GT-AG | 0 | 1.000000099473604e-05 | 457 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 1 | 3825627 | 3826083 | Nothoprocta pentlandii 2585814 | AAG\|GTGCGTGGCG...AATTCCTAACCT/CTCTGTTTCACA...GACAG\|ACA | 0 | 1 | 4.493 |
| 130512923 | GT-AG | 0 | 0.000933432277867 | 18278 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 2 | 3826158 | 3844435 | Nothoprocta pentlandii 2585814 | CAG\|GTATTTCCTT...GCCATCTTAAGA/CAGTGGTTAAAA...GTAAG\|CTC | 2 | 1 | 7.335 |
| 130512924 | GT-AG | 0 | 0.0005943233704085 | 3451 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 3 | 3844539 | 3847989 | Nothoprocta pentlandii 2585814 | AGA\|GTAAGTTTTT...ATCCTTTTAAAC/TTCCTGCTCATC...TACAG\|GAA | 0 | 1 | 11.29 |
| 130512925 | GT-AG | 0 | 1.000000099473604e-05 | 683 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 4 | 3848138 | 3848820 | Nothoprocta pentlandii 2585814 | CAG\|GTCAGGGCTT...ATATACTTGTTT/TTGTTTCTTACT...GGCAG\|GCG | 1 | 1 | 16.974 |
| 130512926 | GT-AG | 0 | 1.000000099473604e-05 | 1557 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 5 | 3848895 | 3850451 | Nothoprocta pentlandii 2585814 | GAG\|GTAATAATAT...TTTTTCTTCTTT/ATACTATTTACT...TATAG\|ATC | 0 | 1 | 19.816 |
| 130512927 | GT-AG | 0 | 1.000000099473604e-05 | 2662 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 6 | 3850559 | 3853220 | Nothoprocta pentlandii 2585814 | TAT\|GTGAGTACAA...TGGATTTTATAT/ATAATGTTCACA...TTTAG\|TAT | 2 | 1 | 23.925 |
| 130512928 | GT-AG | 0 | 1.000000099473604e-05 | 2869 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 7 | 3853329 | 3856197 | Nothoprocta pentlandii 2585814 | AGG\|GTGAGTAACG...TTTCTCTTGACT/TGTTGATTCATT...TGCAG\|GAC | 2 | 1 | 28.072 |
| 130512929 | GT-AG | 0 | 1.000000099473604e-05 | 2597 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 8 | 3856320 | 3858916 | Nothoprocta pentlandii 2585814 | CAG\|GTAAGAGCTT...AAATGCTTGATA/CGGGAACTGACT...TTCAG\|ATG | 1 | 1 | 32.757 |
| 130512930 | GT-AG | 0 | 0.0662381906682207 | 1303 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 9 | 3859040 | 3860342 | Nothoprocta pentlandii 2585814 | ATG\|GTATGTTTTA...TTTTCCTTTTTT/GCGACTATCATT...TTGAG\|GCA | 1 | 1 | 37.481 |
| 130512931 | GT-AG | 0 | 2.490734651621317e-05 | 913 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 10 | 3860513 | 3861425 | Nothoprocta pentlandii 2585814 | CCA\|GTAAGTAAGC...AAAGCCTTATTT/GCCTTATTTATC...CACAG\|GCC | 0 | 1 | 44.009 |
| 130512932 | GT-AG | 0 | 9.354392977640253e-05 | 1940 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 11 | 3861546 | 3863485 | Nothoprocta pentlandii 2585814 | GAA\|GTAGGTCTAC...TTCACCTTATTA/GAGTGTTTCACC...TTTAG\|GTT | 0 | 1 | 48.618 |
| 130512933 | GT-AG | 0 | 1.000000099473604e-05 | 794 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 12 | 3863564 | 3864357 | Nothoprocta pentlandii 2585814 | GAG\|GTAATGTACT...CAAACCTGATTC/AGTTAATTTACA...AATAG\|GTG | 0 | 1 | 51.613 |
| 130512934 | GT-AG | 0 | 1.000000099473604e-05 | 1116 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 13 | 3864442 | 3865557 | Nothoprocta pentlandii 2585814 | AAG\|GTCAGTATTC...ATTGTTTTAATG/ATTGTTTTAATG...TTCAG\|TTA | 0 | 1 | 54.839 |
| 130512935 | GT-AG | 0 | 2.313502696184223e-05 | 1770 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 14 | 3865741 | 3867510 | Nothoprocta pentlandii 2585814 | AAC\|GTGAGCCTTC...TTTCCTTTATTT/TTATGTTTAATT...TCAAG\|AAT | 0 | 1 | 61.866 |
| 130512936 | GT-AG | 0 | 1.000000099473604e-05 | 582 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 15 | 3867617 | 3868198 | Nothoprocta pentlandii 2585814 | AAC\|GTAAGATATT...GTATCGTTATTT/AAAACTTTTATA...TTCAG\|AAT | 1 | 1 | 65.937 |
| 130512937 | GT-AG | 0 | 3.725625312947e-05 | 3155 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 16 | 3868347 | 3871501 | Nothoprocta pentlandii 2585814 | GAG\|GTACTGTACC...GAATTTTTACTT/TGAATTTTTACT...GATAG\|AAT | 2 | 1 | 71.621 |
| 130512938 | GT-AG | 0 | 0.0013762891230934 | 618 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 17 | 3871647 | 3872264 | Nothoprocta pentlandii 2585814 | AGA\|GTAAGCATTT...ACTATTTTAATT/ACTATTTTAATT...TATAG\|ACA | 0 | 1 | 77.189 |
| 130512939 | GT-AG | 0 | 0.0003836218267609 | 90 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 18 | 3872307 | 3872396 | Nothoprocta pentlandii 2585814 | GGT\|GTAAGTATTT...TTGTTTTTATTA/TTTTTATTAATT...TGAAG\|TTT | 0 | 1 | 78.802 |
| 130512940 | GT-AG | 0 | 0.0252902373967383 | 2991 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 19 | 3872472 | 3875462 | Nothoprocta pentlandii 2585814 | ATG\|GTATATTTTA...AATTTCTGATCT/AAATTTCTGATC...TATAG\|GAA | 0 | 1 | 81.682 |
| 130512941 | GT-AG | 0 | 0.000140293191486 | 422 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 20 | 3875539 | 3875960 | Nothoprocta pentlandii 2585814 | AGC\|GTAAGTTTAC...AGGTATTTAACA/GCATTATTTATA...TTTAG\|GTA | 1 | 1 | 84.601 |
| 130512942 | GT-AG | 0 | 1.000000099473604e-05 | 329 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 21 | 3876045 | 3876373 | Nothoprocta pentlandii 2585814 | CAG\|GTAAGAATTT...ATTTCCATATTA/ATGCTACTAATA...TTTAG\|GCA | 1 | 1 | 87.826 |
| 130512943 | GT-AG | 0 | 11.107578946359409 | 389 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 22 | 3876468 | 3876856 | Nothoprocta pentlandii 2585814 | TAA\|GTATCTTTAT...ACATTTTTATAA/AGTTTATTTATG...TAAAG\|TAG | 2 | 1 | 91.436 |
| 130512944 | GT-AG | 0 | 1.000000099473604e-05 | 797 | rna-gnl\|WGS:VZSG\|NOTPEN_R04374_mrna 23988675 | 23 | 3876928 | 3877724 | Nothoprocta pentlandii 2585814 | GTG\|GTAAGTAGGC...TTTGTCTAAGCT/TTTTTTTTCAGA...TTCAG\|ATG | 1 | 1 | 94.163 |
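As a quick consistency check on the coordinates, the rows above appear to use inclusive `start` and `end` positions, so that `length = end - start + 1`. The Python sketch below tests that on four rows copied from the table. The inclusive-coordinate reading is an inference from these rows rather than something the column descriptions state, and `inclusive_length` is a hypothetical helper.

```python
# (ordinal_index, start, end, length) copied from four rows of the table.
rows = [
    (1, 3825627, 3826083, 457),
    (2, 3826158, 3844435, 18278),
    (18, 3872307, 3872396, 90),
    (23, 3876928, 3877724, 797),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included."""
    return end - start + 1


# Collect the ordinal index of any row whose stored length disagrees.
mismatches = [
    ordinal
    for ordinal, start, end, length in rows
    if inclusive_length(start, end) != length
]
print(mismatches)  # []
```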
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
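The `transcript_id = 23988675` filter used for this page can be reproduced against a local SQLite copy. The sketch below uses Python's standard `sqlite3` module with an in-memory database and a trimmed version of the schema (foreign keys and the `scored_motifs`, `in_cds`, and `relative_position` columns are left out for brevity). The three inserted rows are copied from the table on this page; the function name `introns_for_transcript` is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Trimmed schema for illustration; "start" and "end" are quoted
# because END is an SQL keyword.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        "start" INTEGER,
        "end" INTEGER,
        taxonomy_id INTEGER,
        phase INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# Three of the 23 rows for this transcript.
sample = [
    (130512922, "GT-AG", 0, 1.000000099473604e-05, 457, 23988675, 1, 3825627, 3826083, 2585814, 0),
    (130512923, "GT-AG", 0, 0.000933432277867, 18278, 23988675, 2, 3826158, 3844435, 2585814, 2),
    (130512943, "GT-AG", 0, 11.107578946359409, 389, 23988675, 22, 3876468, 3876856, 2585814, 2),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)", sample)


def introns_for_transcript(conn, transcript_id):
    """Return (ordinal_index, start, end, score) rows for one transcript."""
    cur = conn.execute(
        'SELECT ordinal_index, "start", "end", score FROM introns '
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


result = introns_for_transcript(conn, 23988675)
# Highest-scoring intron among the sample rows.
top = max(result, key=lambda row: row[3])
print(top[0])  # 22
```

Passing the transcript id as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with untrusted input.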