introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
24 rows where transcript_id = 2014040
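The filter above corresponds to a simple parameterized query. The following is a minimal sketch, assuming a local SQLite copy of the table: an in-memory database with a reduced column set and three rows copied from this page stands in for the real file, and the helper name `introns_for_transcript` is hypothetical.

```python
import sqlite3

# Assumption: an in-memory database with a subset of the columns stands in
# for the real database file.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        "id" INTEGER PRIMARY KEY,
        "dinucleotide_pair" TEXT,
        "is_minor" INTEGER,
        "score" REAL,
        "length" INTEGER,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        "start" INTEGER,
        "end" INTEGER,
        "phase" INTEGER,
        "in_cds" INTEGER
    )
    """
)

# Three rows copied from the listing on this page (phase is NULL for the UTR intron).
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    [
        (10831978, "GT-AG", 0, 0.0001835542604119, 1400, 2014040, 2, 2683698, 2685097, 1, 1),
        (10831988, "GC-AG", 0, 1.000000099473604e-05, 971, 2014040, 12, 2660424, 2661394, 0, 1),
        (10834301, "GT-AG", 0, 0.0001022971136655, 4968, 2014040, 1, 2685283, 2690250, None, 0),
    ],
)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, dinucleotide_pair, phase, in_cds) rows, ordered by id."""
    return conn.execute(
        "SELECT id, ordinal_index, dinucleotide_pair, phase, in_cds "
        "FROM introns WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),
    ).fetchall()


rows = introns_for_transcript(conn, 2014040)
for row in rows:
    print(row)
```

Binding `transcript_id` as a parameter rather than formatting it into the SQL string keeps the query safe to reuse with other identifiers.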
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10831978 | GT-AG | 0 | 0.0001835542604119 | 1400 | rna-XM_013187228.1 2014040 | 2 | 2683698 | 2685097 | Anser cygnoides 8845 | TAG\|GTATTTAAAG...AAACTTTTATTA/TTACTTTTCATG...TGCAG\|GGT | 1 | 1 | 6.317 |
| 10831979 | GT-AG | 0 | 0.0001009040710735 | 920 | rna-XM_013187228.1 2014040 | 3 | 2682570 | 2683489 | Anser cygnoides 8845 | CAG\|GTAAACACAC...GTACTCTTGACT/GTACTCTTGACT...TTTAG\|ATT | 2 | 1 | 13.457 |
| 10831980 | GT-AG | 0 | 1.000000099473604e-05 | 3869 | rna-XM_013187228.1 2014040 | 4 | 2678531 | 2682399 | Anser cygnoides 8845 | CAG\|GTTTGTCAGA...TAACTTTTATTT/ATATTGTTAACT...TACAG\|GGA | 1 | 1 | 19.293 |
| 10831981 | GT-AG | 0 | 1.000000099473604e-05 | 4465 | rna-XM_013187228.1 2014040 | 5 | 2673943 | 2678407 | Anser cygnoides 8845 | GAG\|GTGAGAGCTC...TATTCTTTTTCT/CTTTTTCTAATG...TGTAG\|AAT | 1 | 1 | 23.515 |
| 10831982 | GT-AG | 0 | 1.000000099473604e-05 | 387 | rna-XM_013187228.1 2014040 | 6 | 2673443 | 2673829 | Anser cygnoides 8845 | CGG\|GTAGGTATAG...ATCTCCTTTTTT/TTTGTTTTCATC...TTCAG\|CAA | 0 | 1 | 27.394 |
| 10831983 | GT-AG | 0 | 1.000000099473604e-05 | 635 | rna-XM_013187228.1 2014040 | 7 | 2672645 | 2673279 | Anser cygnoides 8845 | AAG\|GTAAGTCCTT...TTCTCTTTGATT/TTCTCTTTGATT...TATAG\|GAT | 1 | 1 | 32.99 |
| 10831984 | GT-AG | 0 | 1.000000099473604e-05 | 542 | rna-XM_013187228.1 2014040 | 8 | 2672010 | 2672551 | Anser cygnoides 8845 | TTT\|GTAAGTGAAG...TATTTCTTTCTG/GAGTTATTAAGC...TCCAG\|TGC | 1 | 1 | 36.183 |
| 10831985 | GT-AG | 0 | 1.000000099473604e-05 | 786 | rna-XM_013187228.1 2014040 | 9 | 2671132 | 2671917 | Anser cygnoides 8845 | AAG\|GTACAGTGAT...CCCTTCTTGAAT/CAGTTTTTGATA...CCTAG\|GAT | 0 | 1 | 39.341 |
| 10831986 | GT-AG | 0 | 1.000000099473604e-05 | 3424 | rna-XM_013187228.1 2014040 | 10 | 2667627 | 2671050 | Anser cygnoides 8845 | CAT\|GTAAGTGATA...TTTTGTTTAATT/TTTTGTTTAATT...TTTAG\|TAT | 0 | 1 | 42.122 |
| 10831987 | GT-AG | 0 | 1.000000099473604e-05 | 6025 | rna-XM_013187228.1 2014040 | 11 | 2661498 | 2667522 | Anser cygnoides 8845 | TCG\|GTAAGTTGAA...TGTCTCTTTGTT/TGTTTTCTCAGA...TTCAG\|AGT | 2 | 1 | 45.692 |
| 10831988 | GC-AG | 0 | 1.000000099473604e-05 | 971 | rna-XM_013187228.1 2014040 | 12 | 2660424 | 2661394 | Anser cygnoides 8845 | AAG\|GCATGTTAAA...TTTTTCCTATTT/AGAAATTTTACT...TCTAG\|AAA | 0 | 1 | 49.228 |
| 10831989 | GT-AG | 0 | 1.000000099473604e-05 | 2157 | rna-XM_013187228.1 2014040 | 13 | 2658144 | 2660300 | Anser cygnoides 8845 | AAG\|GTAAGGTATC...GATGTCTTAACC/AATAGTTTAATA...CACAG\|GAC | 0 | 1 | 53.45 |
| 10831990 | GT-AG | 0 | 9.566511356611091e-05 | 3998 | rna-XM_013187228.1 2014040 | 14 | 2654063 | 2658060 | Anser cygnoides 8845 | CAG\|GTATGGTATG...ATAATTTTAAAC/GCATATTTTACT...TCCAG\|CCC | 2 | 1 | 56.299 |
| 10831991 | GT-AG | 0 | 6.312043607376287e-05 | 2782 | rna-XM_013187228.1 2014040 | 15 | 2651136 | 2653917 | Anser cygnoides 8845 | TAT\|GTAAGTACCC...TTATTCTTACTT/TTTATTCTTACT...CATAG\|CTC | 0 | 1 | 61.277 |
| 10831992 | GT-AG | 0 | 9.668750800338064e-05 | 1593 | rna-XM_013187228.1 2014040 | 16 | 2649432 | 2651024 | Anser cygnoides 8845 | GCG\|GTAAGCTACG...AGTTCTATAATT/TTTAAACTCAAT...TTCAG\|TAT | 0 | 1 | 65.088 |
| 10831993 | GT-AG | 0 | 1.2349950777355656e-05 | 1991 | rna-XM_013187228.1 2014040 | 17 | 2647320 | 2649310 | Anser cygnoides 8845 | ATG\|GTAATGTTCT...ATATCATTGATT/ATTGATTTAACT...TCTAG\|ATG | 1 | 1 | 69.241 |
| 10831994 | GT-AG | 0 | 1.000000099473604e-05 | 825 | rna-XM_013187228.1 2014040 | 18 | 2646403 | 2647227 | Anser cygnoides 8845 | CAG\|GTTGGTGTGA...AATTTCTTACAT/TAATTTCTTACA...TCCAG\|GCT | 0 | 1 | 72.4 |
| 10831995 | GT-AG | 0 | 8.365894432698146e-05 | 1017 | rna-XM_013187228.1 2014040 | 19 | 2645274 | 2646290 | Anser cygnoides 8845 | ACA\|GTAAGTTAAA...ACTGTTTTAACT/ACTGTTTTAACT...TTTAG\|GAG | 1 | 1 | 76.244 |
| 10831996 | GT-AG | 0 | 9.751665662021686e-05 | 3440 | rna-XM_013187228.1 2014040 | 20 | 2641666 | 2645105 | Anser cygnoides 8845 | TAG\|GTAAGTTTTC...TTTCTTTTAAAT/AAATTTGTCACT...CCAAG\|GTT | 1 | 1 | 82.012 |
| 10831997 | GT-AG | 0 | 1.000000099473604e-05 | 2399 | rna-XM_013187228.1 2014040 | 21 | 2638994 | 2641392 | Anser cygnoides 8845 | GAG\|GTAAGAAAAA...TAATTTTTACTA/CTTATTCTAATT...TGCAG\|ATA | 1 | 1 | 91.383 |
| 10831998 | GT-AG | 0 | 1.000000099473604e-05 | 769 | rna-XM_013187228.1 2014040 | 22 | 2638163 | 2638931 | Anser cygnoides 8845 | AAG\|GTAAGTTAGT...AATCTCTGAGTG/CAATCTCTGAGT...TGTAG\|GTG | 0 | 1 | 93.512 |
| 10831999 | GT-AG | 0 | 5.27200950724075e-05 | 2024 | rna-XM_013187228.1 2014040 | 23 | 2636066 | 2638089 | Anser cygnoides 8845 | CAT\|GTAAGTATTG...ATAATTTTGATG/ATAATTTTGATG...AATAG\|GCC | 1 | 1 | 96.018 |
| 10832000 | GT-AG | 0 | 1.000000099473604e-05 | 1121 | rna-XM_013187228.1 2014040 | 24 | 2634877 | 2635997 | Anser cygnoides 8845 | CAG\|GTAAAAAGAA...GTCTTCTGATTA/AGTCTTCTGATT...TGCAG\|CCA | 0 | 1 | 98.352 |
| 10834301 | GT-AG | 0 | 0.0001022971136655 | 4968 | rna-XM_013187228.1 2014040 | 1 | 2685283 | 2690250 | Anser cygnoides 8845 | TGT\|GTAAGTTACT...TTTTTTTTTTTT/CAGGGCTTAAAT...TTAAG\|GCT |  | 0 | 0.824 |
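The `start`, `end`, and `length` values in the rows above can be cross-checked with a little arithmetic. The sketch below uses four rows copied from the table and assumes that coordinates are 1-based and inclusive, so that `length = end - start + 1`; that convention is an inference from the rows shown, not something the page states. The helper name `span_length` is hypothetical.

```python
# (ordinal_index, start, end, length) copied from four rows of the table above.
rows = [
    (1, 2685283, 2690250, 4968),
    (2, 2683698, 2685097, 1400),
    (3, 2682570, 2683489, 920),
    (12, 2660424, 2661394, 971),
]


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Rows whose stored length disagrees with the coordinate span.
mismatches = [r for r in rows if span_length(r[1], r[2]) != r[3]]

# Check whether start decreases as ordinal_index increases.
ordered = sorted(rows)
descending = all(a[1] > b[1] for a, b in zip(ordered, ordered[1:]))

print("mismatches:", mismatches)
print("start decreases with ordinal_index:", descending)
```

For these rows the stored lengths agree with the inclusive span. The start coordinates fall as `ordinal_index` rises, which would be consistent with a transcript annotated on the reverse strand, although the table itself carries no strand column to confirm this.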
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
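To see how the indexes above come into play, the schema can be loaded into a scratch SQLite database and a filtered query inspected with `EXPLAIN QUERY PLAN`. This is a minimal sketch using Python's standard `sqlite3` module and an empty in-memory database; the exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

# Schema and indexes as listed above (comma placement normalized).
SCHEMA = """
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
"""

# Assumption: an empty in-memory database is enough to inspect the query plan.
conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Names of the explicitly created indexes (column 1 of PRAGMA index_list).
index_names = sorted(
    row[1]
    for row in conn.execute("PRAGMA index_list('introns')")
    if row[1].startswith("idx_")
)

# The last column of each EXPLAIN QUERY PLAN row is the human-readable detail.
plan_details = [
    row[-1]
    for row in conn.execute(
        "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
        (2014040,),
    )
]

print(index_names)
print(plan_details)
```

A plan that mentions `idx_introns_transcript_id` indicates that the filter on `transcript_id` is resolved through the index rather than a full table scan.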