introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
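The column descriptions do not say how length relates to start and end. In the rows listed on this page the values are consistent with inclusive coordinates, i.e. length = end - start + 1. The sketch below checks that relationship on three of those rows; the helper name `inclusive_length` is hypothetical, and the convention is inferred from the sample, not stated by the source.

```python
# (start, end, length) copied from three rows listed for transcript_id 32672002.
rows = [
    (128836401, 128839045, 2645),
    (128836041, 128836284, 244),
    (128814430, 128814887, 458),
]


def inclusive_length(start, end):
    """Length of an interval whose start and end are both included (assumed convention)."""
    return end - start + 1


# Collect any sample row whose stored length disagrees with the assumed formula.
mismatches = [r for r in rows if inclusive_length(r[0], r[1]) != r[2]]
print(mismatches)
```

An empty list means the sampled rows agree with the inclusive convention; it does not prove the convention holds for the whole table.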
23 rows where transcript_id = 32672002
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182503754 | GT-AG | 0 | 1.000000099473604e-05 | 2645 | rna-XM_009086910.3 32672002 | 1 | 128836401 | 128839045 | Serinus canaria 9135 | GAG\|GTAAAACGGG...ATTTCCTAGACT/TTGTGTTTTACA...TGTAG\|CAA | 0 | 1 | 1.149 |
| 182503755 | GT-AG | 0 | 1.000000099473604e-05 | 244 | rna-XM_009086910.3 32672002 | 2 | 128836041 | 128836284 | Serinus canaria 9135 | TGG\|GTAAGTGAAA...TTAATTTTACTT/TTTACTTTCATT...CCTAG\|AGA | 2 | 1 | 3.264 |
| 182503756 | GT-AG | 0 | 0.0001849799711864 | 387 | rna-XM_009086910.3 32672002 | 3 | 128835567 | 128835953 | Serinus canaria 9135 | AAG\|GTACGTTCCT...TCTGCTTTATAT/GTATCTTTCATT...AACAG\|CCT | 2 | 1 | 4.85 |
| 182503757 | GT-AG | 0 | 1.000000099473604e-05 | 479 | rna-XM_009086910.3 32672002 | 4 | 128835039 | 128835517 | Serinus canaria 9135 | AAG\|GTAATGCAGA...TTTATTTTAATT/TTTATTTTAATT...AACAG\|GTC | 0 | 1 | 5.744 |
| 182503758 | GT-AG | 0 | 1.000000099473604e-05 | 907 | rna-XM_009086910.3 32672002 | 5 | 128833963 | 128834869 | Serinus canaria 9135 | ATG\|GTGAGTCTTG...AAGTTTTTATTT/TGTGGTTTCATT...TATAG\|TTG | 1 | 1 | 8.826 |
| 182503759 | GT-AG | 0 | 1.000000099473604e-05 | 786 | rna-XM_009086910.3 32672002 | 6 | 128833054 | 128833839 | Serinus canaria 9135 | CAG\|GTAGAATGAG...GTGCTCTTATTG/AGTGCTCTTATT...TCTAG\|TGG | 1 | 1 | 11.069 |
| 182503760 | GT-AG | 0 | 1.000000099473604e-05 | 669 | rna-XM_009086910.3 32672002 | 7 | 128832082 | 128832750 | Serinus canaria 9135 | ATG\|GTAAGTTGTA...TTATTCATAACT/GGTTTATTCATA...TTTAG\|GAG | 1 | 1 | 16.594 |
| 182503761 | GT-AG | 0 | 1.000000099473604e-05 | 1036 | rna-XM_009086910.3 32672002 | 8 | 128829893 | 128830928 | Serinus canaria 9135 | TAG\|GTACAGCACT...TTTTCCTGATTT/ATTTTCCTGATT...TTCAG\|ATA | 2 | 1 | 37.619 |
| 182503762 | GT-AG | 0 | 1.000000099473604e-05 | 746 | rna-XM_009086910.3 32672002 | 9 | 128828655 | 128829400 | Serinus canaria 9135 | AGG\|GTAATGTATG...GTTGTGTTACTG/AGTTGTGTTACT...TTCAG\|AGA | 2 | 1 | 46.59 |
| 182503763 | GT-AG | 0 | 0.0085519827005634 | 1950 | rna-XM_009086910.3 32672002 | 10 | 128826550 | 128828499 | Serinus canaria 9135 | AAG\|GTACACTTTA...TATTTTTTTTTT/CATAAATTCACT...TTCAG\|AGC | 1 | 1 | 49.416 |
| 182503764 | GT-AG | 0 | 1.000000099473604e-05 | 442 | rna-XM_009086910.3 32672002 | 11 | 128826025 | 128826466 | Serinus canaria 9135 | CAG\|GTCAGTAAAA...TTTTTCTTTTCT/TATAGGCTCATT...AAAAG\|AAT | 0 | 1 | 50.93 |
| 182503765 | GT-AG | 0 | 3.872261331109352e-05 | 135 | rna-XM_009086910.3 32672002 | 12 | 128825796 | 128825930 | Serinus canaria 9135 | AAG\|GTATGGCTGA...TATTTCTTTTCT/AAAAATGTTATT...TACAG\|GTC | 1 | 1 | 52.644 |
| 182503766 | GT-AG | 0 | 2.895887305906092e-05 | 586 | rna-XM_009086910.3 32672002 | 13 | 128824665 | 128825250 | Serinus canaria 9135 | CAG\|GTATAACTGA...TTTATCATAAAA/TATTTTATCATA...TGCAG\|GTA | 0 | 1 | 62.582 |
| 182503767 | GT-AG | 0 | 0.0463025960274715 | 731 | rna-XM_009086910.3 32672002 | 14 | 128823698 | 128824428 | Serinus canaria 9135 | AAG\|GTATTTTTGT...AGTTTCTTAGAT/AGCTTTTTCACA...TTCAG\|CAA | 2 | 1 | 66.885 |
| 182503768 | GT-AG | 0 | 0.000165980353596 | 1481 | rna-XM_009086910.3 32672002 | 15 | 128821964 | 128823444 | Serinus canaria 9135 | CAG\|GTATTTGCTA...TCTTTTTTACTA/TTCTTTTTTACT...TTTAG\|GAT | 0 | 1 | 71.499 |
| 182503769 | GT-AG | 0 | 0.0001072666420768 | 1241 | rna-XM_009086910.3 32672002 | 16 | 128820505 | 128821745 | Serinus canaria 9135 | GAG\|GTATGGTATT...CTCTTTTTGTTT/TATGTACTAATA...AACAG\|CTC | 2 | 1 | 75.474 |
| 182503770 | GT-AG | 0 | 1.000000099473604e-05 | 555 | rna-XM_009086910.3 32672002 | 17 | 128819817 | 128820371 | Serinus canaria 9135 | CTG\|GTAAGCGAAT...GAGTACTTACTC/TGAGTACTTACT...TTTAG\|GGG | 0 | 1 | 77.899 |
| 182503771 | GT-AG | 0 | 1.000000099473604e-05 | 1655 | rna-XM_009086910.3 32672002 | 18 | 128818009 | 128819663 | Serinus canaria 9135 | ATG\|GTAAAAGAAT...TATTCCTTCTCT/AAAATTCTGACT...CATAG\|GAT | 0 | 1 | 80.689 |
| 182503772 | GT-AG | 0 | 1.000000099473604e-05 | 255 | rna-XM_009086910.3 32672002 | 19 | 128817593 | 128817847 | Serinus canaria 9135 | CAG\|GTAAAATGGT...TGGTTCATAACC/TTGTGGTTCATA...TGTAG\|GAC | 2 | 1 | 83.625 |
| 182503773 | GT-AG | 0 | 1.000000099473604e-05 | 932 | rna-XM_009086910.3 32672002 | 20 | 128816564 | 128817495 | Serinus canaria 9135 | ATG\|GTAAAATGAT...ACTGTTTTGATT/ACTGTTTTGATT...TGAAG\|GTA | 0 | 1 | 85.394 |
| 182503774 | GT-AG | 0 | 1.000000099473604e-05 | 144 | rna-XM_009086910.3 32672002 | 21 | 128816253 | 128816396 | Serinus canaria 9135 | AAG\|GTAAGTTAAC...GTTACATTAGCT/CATTAGCTAACT...GACAG\|TGG | 2 | 1 | 88.439 |
| 182503775 | GT-AG | 0 | 0.0017811721290048 | 889 | rna-XM_009086910.3 32672002 | 22 | 128815032 | 128815920 | Serinus canaria 9135 | AAG\|GTATGCTGGT...GTTTGCTCAACA/TGTTTGCTCAAC...TCTAG\|GAA | 1 | 1 | 94.493 |
| 182503776 | GT-AG | 0 | 1.1967186352936977e-05 | 458 | rna-XM_009086910.3 32672002 | 23 | 128814430 | 128814887 | Serinus canaria 9135 | CAG\|GTACATGGTA...CTTTTCTTTGCT/TTTTTTCCAACT...TTAAG\|GCT | 1 | 1 | 97.119 |
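The listing above is the result of filtering introns on transcript_id = 32672002 and sorting by id. A minimal sketch of an equivalent query follows, using Python's standard `sqlite3` module against an in-memory table that holds only a subset of the columns. The first three inserted rows are copied from the table; the fourth row (id 999) is made up to show that the filter excludes other transcripts.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced version of the introns table: only the columns this example needs.
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, transcript_id INTEGER, ordinal_index INTEGER, "
    'start INTEGER, "end" INTEGER, phase INTEGER)'
)
sample = [
    (182503754, 32672002, 1, 128836401, 128839045, 0),
    (182503755, 32672002, 2, 128836041, 128836284, 2),
    (182503756, 32672002, 3, 128835567, 128835953, 2),
    (999, 1, 1, 100, 200, 0),  # made-up row belonging to a different transcript
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample)

# Same filter and sort order as the listing: one transcript, ordered by id.
rows = conn.execute(
    "SELECT id, ordinal_index, start FROM introns "
    "WHERE transcript_id = ? ORDER BY id",
    (32672002,),
).fetchall()

for intron_id, ordinal, start in rows:
    print(intron_id, ordinal, start)
```

In these rows start decreases as ordinal_index increases, which would be consistent with a transcript annotated on the reverse strand; the page itself does not state the strand.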
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
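SQLite only enforces FOREIGN KEY clauses like the ones above when `PRAGMA foreign_keys = ON` is set on the connection. The sketch below illustrates that behavior with stripped-down stand-ins for the three tables: only the key columns named in the schema are included, and the remaining columns of transcripts and genomes are not known from this page. Whether the published database is served with enforcement enabled is not stated here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Foreign keys are off by default in SQLite; enable them for this connection.
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript(
    """
    -- Minimal stand-ins: only the referenced key columns are modeled.
    CREATE TABLE transcripts (id INTEGER PRIMARY KEY);
    CREATE TABLE genomes (taxonomy_id INTEGER PRIMARY KEY);
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER REFERENCES transcripts(id),
        taxonomy_id INTEGER REFERENCES genomes(taxonomy_id)
    );
    INSERT INTO transcripts VALUES (32672002);
    INSERT INTO genomes VALUES (9135);
    """
)

# Valid row: both parent keys exist.
conn.execute("INSERT INTO introns VALUES (182503754, 32672002, 9135)")

# Invalid row: transcript 42 is a made-up id with no parent row.
try:
    conn.execute("INSERT INTO introns VALUES (1, 42, 9135)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

intron_count = conn.execute("SELECT COUNT(*) FROM introns").fetchone()[0]
print(rejected, intron_count)
```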
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
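The index on transcript_id is what lets a filter such as transcript_id = 32672002 avoid a full table scan. A rough way to confirm this is `EXPLAIN QUERY PLAN`, sketched below on an empty in-memory table with a reduced column set and two of the indexes above. The exact wording of the plan text varies between SQLite versions, so the check only looks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Reduced introns table with two of the indexes from the schema.
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        score REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_score ON introns (score);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (32672002,),
).fetchall()

# The last column of each plan row is the human-readable detail string.
details = [row[-1] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in details)
print(details, uses_index)
```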