introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
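As a rough illustration of how these columns fit together, one row could be modelled in Python as below. This is only a sketch: the `Intron` class and its `terminal_dinucleotides` helper are hypothetical names, not part of the database, and the split on `-` assumes that `dinucleotide_pair` always has the form seen in this listing (for example `GT-AG`).

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Intron:
    """One row of the introns table; field names follow the column list."""
    id: int
    dinucleotide_pair: str      # e.g. "GT-AG"
    is_minor: int               # 1 = minor intron, 0 = not
    score: float                # probability of being minor, in percent
    length: int                 # base pairs
    transcript_id: int          # references transcripts(id)
    ordinal_index: int          # 1 for the first intron of the transcript
    start: int
    end: int
    taxonomy_id: int            # references genomes(taxonomy_id)
    scored_motifs: str
    phase: Optional[int]        # None (SQL null) outside the coding sequence
    in_cds: int                 # 1 = within the coding sequence
    relative_position: float    # percent of coding length

    def terminal_dinucleotides(self) -> Tuple[str, str]:
        """Split e.g. "GT-AG" into ("GT", "AG"); assumes a single '-' separator."""
        first, last = self.dinucleotide_pair.split("-")
        return first, last


# Values copied from the row with id 16590430 (transcript 3380103).
example = Intron(
    id=16590430, dinucleotide_pair="GT-AG", is_minor=0,
    score=1.000000099473604e-05, length=34253, transcript_id=3380103,
    ordinal_index=1, start=5045375, end=5079627, taxonomy_id=89386,
    scored_motifs="CAG|GTGGGCGGCG...GTTCTTTTATTC/CTTTTATTCATC...TTCAG|GTA",
    phase=2, in_cds=1, relative_position=3.551,
)
print(example.terminal_dinucleotides())
```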
22 rows where transcript_id = 3380103
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 16590430 | GT-AG | 0 | 1.000000099473604e-05 | 34253 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 1 | 5045375 | 5079627 | Ardeotis kori 89386 | CAG\|GTGGGCGGCG...GTTCTTTTATTC/CTTTTATTCATC...TTCAG\|GTA | 2 | 1 | 3.551 |
| 16590431 | GT-AG | 0 | 1.000000099473604e-05 | 5788 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 2 | 5079738 | 5085525 | Ardeotis kori 89386 | CCA\|GTAAGTGACA...TTCCCATTGATC/TTCCCATTGATC...CTAAG\|AAA | 1 | 1 | 6.834 |
| 16590432 | GT-AG | 0 | 1.000000099473604e-05 | 1692 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 3 | 5085730 | 5087421 | Ardeotis kori 89386 | GCT\|GTGAGTAGTT...AGTGTTGTATCT/GCTAAACTGAGT...TCTAG\|TGC | 1 | 1 | 12.922 |
| 16590433 | GT-AG | 0 | 1.000000099473604e-05 | 936 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 4 | 5087499 | 5088434 | Ardeotis kori 89386 | AGG\|GTAAGCAGTA...TGTACCTAAATA/TAAATATTCATA...TACAG\|GTT | 0 | 1 | 15.219 |
| 16590434 | GT-AG | 0 | 0.0004293059348307 | 1170 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 5 | 5088515 | 5089684 | Ardeotis kori 89386 | GAA\|GTAAGTTTGT...ACTTTTTTAAAA/TGTTAGCTTACT...TGCAG\|AAT | 2 | 1 | 17.607 |
| 16590435 | GT-AG | 0 | 1.000000099473604e-05 | 761 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 6 | 5089815 | 5090575 | Ardeotis kori 89386 | CAG\|GTATTGAAAC...CACCTCATGACT/AGTGAGCTCACC...TTTAG\|AGC | 0 | 1 | 21.486 |
| 16590436 | GT-AG | 0 | 1.000000099473604e-05 | 1048 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 7 | 5090723 | 5091770 | Ardeotis kori 89386 | AAC\|GTGAGTAATG...GTTTTTCTATTC/TGATGACTAAGT...CTCAG\|TTT | 0 | 1 | 25.873 |
| 16590437 | GT-AG | 0 | 1.000000099473604e-05 | 4118 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 8 | 5091845 | 5095962 | Ardeotis kori 89386 | AAG\|GTAATTCTTC...GCTTTCTTTTTG/CGTGGAATAACT...TGCAG\|AAT | 2 | 1 | 28.081 |
| 16590438 | GT-AG | 0 | 6.384694694838747e-05 | 1480 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 9 | 5096206 | 5097685 | Ardeotis kori 89386 | GAG\|GTAGGTTCCT...GCTGCTTTAGCT/CTTTAGCTCATC...CCTAG\|ATT | 2 | 1 | 35.333 |
| 16590439 | GT-AG | 0 | 9.593685383968787e-05 | 2385 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 10 | 5097776 | 5100160 | Ardeotis kori 89386 | GAG\|GTATGTAACC...GATTTTCTGACT/GATTTTCTGACT...TTCAG\|TCT | 2 | 1 | 38.019 |
| 16590440 | GT-AG | 0 | 0.0002758116329269 | 4156 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 11 | 5100691 | 5104846 | Ardeotis kori 89386 | AAG\|GTATGTAACA...TTTTCATTAACA/CTTATTTTCATT...GATAG\|GTA | 1 | 1 | 53.835 |
| 16590441 | GT-AG | 0 | 0.0011401962109524 | 1788 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 12 | 5104956 | 5106743 | Ardeotis kori 89386 | AAG\|GTAGACATCA...AAGTCCTTAATT/AATTGTCTGATT...TTCAG\|GCC | 2 | 1 | 57.087 |
| 16590442 | GT-AG | 0 | 1.000000099473604e-05 | 1070 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 13 | 5106825 | 5107894 | Ardeotis kori 89386 | GAG\|GTAAGCGCTA...ACTTCTCTAATA/GCTTTTTTTATG...TTCAG\|ATA | 2 | 1 | 59.505 |
| 16590443 | GT-AG | 0 | 1.000000099473604e-05 | 734 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 14 | 5107982 | 5108715 | Ardeotis kori 89386 | AGT\|GTAAGTGGGG...AATGCTTTTTTC/GTAGTACTGACT...AATAG\|GAA | 2 | 1 | 62.101 |
| 16590444 | GT-AG | 0 | 1.000000099473604e-05 | 787 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 15 | 5108909 | 5109695 | Ardeotis kori 89386 | TTG\|GTTAGTATTA...TTCATTTTAGTA/TAGTGGTTCATT...AATAG\|GGT | 0 | 1 | 67.86 |
| 16590445 | GT-AG | 0 | 1.000000099473604e-05 | 603 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 16 | 5109849 | 5110451 | Ardeotis kori 89386 | ACA\|GTAAGAAAAA...AGCCTCTTGATC/TGTATCTTCACA...CTCAG\|GAC | 0 | 1 | 72.426 |
| 16590446 | GT-AG | 0 | 1.000000099473604e-05 | 793 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 17 | 5110553 | 5111345 | Ardeotis kori 89386 | CTG\|GTAGGTAACA...GGTTCAATAACA/GGTTCAATAACA...AACAG\|GCA | 2 | 1 | 75.44 |
| 16590447 | GT-AG | 0 | 1.000000099473604e-05 | 671 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 18 | 5111494 | 5112164 | Ardeotis kori 89386 | AAG\|GTAAAGTCAA...CGGGCATTTACA/CGGGCATTTACA...TTCAG\|GTG | 0 | 1 | 79.857 |
| 16590448 | GT-AG | 0 | 1.000000099473604e-05 | 1452 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 19 | 5112330 | 5113781 | Ardeotis kori 89386 | CGG\|GTAAGGAAAA...TTTGTTTTATGA/GTGGGTTTCACT...TTCAG\|CTG | 0 | 1 | 84.781 |
| 16590449 | GT-AG | 0 | 1.000000099473604e-05 | 1432 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 20 | 5113875 | 5115306 | Ardeotis kori 89386 | CGG\|GTAAAAACAT...TAGTTCTCATCA/ATAGTTCTCATC...TCTAG\|CCT | 0 | 1 | 87.556 |
| 16590450 | GT-AG | 0 | 1.000000099473604e-05 | 623 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 21 | 5115538 | 5116160 | Ardeotis kori 89386 | CGA\|GTGAGTAACA...TTTCCATTATTT/TTTTCCATTATT...TTTAG\|GTG | 0 | 1 | 94.449 |
| 16590451 | GT-AG | 0 | 1.000000099473604e-05 | 532 | rna-gnl\|WGS:VWPR\|ARDKOR_R10111_mrna 3380103 | 22 | 5116283 | 5116814 | Ardeotis kori 89386 | CAG\|GTAATGACAG...CGTTTCTGGAAA/TAACAACTGATC...TTTAG\|AGA | 2 | 1 | 98.09 |
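In the rows above, `length` equals `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. That is an observation from this listing, not something the column descriptions state. The sketch below checks it on the first three rows and, assuming consecutive introns on this transcript are separated only by exons, derives the implied sizes of the intervening exons.

```python
# (ordinal_index, start, end, length) copied from the first three rows above.
rows = [
    (1, 5045375, 5079627, 34253),
    (2, 5079738, 5085525, 5788),
    (3, 5085730, 5087421, 1692),
]

# Inclusive coordinates: length should equal end - start + 1.
for ordinal, start, end, length in rows:
    assert end - start + 1 == length, f"intron {ordinal} does not match"

# Gap between one intron's end and the next intron's start.
# Assumption: the gap is a single exon with nothing else in between.
exon_lengths = [nxt[1] - cur[2] - 1 for cur, nxt in zip(rows, rows[1:])]
print(exon_lengths)  # → [110, 204]
```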
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
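To illustrate how the `transcript_id` index supports a listing such as "rows where transcript_id = 3380103", here is a minimal sketch using Python's built-in `sqlite3` module. It assumes a cut-down copy of the table (only six of the fourteen columns, no foreign keys) in an in-memory database; two rows are copied from the listing and the third is a made-up placeholder so that the filter has something to exclude.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE "introns" (
  "id" INTEGER PRIMARY KEY,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
""")

conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)",
    [
        (16590430, 34253, 3380103, 1, 5045375, 5079627),  # from the listing
        (16590431, 5788, 3380103, 2, 5079738, 5085525),   # from the listing
        (1, 100, 42, 1, 1000, 1099),  # made-up row for another transcript
    ],
)

query = (
    "SELECT id, ordinal_index FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
result = conn.execute(query, (3380103,)).fetchall()

# The last column of each EXPLAIN QUERY PLAN row is a human-readable step.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (3380103,)).fetchall()
uses_index = any("idx_introns_transcript_id" in row[-1] for row in plan)

print(result)
print(uses_index)
```

The same pattern should apply to the other single-column indexes (`taxonomy_id`, `phase`, `is_minor`, `dinucleotide_pair`, `score`, `in_cds`), although the exact plan text varies between SQLite versions.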