introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
29 rows where transcript_id = 19079848
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 101754730 | GT-AG | 0 | 1.000000099473604e-05 | 1149 | rna-XM_042865969.1 19079848 | 2 | 1457587 | 1458735 | Lagopus leucura 30410 | GTA|GTAAGTAATG...ACCATTTTGTTT/TGTTTGTTCAAA...TGCAG|GCT | 0 | 1 | 0.827 | 
| 101754731 | GT-AG | 0 | 1.000000099473604e-05 | 897 | rna-XM_042865969.1 19079848 | 3 | 1456546 | 1457442 | Lagopus leucura 30410 | AAG|GTACTGTGTA...TTCTTTGTGACT/CTGGAGCTGATT...TGTAG|GAG | 0 | 1 | 1.82 | 
| 101754732 | GT-AG | 0 | 1.000000099473604e-05 | 848 | rna-XM_042865969.1 19079848 | 4 | 1455551 | 1456398 | Lagopus leucura 30410 | AAG|GTAAGTGTCT...ATGGTATTAAAG/TACCTATTTATG...TGCAG|GCA | 0 | 1 | 2.833 | 
| 101754733 | GT-AG | 0 | 1.000000099473604e-05 | 717 | rna-XM_042865969.1 19079848 | 5 | 1454711 | 1455427 | Lagopus leucura 30410 | AAG|GTGGTATTTT...GTTGTCTTTTCA/TGTCTTTTCATT...TTCAG|GAT | 0 | 1 | 3.681 | 
| 101754734 | GT-AG | 0 | 0.0446487860131852 | 339 | rna-XM_042865969.1 19079848 | 6 | 1454204 | 1454542 | Lagopus leucura 30410 | AAG|GTATGTTTCA...TTCTTCTTATCA/TTTTTCTTTATT...TAAAG|AAC | 0 | 1 | 4.839 | 
| 101754735 | GT-AG | 0 | 1.000000099473604e-05 | 733 | rna-XM_042865969.1 19079848 | 7 | 1453285 | 1454017 | Lagopus leucura 30410 | AAT|GTAAGTAAGC...CTTTTCTGAACA/ACTTTTCTGAAC...TTTAG|GTA | 0 | 1 | 6.121 | 
| 101754736 | GT-AG | 0 | 1.4644375447472136e-05 | 202 | rna-XM_042865969.1 19079848 | 8 | 1452960 | 1453161 | Lagopus leucura 30410 | CAG|GTGTGTTGCT...GATAGCTTGATT/TCTATATTGATA...ACTAG|CTA | 0 | 1 | 6.969 | 
| 101754737 | GT-AG | 0 | 1.000000099473604e-05 | 1674 | rna-XM_042865969.1 19079848 | 9 | 1451172 | 1452845 | Lagopus leucura 30410 | AAA|GTAAGAAATG...TTTTTTTTTTTT/AAATACTTCATA...AACAG|GAG | 0 | 1 | 7.754 | 
| 101754738 | GT-AG | 0 | 7.449084666470153 | 337 | rna-XM_042865969.1 19079848 | 10 | 1450513 | 1450849 | Lagopus leucura 30410 | AAG|GTATCCTATT...TATGCCTTGCTT/CCCTTACTGATA...TTTAG|TAC | 1 | 1 | 9.974 | 
| 101754739 | GT-AG | 0 | 0.0007732983639079 | 240 | rna-XM_042865969.1 19079848 | 11 | 1450163 | 1450402 | Lagopus leucura 30410 | AAT|GTATGTGCAG...CAAATTTTGATG/CAAATTTTGATG...TTTAG|TTT | 0 | 1 | 10.732 | 
| 101754740 | GT-AG | 0 | 1.000000099473604e-05 | 795 | rna-XM_042865969.1 19079848 | 12 | 1449250 | 1450044 | Lagopus leucura 30410 | ACG|GTAAAACAAG...TAATTCATGATT/TCAAACCTAATT...TGTAG|AAG | 1 | 1 | 11.545 | 
| 101754741 | GT-AG | 0 | 1.000000099473604e-05 | 931 | rna-XM_042865969.1 19079848 | 13 | 1448227 | 1449157 | Lagopus leucura 30410 | CAG|GTTAACTTTA...ATGTTCATAGTG/TATATTTTCATG...GTTAG|GCT | 0 | 1 | 12.179 | 
| 101754742 | TG-AA | 0 | 1.000000099473604e-05 | 3670 | rna-XM_042865969.1 19079848 | 14 | 1444405 | 1448074 | Lagopus leucura 30410 | AGG|TGGGTTGTTT...ATAATTGTAATT/TAGTGTCTGATA...ACAAA|ACT | 2 | 1 | 13.227 | 
| 101754743 | GT-AG | 0 | 1.000000099473604e-05 | 861 | rna-XM_042865969.1 19079848 | 15 | 1443390 | 1444250 | Lagopus leucura 30410 | CAG|GTGGGTTAAA...CTTCTTTTAATT/CTTCTTTTAATT...CGTAG|GAA | 0 | 1 | 14.289 | 
| 101754744 | GT-AG | 0 | 1.000000099473604e-05 | 844 | rna-XM_042865969.1 19079848 | 16 | 1442419 | 1443262 | Lagopus leucura 30410 | CAG|GTGTGATAGA...GATTTTTTAATA/CTTGAACTAATT...AATAG|GCT | 1 | 1 | 15.164 | 
| 101754745 | GT-AG | 0 | 1.000000099473604e-05 | 1470 | rna-XM_042865969.1 19079848 | 17 | 1440818 | 1442287 | Lagopus leucura 30410 | CAG|GTGAGCTTTT...ATCCTCTTGATT/TCTTGATTAACT...TGCAG|AAT | 0 | 1 | 16.067 | 
| 101754746 | GT-AG | 0 | 1.000000099473604e-05 | 286 | rna-XM_042865969.1 19079848 | 18 | 1440422 | 1440707 | Lagopus leucura 30410 | AGG|GTAAAATAAA...TGGAATTTAGTT/AATTTAGTTATC...CTCAG|GTT | 2 | 1 | 16.825 | 
| 101754747 | GT-AG | 0 | 0.0004061996230318 | 346 | rna-XM_042865969.1 19079848 | 19 | 1439907 | 1440252 | Lagopus leucura 30410 | AAC|GTAAGTTACT...TTATTTTTAATA/TTATTTTTAATA...CTAAG|CTG | 0 | 1 | 17.99 | 
| 101754748 | GT-AG | 0 | 0.0057278427814712 | 612 | rna-XM_042865969.1 19079848 | 20 | 1439190 | 1439801 | Lagopus leucura 30410 | AAG|GTAACTCATA...GTTTTCTTGATG/CTTTTAATTATT...TTTAG|ACT | 0 | 1 | 18.714 | 
| 101754749 | AT-AC | 0 | 0.2892917623907072 | 48 | rna-XM_042865969.1 19079848 | 21 | 1433480 | 1433527 | Lagopus leucura 30410 | AAG|ATCTTCAAAG...TTGGCTTGAATC/TTTGGCTTGAAT...AAAAC|ATC | 1 | 1 | 57.741 | 
| 101754750 | GT-AG | 0 | 0.0019407406315379 | 402 | rna-XM_042865969.1 19079848 | 22 | 1428087 | 1428488 | Lagopus leucura 30410 | AAG|GTAGCTTCTG...TATACTTTTCCT/GGTTTACTAATC...TGTAG|GCT | 0 | 1 | 92.142 | 
| 101754751 | GT-AG | 0 | 0.0001952480066252 | 738 | rna-XM_042865969.1 19079848 | 23 | 1427197 | 1427934 | Lagopus leucura 30410 | GAG|GTATAAATGT...AGTTTTTTATTT/TGTATTTTCATT...TTCAG|GAT | 2 | 1 | 93.19 | 
| 101754752 | GT-AG | 0 | 1.479694143005062e-05 | 2640 | rna-XM_042865969.1 19079848 | 24 | 1424397 | 1427036 | Lagopus leucura 30410 | AAG|GTAAACAGAA...AATTCCTTTACT/AATATACTGATA...TTAAG|GCT | 0 | 1 | 94.293 | 
| 101754753 | GT-AG | 0 | 0.002938369505842 | 1026 | rna-XM_042865969.1 19079848 | 25 | 1423218 | 1424243 | Lagopus leucura 30410 | AAG|GTAACTTAAA...TTTGCTTTATTG/TGTTTTTTGAAT...TTCAG|AAT | 0 | 1 | 95.347 | 
| 101754754 | GT-AG | 0 | 1.000000099473604e-05 | 1460 | rna-XM_042865969.1 19079848 | 26 | 1421590 | 1423049 | Lagopus leucura 30410 | CAG|GTAAATGAAC...TTTTTTATGACT/TTTTTTATGACT...AATAG|GAT | 0 | 1 | 96.505 | 
| 101754755 | GT-AG | 0 | 1.000000099473604e-05 | 307 | rna-XM_042865969.1 19079848 | 27 | 1421185 | 1421491 | Lagopus leucura 30410 | AAG|GTCAGAATTG...TCATTTTTCATT/TCATTTTTCATT...TATAG|GCT | 2 | 1 | 97.181 | 
| 101754756 | GT-AG | 0 | 5.714834294324658e-05 | 840 | rna-XM_042865969.1 19079848 | 28 | 1420210 | 1421049 | Lagopus leucura 30410 | CAG|GTAAACTGTA...ATTGTCTAACTT/CATTGTCTAACT...TCTAG|ATT | 2 | 1 | 98.111 | 
| 101754757 | GT-AG | 0 | 1.000000099473604e-05 | 770 | rna-XM_042865969.1 19079848 | 29 | 1419325 | 1420094 | Lagopus leucura 30410 | AAG|GTAAGTAAAT...TGCTTCTTAGGT/TAGGTATTTATG...ACCAG|GTG | 0 | 1 | 98.904 | 
| 101760708 | GT-AG | 0 | 1.000000099473604e-05 | 2985 | rna-XM_042865969.1 19079848 | 1 | 1458794 | 1461778 | Lagopus leucura 30410 | CAG|GTGAGTGGGG...GAGCCATTGATG/TGTTATTTCATT...CATAG|CCA | 0 | 0.517 | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);