home / WtMTA

introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)

38 rows where transcript_id = 32191397

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: dinucleotide_pair, score, phase

id ▼ dinucleotide_pair is_minor score length transcript_id ordinal_index start end taxonomy_id scored_motifs phase in_cds relative_position
179733065 GT-AG 0 0.0006270807868197 361 rna-XM_047131392.1 32191397 1 190406142 190406502 Schistocerca americana 7009 ATG|GTATGTATAC...TAAGTCTAAACA/TGTCAATTGATT...ACCAG|GCT 2 1 0.871
179733066 GT-AG 0 1.160956218272046e-05 31031 rna-XM_047131392.1 32191397 2 190406630 190437660 Schistocerca americana 7009 GTG|GTAAGTTGAA...GCATTTTTGATC/GCATTTTTGATC...TCCAG|CAC 0 1 2.746
179733067 GT-AG 0 4.04510558808364e-05 7938 rna-XM_047131392.1 32191397 3 190437877 190445814 Schistocerca americana 7009 CAG|GTACAATCAG...TATTTTTTAATG/TGTATTCTCATT...AACAG|TTC 0 1 5.934
179733068 GT-AG 0 0.006503751480704 7769 rna-XM_047131392.1 32191397 4 190446094 190453862 Schistocerca americana 7009 GAG|GTATATTGTT...CATGCTTTATAA/TTAATTTTCATT...CAAAG|AGG 0 1 10.053
179733069 GT-AG 0 1.000000099473604e-05 4296 rna-XM_047131392.1 32191397 5 190454040 190458335 Schistocerca americana 7009 AAG|GTGTGTAAGT...TTTTTCTTATTT/ATTTTTCTTATT...TTTAG|GAT 0 1 12.666
179733070 GT-AG 0 1.000000099473604e-05 14714 rna-XM_047131392.1 32191397 6 190458469 190473182 Schistocerca americana 7009 CAG|GTAAAACTGT...CTTGTTTTATTC/ACTTGTTTTATT...TGTAG|ATG 1 1 14.629
179733071 GT-AG 0 1.000000099473604e-05 85 rna-XM_047131392.1 32191397 7 190473311 190473395 Schistocerca americana 7009 AAT|GTAAGTGGAA...CATTTTATAATT/TTCATGTTCATT...TACAG|ATT 0 1 16.519
179733072 GT-AG 0 1.000000099473604e-05 24366 rna-XM_047131392.1 32191397 8 190473513 190497878 Schistocerca americana 7009 AAG|GTAATAAGAA...ACAGTGTTGAAC/TTGAACGTCATT...CATAG|GCT 0 1 18.246
179733073 GT-AG 0 0.08443381987335 11588 rna-XM_047131392.1 32191397 9 190498093 190509680 Schistocerca americana 7009 AAG|GTATCTCAAG...CATAACTTAACC/CATAACTTAACC...CATAG|ATC 1 1 21.405
179733074 GT-AG 0 1.000000099473604e-05 10213 rna-XM_047131392.1 32191397 10 190509964 190520176 Schistocerca americana 7009 CAG|GTGGGCCTTA...TTGTTTTTAGAT/TTTGTTTTTAGA...TTCAG|GTG 2 1 25.583
179733075 GT-AG 0 5.141201457638276e-05 139 rna-XM_047131392.1 32191397 11 190520353 190520491 Schistocerca americana 7009 ATG|GTAAATATTT...TTCCTTTTATTA/CCTTTTATTATT...TGCAG|TTT 1 1 28.181
179733076 GC-AG 0 1.000000099473604e-05 221 rna-XM_047131392.1 32191397 12 190520632 190520852 Schistocerca americana 7009 AAG|GCTAGTACAG...TTTATTGTAATT/AATTTATTTATT...TTCAG|TCT 0 1 30.248
179733077 GT-AG 0 1.000000099473604e-05 228 rna-XM_047131392.1 32191397 13 190521019 190521246 Schistocerca americana 7009 CAG|GTGAGATATT...TTTTCTTTACTT/TTTTTCTTTACT...TGCAG|CTC 1 1 32.699
179733078 GT-AG 0 1.000000099473604e-05 3049 rna-XM_047131392.1 32191397 14 190521388 190524436 Schistocerca americana 7009 CAG|GTGAGTTCTG...GTATCATTAGTT/TTTTGTATCATT...TATAG|AAG 1 1 34.78
179733079 GT-AG 0 1.0265569371442984e-05 13008 rna-XM_047131392.1 32191397 15 190524624 190537631 Schistocerca americana 7009 TTG|GTAAGTTACC...CTACTTTTATTC/ACTACTTTTATT...TTCAG|GTG 2 1 37.541
179733080 GT-AG 0 0.0015476878085221 35516 rna-XM_047131392.1 32191397 16 190537801 190573316 Schistocerca americana 7009 AAG|GTAACTATGC...CACACCTTAAAT/CTAATACTCACA...TACAG|GTA 0 1 40.035
179733081 GT-AG 0 5.293272428924629e-05 17551 rna-XM_047131392.1 32191397 17 190573688 190591238 Schistocerca americana 7009 CAG|GTAATTTTAG...TTATTTTTGAAT/GTGTTATTTATC...TGCAG|TCC 2 1 45.512
179733082 GT-AG 0 0.0001099663580414 212 rna-XM_047131392.1 32191397 18 190591412 190591623 Schistocerca americana 7009 GAG|GTAAACTAAT...TTTTTTTTACTG/ATTTTTTTTACT...TTCAG|CAA 1 1 48.066
179733083 GT-AG 0 1.000000099473604e-05 127 rna-XM_047131392.1 32191397 19 190591789 190591915 Schistocerca americana 7009 TAA|GTAAGGACTC...ACTTTTGTGACA/ACTTTTGTGACA...CAAAG|GTC 1 1 50.502
179733084 GT-AG 0 1.000000099473604e-05 175 rna-XM_047131392.1 32191397 20 190592117 190592291 Schistocerca americana 7009 ACG|GTGAGAGAGA...ATATTTTTATTC/TTTTTATTCACA...TACAG|CTA 1 1 53.469
179733085 GT-AG 0 0.0612903961700605 11858 rna-XM_047131392.1 32191397 21 190592486 190604343 Schistocerca americana 7009 CAG|GTAACCTTTA...TGATTATTGATT/TGATTATTGATT...TTCAG|GAA 0 1 56.333
179733086 GT-AG 0 1.000000099473604e-05 6685 rna-XM_047131392.1 32191397 22 190604475 190611159 Schistocerca americana 7009 AAG|GTATGAACTG...GACTCCATGAAA/TTTTGTTGTACC...TACAG|GTG 2 1 58.267
179733087 GT-AG 0 0.0001968442907273 1256 rna-XM_047131392.1 32191397 23 190611381 190612636 Schistocerca americana 7009 CAG|GTGTGCATTT...ACATCTTTAAAA/TTAAAAATTATT...TACAG|AAC 1 1 61.529
179733088 GT-AG 0 1.000000099473604e-05 13309 rna-XM_047131392.1 32191397 24 190612780 190626088 Schistocerca americana 7009 GAG|GTAAGAATAA...ATTACTTTGTTG/ATCTAGTTGATA...TGCAG|CAT 0 1 63.64
179733089 GT-AG 0 1.000000099473604e-05 1464 rna-XM_047131392.1 32191397 25 190626326 190627789 Schistocerca americana 7009 CAG|GTAAGTAATG...TTTTTCTTATTA/ATTTTTCTTATT...CACAG|CAA 0 1 67.139
179733090 GT-AG 0 1.000000099473604e-05 20968 rna-XM_047131392.1 32191397 26 190627919 190648886 Schistocerca americana 7009 CAG|GTAAAGTACA...TTATACTGAACA/ATGTTGCTAACC...TCCAG|GCA 0 1 69.043
179733091 GT-AG 0 0.0015283375831663 11711 rna-XM_047131392.1 32191397 27 190649092 190660802 Schistocerca americana 7009 TAA|GTATGTCCAC...GATTTCTAAATA/TGAATACTGATT...TACAG|TTG 1 1 72.07
179733092 GT-AG 0 1.000000099473604e-05 22320 rna-XM_047131392.1 32191397 28 190661013 190683332 Schistocerca americana 7009 ATG|GTAAATCAGT...GTATCTTTGTCT/CTTCAACTAAAT...AACAG|AAA 1 1 75.17
179733093 GT-AG 0 0.0041671135660701 91 rna-XM_047131392.1 32191397 29 190683497 190683587 Schistocerca americana 7009 CAG|GTAACTTTAC...ATTACCATGATT/TGATTTGTCATT...TGCAG|TTG 0 1 77.591
179733094 GT-AG 0 1.000000099473604e-05 78 rna-XM_047131392.1 32191397 30 190683786 190683863 Schistocerca americana 7009 CAG|GTGGGAATAT...TTGTTATTAATG/TTGTTATTAATG...AACAG|TAT 0 1 80.514
179733095 GT-AG 0 2.0674082601236973e-05 1158 rna-XM_047131392.1 32191397 31 190684022 190685179 Schistocerca americana 7009 CAG|GTATAAACCT...TATGCATTGAAT/CATAGTTTAATT...TTTAG|ATC 2 1 82.846
179733096 GT-AG 0 1.000000099473604e-05 95 rna-XM_047131392.1 32191397 32 190685307 190685401 Schistocerca americana 7009 CAG|GTAATTACAG...GTTGCCCTATCT/CACTTTCTCAAT...TTTAG|GTT 0 1 84.721
179733097 GT-AG 0 1.000000099473604e-05 8113 rna-XM_047131392.1 32191397 33 190685632 190693744 Schistocerca americana 7009 TTT|GTGAGTTTTT...ATATATTTATTT/TATTTACTAACT...TTTAG|GTA 2 1 88.116
179733098 GT-AG 0 1.000000099473604e-05 3802 rna-XM_047131392.1 32191397 34 190693893 190697694 Schistocerca americana 7009 AAG|GTAAGCTACT...CAGTCTGTGGCA/AGCGATATAATG...TCCAG|AGA 0 1 90.301
179733099 GT-AG 0 1.000000099473604e-05 106 rna-XM_047131392.1 32191397 35 190697789 190697894 Schistocerca americana 7009 TAG|GTAAGTCAAA...ATAACCTAATTA/TATAACCTAATT...ACTAG|GTT 1 1 91.689
179733100 GT-AG 0 1.000000099473604e-05 13654 rna-XM_047131392.1 32191397 36 190698086 190711739 Schistocerca americana 7009 AAG|GTAATGGCTT...GTTTCTCTAATA/GTTTCTCTAATA...TACAG|GAG 0 1 94.508
179733101 GT-AG 0 3.548902474590609e-05 9111 rna-XM_047131392.1 32191397 37 190711953 190721063 Schistocerca americana 7009 CAG|GTATGGATGA...ATACTTTTAAAG/AAAGCATTAACA...TGCAG|GCA 0 1 97.653
179733102 GT-AG 0 0.0001928973078361 7232 rna-XM_047131392.1 32191397 38 190721172 190728403 Schistocerca americana 7009 ATG|GTAACTGTCT...GTTTGTGTAACA/CTAAAGTTCACT...TCCAG|CCA 0 1 99.247

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
Powered by Datasette · Queries took 28.779ms · Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)