introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)
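
The rows listed on this page are consistent with start and end being inclusive coordinates, so that length = end - start + 1. This is an observation from the sample rows, not a documented guarantee of the database. A minimal sketch of that check, using three (start, end, length) triples copied from the table and a hypothetical helper name `inclusive_length`:

```python
# (start, end, length) copied from three rows of the introns table on this page.
rows = [
    (2799727, 2799888, 162),
    (2800060, 2800148, 89),
    (2809638, 2809742, 105),
]


def inclusive_length(start: int, end: int) -> int:
    """Length in base pairs of the closed interval [start, end]."""
    return end - start + 1


# Assumption being tested: both endpoints are counted as part of the intron.
for start, end, length in rows:
    assert inclusive_length(start, end) == length
```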

29 rows where transcript_id = 35103490

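Foreign-key columns show the linked record's label followed by its id (transcript_id: rna-XM_007047236.2 35103490; taxonomy_id: Theobroma cacao 3641).

id dinucleotide_pair is_minor score length transcript_id ordinal_index start end taxonomy_id scored_motifs phase in_cds relative_position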
197657323 GT-AG 0 1.000000099473604e-05 162 rna-XM_007047236.2 35103490 1 2799727 2799888 Theobroma cacao 3641 AAG|GTAATCGGTA...TGTATCATGATC/AAATGGCTAACT...TTTAG|GTC 0 1 24.806
197657324 GT-AG 0 8.467143462426364e-05 89 rna-XM_007047236.2 35103490 2 2800060 2800148 Theobroma cacao 3641 CAT|GTAAGTTGTT...TTGTACTTGATG/TGATGCTTCACT...TCAAG|GAT 0 1 28.497
197657325 GT-AG 0 1.000000099473604e-05 136 rna-XM_007047236.2 35103490 3 2800233 2800368 Theobroma cacao 3641 AAG|GTTTGCCAAT...TTATTCTGATTC/GTTATTCTGATT...TTCAG|AGG 0 1 30.311
197657326 GT-AG 0 1.000000099473604e-05 239 rna-XM_007047236.2 35103490 4 2800444 2800682 Theobroma cacao 3641 AAG|GTTAATCACT...GTTACCTTATGT/TTCGTCCTCATT...TCCAG|GAT 0 1 31.93
197657327 GT-AG 0 1.000000099473604e-05 877 rna-XM_007047236.2 35103490 5 2800748 2801624 Theobroma cacao 3641 CAG|GTTTAAGTAG...GTTTTTTTACTA/GGTTTTTTTACT...TGCAG|TGC 2 1 33.333
197657328 GT-AG 0 1.000000099473604e-05 105 rna-XM_007047236.2 35103490 6 2801716 2801820 Theobroma cacao 3641 AGG|GTGAGGTTCT...CTTATGTTGACA/CTTATGTTGACA...GATAG|ATC 0 1 35.298
197657329 GT-AG 0 1.000000099473604e-05 670 rna-XM_007047236.2 35103490 7 2802010 2802679 Theobroma cacao 3641 AAG|GTTAGTTGAA...GCTGTTTTGACC/GCTGTTTTGACC...TGCAG|ATA 0 1 39.378
197657330 GT-AG 0 1.000000099473604e-05 117 rna-XM_007047236.2 35103490 8 2802844 2802960 Theobroma cacao 3641 CAG|GTAAGTTATA...TATATTTTAGTG/TTATATTTTAGT...TGCAG|TGA 2 1 42.919
197657331 GT-AG 0 1.000000099473604e-05 474 rna-XM_007047236.2 35103490 9 2803076 2803549 Theobroma cacao 3641 CAG|GTGGGGGATC...TTGCCTCTAACC/ATTGTGCTAATA...TGCAG|GCT 0 1 45.402
197657332 GT-AG 0 0.0001904949493619 92 rna-XM_007047236.2 35103490 10 2803730 2803821 Theobroma cacao 3641 GAG|GTATTTCAAT...CTTTGCTTATTA/CCTTTGCTTATT...CTCAG|GTT 0 1 49.288
197657333 GT-AG 0 0.003900974159124 359 rna-XM_007047236.2 35103490 11 2803936 2804294 Theobroma cacao 3641 CTT|GTAGGTTCTG...ATGCTTTTAATT/ATGCTTTTAATT...TTCAG|ATT 0 1 51.749
197657334 GT-AG 0 1.000000099473604e-05 82 rna-XM_007047236.2 35103490 12 2804415 2804496 Theobroma cacao 3641 CAG|GTGAGTTCTG...TTTGTCTCAATA/ATTTGTCTCAAT...TTCAG|GGT 0 1 54.339
197657335 GT-AG 0 1.000000099473604e-05 151 rna-XM_007047236.2 35103490 13 2804815 2804965 Theobroma cacao 3641 CAG|GTTAATAATG...AGTCTTTTAATG/TCTAATCTGATT...TACAG|GAA 0 1 61.205
197657336 GT-AG 0 1.000000099473604e-05 714 rna-XM_007047236.2 35103490 14 2805059 2805772 Theobroma cacao 3641 GAG|GTAAAGATAT...TTTGTTTTACAT/ATTTGTTTTACA...TGTAG|TTG 0 1 63.212
197657337 GT-AG 0 1.000000099473604e-05 257 rna-XM_007047236.2 35103490 15 2805892 2806148 Theobroma cacao 3641 TAG|GTTTGTGTGT...TGTAGTTTGATA/TGTAGTTTGATA...TGCAG|TAT 2 1 65.782
197657338 GT-AG 0 1.000000099473604e-05 124 rna-XM_007047236.2 35103490 16 2806222 2806345 Theobroma cacao 3641 CAA|GTAAGAGAGA...TATTGTTTATTT/ATATTGTTTATT...TGCAG|GGA 0 1 67.358
197657339 GT-AG 0 0.0360733242498312 103 rna-XM_007047236.2 35103490 17 2806403 2806505 Theobroma cacao 3641 GAG|GTATTCTCTT...GATATCATATCC/AATGTATTAAGC...GGCAG|GTA 0 1 68.588
197657340 GT-AG 0 1.000000099473604e-05 525 rna-XM_007047236.2 35103490 18 2806599 2807123 Theobroma cacao 3641 GAG|GTCAGTTTCT...TTTGTGTTACTC/TTTTGTGTTACT...TCTAG|GTA 0 1 70.596
197657341 GT-AG 0 1.708596748469965e-05 71 rna-XM_007047236.2 35103490 19 2807247 2807317 Theobroma cacao 3641 GAG|GTAATTTTGA...ATAACCTTTTTG/CTCTAACTAATG...TTCAG|GTT 0 1 73.251
197657342 GT-AG 0 0.000209283328818 127 rna-XM_007047236.2 35103490 20 2807416 2807542 Theobroma cacao 3641 AGG|GTATGTAATA...GTCACTTTGGCT/GGGTTACTGAGC...CACAG|GTC 2 1 75.367
197657343 GT-AG 0 1.000000099473604e-05 233 rna-XM_007047236.2 35103490 21 2807589 2807821 Theobroma cacao 3641 AAG|GTAATTACCA...CTGCTTGTAATG/AAGTTGCTGACT...AACAG|TCA 0 1 76.36
197657344 GT-AG 0 1.000000099473604e-05 112 rna-XM_007047236.2 35103490 22 2807936 2808047 Theobroma cacao 3641 CAG|GTGATTTCAA...ATTGCTTGAACT/CTTGAACTGATT...TCTAG|GTG 0 1 78.821
197657345 GT-AG 0 0.6704037309828028 136 rna-XM_007047236.2 35103490 23 2808132 2808267 Theobroma cacao 3641 CAG|GTATCTTTAT...TTCATCTCAATC/TCTTCTGTCATT...GATAG|GGG 0 1 80.635
197657346 GT-AG 0 1.000000099473604e-05 100 rna-XM_007047236.2 35103490 24 2808472 2808571 Theobroma cacao 3641 AAG|GTAATACCTT...ATGTTCTTATTT/AATGTTCTTATT...TCCAG|TTT 0 1 85.039
197657347 GT-AG 0 0.0027141930839722 104 rna-XM_007047236.2 35103490 25 2808649 2808752 Theobroma cacao 3641 GAG|GTAACTATTT...TCTCCCATGAAA/ATTTCTCCCATG...TGCAG|ATA 2 1 86.701
197657348 GT-AG 0 0.0006324486814978 229 rna-XM_007047236.2 35103490 26 2808973 2809201 Theobroma cacao 3641 CAG|GTATTTATAA...ATCTACTTAAAT/AATCTACTTAAA...GTTAG|GTC 0 1 91.451
197657349 GT-AG 0 0.0002385535489951 97 rna-XM_007047236.2 35103490 27 2809256 2809352 Theobroma cacao 3641 ATG|GTAATCTGTC...GGTTTATTAGCA/AACGTTTTCACT...TTCAG|TGT 0 1 92.617
197657350 GT-AG 0 0.0004618998521843 90 rna-XM_007047236.2 35103490 28 2809473 2809562 Theobroma cacao 3641 AAG|GTTTCATGAC...TATTTGTTATTT/TTGTTATTTACC...TATAG|TAC 0 1 95.207
197657351 GT-AG 0 1.967810347761994e-05 105 rna-XM_007047236.2 35103490 29 2809638 2809742 Theobroma cacao 3641 AAG|GTTTTTATAT...GTTGTCGTGATC/ATTGAACTGATT...TGCAG|ATG 0 1 96.826
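
The filter behind this listing (transcript_id = 35103490) can be reproduced against a local copy of the data. The sketch below is only an illustration. It uses an in-memory SQLite database with a reduced set of columns, loads three rows copied from the table above, and asks for the highest-scoring intron of the transcript. The function name `top_scoring_intron` is hypothetical, and a real copy of the database would hold the full schema and all rows.

```python
import sqlite3

# In-memory stand-in for the real database file (assumption: reduced columns).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        score REAL,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
    """
)

# Three rows copied from the listing above: (id, score, transcript_id, ordinal_index).
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (197657323, 1.000000099473604e-05, 35103490, 1),
        (197657339, 0.0360733242498312, 35103490, 17),
        (197657345, 0.6704037309828028, 35103490, 23),
    ],
)


def top_scoring_intron(connection, transcript_id):
    """Return (id, ordinal_index, score) of the highest-scoring intron, or None."""
    return connection.execute(
        "SELECT id, ordinal_index, score FROM introns "
        "WHERE transcript_id = ? ORDER BY score DESC LIMIT 1",
        (transcript_id,),
    ).fetchone()


top = top_scoring_intron(conn, 35103490)
print(top)
```

With these three rows the query returns intron 23 (id 197657345), the highest-scoring intron in the listing. Its is_minor value is still 0.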

CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
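
The index on transcript_id is what lets SQLite answer a per-transcript filter like the one on this page without scanning the whole table. One way to confirm this locally is to ask SQLite for its query plan. The sketch below assumes a reduced version of the schema (three columns, no foreign keys) in an empty in-memory database. The exact wording of the plan text varies between SQLite versions, so the code only checks that the index name appears in it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced schema (assumption): only the columns needed to illustrate the index.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        score REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# EXPLAIN QUERY PLAN returns one row per plan step; the fourth column holds
# the human-readable description of that step.
plan = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT id, score FROM introns WHERE transcript_id = ?",
    (35103490,),
).fetchall()
plan_details = [row[3] for row in plan]

uses_index = any("idx_introns_transcript_id" in detail for detail in plan_details)
print(plan_details, uses_index)
```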