
introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron within the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)
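
The column descriptions do not say whether start and end are inclusive. The rows listed for transcript 35103501 are consistent with closed intervals, that is, length = end - start + 1. The following is a minimal sketch that checks this for three rows copied from the table; the helper name `inclusive_length` is made up for illustration, and the inclusive-coordinate reading is an inference from the data, not a documented guarantee.

```python
# (start, end, length) copied from three rows of the introns table
# for transcript_id 35103501.
rows = [
    (37164655, 37165088, 434),
    (37165151, 37165360, 210),
    (37163617, 37164418, 802),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of the closed interval [start, end] (assumed convention)."""
    return end - start + 1


# Collect any row whose stored length disagrees with the assumed convention.
mismatches = [r for r in rows if inclusive_length(r[0], r[1]) != r[2]]
print(mismatches)
```

An empty list means the sampled rows agree with the assumption; it does not prove the convention holds for every row in the database.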

28 rows where transcript_id = 35103501
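
The filter above can be reproduced against a local SQLite copy of the database. The sketch below builds a reduced in-memory `introns` table (only five of the documented columns), loads three rows from this page plus one hypothetical row from another transcript, and runs the same `transcript_id` filter. Ordering by `ordinal_index` is a choice made here, not the page's default: the page sorts by `id`, which places the first intron (id 197671375) last.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced schema for illustration; the real table has more columns.
conn.execute(
    "CREATE TABLE introns ("
    " id INTEGER PRIMARY KEY, dinucleotide_pair TEXT, length INTEGER,"
    " transcript_id INTEGER, ordinal_index INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (197657513, "GT-AG", 434, 35103501, 2),
        (197657514, "GT-AG", 210, 35103501, 3),
        (197671375, "GT-AG", 802, 35103501, 1),
        (1, "GT-AG", 100, 99999999, 1),  # hypothetical row, other transcript
    ],
)

# Same filter as the page, but ordered by position within the transcript.
rows = conn.execute(
    "SELECT ordinal_index, id, length FROM introns"
    " WHERE transcript_id = ? ORDER BY ordinal_index",
    (35103501,),
).fetchall()
ordinals = [r[0] for r in rows]
print(rows)
```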


id dinucleotide_pair is_minor score length transcript_id (label, id) ordinal_index start end taxonomy_id (species, id) scored_motifs phase in_cds relative_position
197657513 GT-AG 0 1.000000099473604e-05 434 rna-XM_007052523.2 35103501 2 37164655 37165088 Theobroma cacao 3641 AAG|GTGAATTTGG...GCTTCTTTAGAT/GTTGGGCTCAAT...TGCAG|GTG 1 1 8.946
197657514 GT-AG 0 1.000000099473604e-05 210 rna-XM_007052523.2 35103501 3 37165151 37165360 Theobroma cacao 3641 CAA|GTGAGCCAAC...AAATCCTGAATT/TTATTGCTAACG...TGCAG|CTG 0 1 10.336
197657515 GT-AG 0 0.0002038932640245 74 rna-XM_007052523.2 35103501 4 37165551 37165624 Theobroma cacao 3641 CTG|GTATAGTAGC...TGGGCTCTAACC/TGGGCTCTAACC...GGCAG|CAT 1 1 14.596
197657516 GT-AG 0 1.000000099473604e-05 102 rna-XM_007052523.2 35103501 5 37165766 37165867 Theobroma cacao 3641 GTG|GTTTGGAATT...AGGATTTTGACT/AGGATTTTGACT...TTTAG|ATG 1 1 17.758
197657517 GT-AG 0 1.000000099473604e-05 108 rna-XM_007052523.2 35103501 6 37166025 37166132 Theobroma cacao 3641 AAG|GTAAAATGCT...TTTTTTTTAATG/TTTTTTTTAATG...GACAG|GCT 2 1 21.278
197657518 GT-AG 0 1.000000099473604e-05 110 rna-XM_007052523.2 35103501 7 37166327 37166436 Theobroma cacao 3641 AAG|GTAACAGAAT...TTTCCTTTTGTG/GTATCGTTTACC...TGAAG|CTT 1 1 25.628
197657519 GT-AG 0 2.325009191726608e-05 352 rna-XM_007052523.2 35103501 8 37166539 37166890 Theobroma cacao 3641 CTG|GTAAGTTTGA...TGATCCTTTTCC/CATAGTTTGATC...TTCAG|ATC 1 1 27.915
197657520 GT-AG 0 1.826832940031282e-05 675 rna-XM_007052523.2 35103501 9 37167027 37167701 Theobroma cacao 3641 AAG|GTACATAATT...TTCTGCTTATTA/TTTCTGCTTATT...GGCAG|GAT 2 1 30.964
197657521 GT-AG 0 1.000000099473604e-05 209 rna-XM_007052523.2 35103501 10 37167831 37168039 Theobroma cacao 3641 CAT|GTTAGTGTTC...TTATTCTTCTCT/TTCTCTATCATT...TTCAG|GTA 2 1 33.857
197657522 GT-AG 0 1.000000099473604e-05 143 rna-XM_007052523.2 35103501 11 37168440 37168582 Theobroma cacao 3641 CAG|GTGAAGCCAT...TTTTTCTAAAAT/GTTTTTCTAAAA...TGTAG|AGT 0 1 42.825
197657523 GT-AG 0 1.000000099473604e-05 267 rna-XM_007052523.2 35103501 12 37168688 37168954 Theobroma cacao 3641 AAG|GTGAGGAAAT...ACTTCTATATTT/CGGGTGCTGACT...TTTAG|GTG 0 1 45.179
197657524 GT-AG 0 1.000000099473604e-05 287 rna-XM_007052523.2 35103501 13 37169074 37169360 Theobroma cacao 3641 CAG|GTGATTGAAT...TCCCCCCTAATT/CCTAATTTCATT...TTTAG|GCA 2 1 47.848
197657525 GT-AG 0 6.005694539166183e-05 78 rna-XM_007052523.2 35103501 14 37169470 37169547 Theobroma cacao 3641 CAG|GTATAACATT...TGCTCCATATCT/CCATATCTGATG...TGAAG|AGG 0 1 50.291
197657526 GC-AG 0 1.000000099473604e-05 470 rna-XM_007052523.2 35103501 15 37169692 37170161 Theobroma cacao 3641 CAG|GCAAGGATCT...TTTTTCTTCCTG/ACTGTGCTCATG...CACAG|GGT 0 1 53.52
197657527 GT-AG 0 1.000000099473604e-05 86 rna-XM_007052523.2 35103501 16 37170288 37170373 Theobroma cacao 3641 GAG|GTGAGTCTCA...ATTTTCTTAGAT/GATTTTCTTAGA...ACCAG|GAG 0 1 56.345
197657528 GT-AG 0 0.0027964225167166 141 rna-XM_007052523.2 35103501 17 37170545 37170685 Theobroma cacao 3641 AAA|GTAAGCTTTA...GCCATTTTAACA/GATAAATTAATT...TACAG|GTT 0 1 60.179
197657529 GT-AG 0 1.000000099473604e-05 135 rna-XM_007052523.2 35103501 18 37170773 37170907 Theobroma cacao 3641 AAG|GTGATATCAT...TATTTCTTGTTA/ATGGTTATTACC...TGCAG|GAT 0 1 62.13
197657530 GT-AG 0 0.0039457846082912 293 rna-XM_007052523.2 35103501 19 37171040 37171332 Theobroma cacao 3641 AAA|GTATGTTTAA...CTGTATTTATTT/TCTGTATTTATT...TGCAG|GAG 0 1 65.09
197657531 GT-AG 0 9.90860217015156e-05 232 rna-XM_007052523.2 35103501 20 37171489 37171720 Theobroma cacao 3641 CAG|GTATTGATTT...AGTACATTAGTT/ATATGTATGATT...TGTAG|TTT 0 1 68.587
197657532 GT-AG 0 1.000000099473604e-05 428 rna-XM_007052523.2 35103501 21 37171790 37172217 Theobroma cacao 3641 AAG|GTAATAATGG...CTTACATTAATG/AATGTTCTGATT...TTCAG|GTG 0 1 70.135
197657533 GT-AG 0 2.050424887305456e-05 196 rna-XM_007052523.2 35103501 22 37172377 37172572 Theobroma cacao 3641 GAA|GTAAGCACTG...ATATTTGTGACA/TCGTGTTTCAAT...CTTAG|GGC 0 1 73.7
197657534 GT-AG 0 1.000000099473604e-05 85 rna-XM_007052523.2 35103501 23 37172630 37172714 Theobroma cacao 3641 CAG|GTAAGTTAAG...CTTTTCATGATC/CAGCTTTTCATG...TTCAG|GTC 0 1 74.978
197657535 GT-AG 0 0.0001721322157211 169 rna-XM_007052523.2 35103501 24 37172756 37172924 Theobroma cacao 3641 TAG|GTAGTTATCA...TCTGTCTTAATA/ATAGTTTTCACT...TTCAG|GTA 2 1 75.897
197657536 GC-AG 0 5.51321595019106e-05 160 rna-XM_007052523.2 35103501 25 37173045 37173204 Theobroma cacao 3641 TAG|GCATGATTTC...TAGGTCTTAATT/TAGGTCTTAATT...TGTAG|TTA 2 1 78.587
197657537 GC-AG 0 1.000000099473604e-05 82 rna-XM_007052523.2 35103501 26 37173515 37173596 Theobroma cacao 3641 CAG|GCATGTGTTA...ATTCATTTAACT/ATTTAACTGATT...CACAG|CAT 0 1 85.538
197657538 GT-AG 0 1.000000099473604e-05 147 rna-XM_007052523.2 35103501 27 37173705 37173851 Theobroma cacao 3641 CAG|GTAAGTGAAT...TTGTCCTTTGTG/AAGTTATTGATT...TTCAG|ATG 0 1 87.96
197657539 GT-AG 0 1.000000099473604e-05 430 rna-XM_007052523.2 35103501 28 37173924 37174353 Theobroma cacao 3641 AAG|GTACTTCAAT...TGGGTTTTAAAC/TTTTTGTTTACC...GACAG|GCC 0 1 89.574
197671375 GT-AG 0 1.000000099473604e-05 802 rna-XM_007052523.2 35103501 1 37163617 37164418 Theobroma cacao 3641 TAG|GTAATATGAT...GACTCTTTATAT/TATATGCTAATG...ATCAG|GTT   0 3.879
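
In the rows above, the `scored_motifs` string appears to follow the pattern `exon|intron excerpts|exon`, with the intron's first and last bases adjacent to the two `|` separators. Under that assumption (the layout is inferred from these rows and is not stated in the column description), the terminal dinucleotides can be read back from the string and compared with `dinucleotide_pair`. The function name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return e.g. 'GT-AG' from a scored_motifs string.

    Assumes the layout 'upstream_exon|intron_excerpts|downstream_exon',
    which is inferred from the example rows only.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError("expected exactly two '|' separators")
    intron = parts[1]
    # First two and last two characters of the intron excerpt.
    return f"{intron[:2]}-{intron[-2:]}"


# Two strings copied from the table (ordinal_index 2 and 15).
examples = {
    "AAG|GTGAATTTGG...GCTTCTTTAGAT/GTTGGGCTCAAT...TGCAG|GTG": "GT-AG",
    "CAG|GCAAGGATCT...TTTTTCTTCCTG/ACTGTGCTCATG...CACAG|GGT": "GC-AG",
}
results = {motifs: terminal_dinucleotides(motifs) for motifs in examples}
print(list(results.values()))
```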


CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
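
The index on `transcript_id` is what lets a filter such as `transcript_id = 35103501` avoid a full table scan. As a rough check, SQLite's `EXPLAIN QUERY PLAN` can be run against a local copy. The sketch below uses an empty, reduced in-memory table (three columns only, an assumption made to keep it short), so it shows which index the planner selects rather than any real timing.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced schema for illustration; index name matches the DDL above.
conn.executescript(
    """
    CREATE TABLE introns (
      id INTEGER PRIMARY KEY,
      transcript_id INTEGER,
      score REAL
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (35103501,),
).fetchall()

# The fourth column of each plan row is the human-readable detail string.
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

The exact wording of the plan varies between SQLite versions, so it is safer to look for the index name in the output than to match the whole string.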