Jayden

De novo assemble RNA-seq sequence

Intro

A research paper published on bioRxiv described a new coronavirus subgenus, and I wanted to find out whether there are any changes in its protease. However, the sequence data has not been published.

Fortunately, a similar sequence is available on NCBI. Unfortunately, it is only available as RNA-seq data.

So I need to assemble the RNA-seq reads first, and then BLAST the sequence I am interested in against the assembled contigs.

TL;DR

  1. Set up the environment with conda

  2. Fetch the data

  3. Check the quality of the raw data

  4. Run quality control with fastp

  5. Check the data quality again (post-cleaning data)

  6. Assemble with Trinity

  7. Check the Trinity result

  8. BLAST the sequence of interest

    1. Put your sequence in query.fasta.

    2. Make the BLAST database and run the search

  9. Check the BLAST result

  10. Extract the sequence from trinity.Trinity.fasta
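The steps above can be sketched as a dry run that only builds and prints the command lines, so nothing is executed until you have checked them. This is a rough sketch under several assumptions that are not in the post: the run accession `SRRXXXXXXX` is a placeholder, the reads are paired-end, the data is fetched with `fasterq-dump` from the SRA Toolkit, the quality check uses FastQC, and the environment name `denovo`, the thread and memory values, and the file names `trinity_db` and `blast_result.tsv` are made up for illustration. It also assumes a recent Trinity release that writes `trinity.Trinity.fasta` next to the `trinity` output directory.

```python
import shlex

# Hypothetical placeholder, not from the original post: use the real SRA run accession.
ACCESSION = "SRRXXXXXXX"


def build_pipeline(accession, threads=8, max_memory="16G", blast_program="blastn"):
    """Return the pipeline as (step, argv) pairs. Nothing is executed here."""
    # Assumes paired-end reads, which fasterq-dump splits into _1 and _2 files.
    raw = [f"{accession}_1.fastq", f"{accession}_2.fastq"]
    clean = [f"{accession}_1.clean.fastq", f"{accession}_2.clean.fastq"]
    assembly = "trinity.Trinity.fasta"
    return [
        ("1. environment", ["conda", "create", "-y", "-n", "denovo",
                            "-c", "conda-forge", "-c", "bioconda",
                            "sra-tools", "fastqc", "fastp", "trinity", "blast"]),
        ("2. fetch", ["fasterq-dump", "--split-files", accession]),
        ("3. quality check (raw)", ["fastqc", *raw]),
        ("4. fastp", ["fastp", "-i", raw[0], "-I", raw[1],
                      "-o", clean[0], "-O", clean[1]]),
        ("5. quality check (clean)", ["fastqc", *clean]),
        ("6. assemble", ["Trinity", "--seqType", "fq",
                         "--left", clean[0], "--right", clean[1],
                         "--CPU", str(threads), "--max_memory", max_memory,
                         "--output", "trinity"]),
        # Counting FASTA headers gives the number of assembled contigs.
        ("7. check assembly", ["grep", "-c", ">", assembly]),
        ("8. make database", ["makeblastdb", "-in", assembly,
                              "-dbtype", "nucl", "-out", "trinity_db"]),
        # blastn fits a nucleotide query; a protein query would need tblastn.
        ("8. search", [blast_program, "-query", "query.fasta", "-db", "trinity_db",
                       "-outfmt", "6", "-out", "blast_result.tsv"]),
    ]


for step, argv in build_pipeline(ACCESSION):
    print(f"# {step}")
    print(shlex.join(argv))
```

Run the printed commands inside the activated environment (`conda activate denovo`). If `query.fasta` holds a protein sequence, pass `blast_program="tblastn"` so the protein is searched against the translated contigs.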

Tail

  1. You can also BLAST with the predicted sequence

  2. Make the BLAST database and run the search
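Whichever database is searched, checking the result and extracting the matching sequence come down to reading the hit table and slicing the contig it points to. Below is a minimal sketch, assuming the search was run with BLAST's tabular output (`-outfmt 6` with its default 12 columns). The function names and the tiny demo data are hypothetical and only illustrate the idea.

```python
BLAST_FIELDS = ["qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
                "qstart", "qend", "sstart", "send", "evalue", "bitscore"]
INT_FIELDS = ("length", "mismatch", "gapopen", "qstart", "qend", "sstart", "send")
FLOAT_FIELDS = ("pident", "evalue", "bitscore")


def parse_blast_tabular(text):
    """Parse default -outfmt 6 rows into dicts, highest bitscore first."""
    hits = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        row = dict(zip(BLAST_FIELDS, line.split("\t")))
        for key in INT_FIELDS:
            row[key] = int(row[key])
        for key in FLOAT_FIELDS:
            row[key] = float(row[key])
        hits.append(row)
    return sorted(hits, key=lambda hit: hit["bitscore"], reverse=True)


def read_fasta(text):
    """Return {record id: sequence}; the id is the header up to the first space."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif line and name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def reverse_complement(seq):
    return seq.translate(str.maketrans("ACGTNacgtn", "TGCANtgcan"))[::-1]


def extract_hit(hit, contigs):
    """Slice the aligned region of the contig; sstart > send means minus strand."""
    seq = contigs[hit["sseqid"]]
    low, high = sorted((hit["sstart"], hit["send"]))
    region = seq[low - 1:high]  # BLAST coordinates are 1-based and inclusive
    return reverse_complement(region) if hit["sstart"] > hit["send"] else region


# Made-up demo data; real input would come from the BLAST output file
# and from trinity.Trinity.fasta.
demo_fasta = ">TRINITY_DN1_c0_g1_i1 len=12\nAAACCCGGGTTT\n"
demo_blast = "query1\tTRINITY_DN1_c0_g1_i1\t100.000\t6\t0\t0\t1\t6\t4\t9\t1e-10\t50.0\n"
best = parse_blast_tabular(demo_blast)[0]
print(best["sseqid"], extract_hit(best, read_fasta(demo_fasta)))
```

For real data, read both files with `open(path).read()` and pass the text to the same functions. To keep the whole contig rather than only the aligned region, look up `contigs[hit["sseqid"]]` directly.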

This post is synced to xLog by Mix Space.
Original link: https://xxu.do/posts/academic/De-novo-assemble-RNA-seq-sequence

