Skip to content

A small python library to process sequencing data.

License

Notifications You must be signed in to change notification settings

houruiyan/pyseqlib

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pySeqLib

A small python library to process sequencing data. It is under active development.

What's inside?

  • intronX: a program to fetch sequence related features for introns: 1) length, 2) splice site motif strength, 3) second structure energy 4) frequecy of k-mers. See manual and example.
  • pymfold: a python wrap of mfold for calculate energy of RNA secondary structure. We got installation issue from mfold, similar as this. Therefore, we recommend using pyRNAfold instead.
  • pyRNAfold: a python wrap of for another predictor, RNAfold, of RNA secondary structure energy. It is from the ViennaRNA package, which can be installed from the source code, or from BioConda.
  • motif-score: a motif score calculate
  • a sequence mapper to find lariat in RNA-seq reads (under test)
  • a sam file resampling method

How to install?

Pyseqlib was initially developed in Python 2 environment, hence best to be used in Py2 environment. By using Anaconda platform, no matter Py2 or Py 3, it is easy to set up a conda environment with Py2, for example by following commond:

conda create -n Py2 python=2.7 numpy==1.15.4 pysam==0.15.2 Cython==0.29.3 matplotlib==2.2.3
source activate Py2

Once you are in a Python 2 environment, you can download the codes from this github repository and then run the following command line:

python setup.py install

If you don't have the root permission, add --user.

Rquired libraries: pysam, numpy, Cython.

About

A small python library to process sequencing data.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 58.0%
  • Jupyter Notebook 41.7%
  • Shell 0.3%