DomBennett/Project-cluster


Project-cluster

Identify and count clusters across a series of .sam files.

Usage

python run.py --help

Install

git clone https://github.com/DomBennett/Project-cluster.git

Or download the zipped folder:

wget https://github.com/DomBennett/Project-cluster/archive/master.zip

Requirements

  • One .sam file stored per folder
  • cdhit
  • Python (v2 or v3)

Steps

  • Convert .sam to .fasta by extracting the orthologous sequence identified within the .sam file.
  • Run cdhit
  • Count clusters containing more than min_nsqs sequences
  • Report number of clusters per .sam in a .csv
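
The steps above can be sketched roughly as follows. This is an illustrative outline, not the code in run.py: the function names (sam_to_fasta, count_clusters, write_report) and the CSV column names are hypothetical. It assumes that the sequence of interest is the SEQ column of each SAM alignment line, and that clusters are read from the .clstr file that CD-HIT writes alongside its output. Running CD-HIT itself is left out.

```python
import csv
import io


def sam_to_fasta(sam_lines):
    """Yield (header, sequence) FASTA records from SAM alignment lines.

    Assumption: the sequence to extract is the SEQ column (field 10).
    """
    for line in sam_lines:
        if line.startswith("@") or not line.strip():
            continue  # skip SAM header lines and blank lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:
            continue  # not a complete alignment record
        name, seq = fields[0], fields[9]
        if seq == "*":
            continue  # SAM uses "*" when no sequence is stored
        yield ">" + name, seq


def count_clusters(clstr_lines, min_nsqs):
    """Count clusters in a CD-HIT .clstr file with more than min_nsqs members."""
    sizes = []
    for line in clstr_lines:
        if line.startswith(">Cluster"):
            sizes.append(0)  # a new cluster begins
        elif line.strip() and sizes:
            sizes[-1] += 1  # every other non-blank line is one member
    return sum(1 for size in sizes if size > min_nsqs)


def write_report(counts, handle):
    """Write one row per .sam file (hypothetical column names)."""
    writer = csv.writer(handle)
    writer.writerow(["sam_file", "n_clusters"])
    for sam_file, n_clusters in sorted(counts.items()):
        writer.writerow([sam_file, n_clusters])


# Small in-memory example; real input would come from files on disk.
example_clstr = [
    ">Cluster 0\n",
    "0\t4nt, >r1... *\n",
    "1\t4nt, >r2... at +/100.00%\n",
    "2\t4nt, >r3... at +/100.00%\n",
    ">Cluster 1\n",
    "0\t4nt, >r4... *\n",
]
report = io.StringIO()
write_report({"example.sam": count_clusters(example_clstr, 2)}, report)
```

Here only the first cluster has more than two members, so the report holds a single row giving a count of 1 for example.sam.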

Authors

D.J. Bennett & J.S. Eriksson
