qbatch

python program for serial farming jobs on compute clusters

[画像:CoBrALab logo]

Source Code Changelog

Suggest Changes

Popularity

1.4

Growing

Activity

0.0

Stable

Stars 33

Watchers 3

Forks 13

Last Commit 5 months ago

Description

Execute shell command lines in parallel on SGE/PBS clusters

qbatch is a tool for executing commands in parallel across a compute cluster. It takes as input a list of commands (shell command lines or executable scripts) in a file or piped to qbatch. The list of commands are divided into arbitrarily sized chunks which are submitted as jobs to the cluster either as individual submissions or an array. Each job runs the commands in its chunk in parallel. Commands can also be run locally on systems with no cluster capability.

Code Quality Rank: L5

Programming language: Python

License: The Unlicense

Tags: Command-line Tools Productivity Tools Cluster System Utilities Clustering Distributed Computing

Latest version: v2.2.1

qbatch alternatives and similar packages

Based on the "Productivity Tools" category.
Alternatively, view qbatch alternatives based on common mentions on social networks and blogs.

thefuck

9.9 1.7 L5 qbatch VS thefuck

Magnificent app which corrects your previous console command.

nvbn logo
httpie

9.7 6.6 L3 qbatch VS httpie

🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more.

httpie logo

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.

Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

Promo getstream.io

[画像:Stream Logo]

cookiecutter

9.5 6.8 L5 qbatch VS cookiecutter

A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.

cookiecutter logo
aws-cli

9.4 9.8 L3 qbatch VS aws-cli

Universal Command Line Interface for Amazon Web Services

aws logo
pgcli

8.9 6.8 L3 qbatch VS pgcli

Postgres CLI with autocompletion and syntax highlighting

dbcli logo
mycli

8.8 9.4 L3 qbatch VS mycli

A Terminal Client for MySQL with AutoCompletion and Syntax Highlighting.

dbcli logo
howdoi

8.8 3.4 L4 qbatch VS howdoi

instant coding answers via the command line

gleitz logo
HTTP Prompt

8.5 0.0 L4 qbatch VS HTTP Prompt

An interactive command-line HTTP and API testing client built on top of HTTPie featuring autocomplete, syntax highlighting, and more. https://twitter.com/httpie

httpie logo
SAWS

7.8 0.0 L5 qbatch VS SAWS

A supercharged AWS command line interface (CLI).

donnemartin logo
PathPicker

7.7 4.0 L4 qbatch VS PathPicker

PathPicker accepts a wide range of input -- output from git commands, grep results, searches -- pretty much anything. After parsing the input, PathPicker presents you with a nice UI to select which files you're interested in. After that you can open them in your favorite editor or execute arbitrary commands.

facebook logo
percol

6.8 0.0 L4 qbatch VS percol

adds flavor of interactive filtering to the traditional pipe concept of UNIX shell

mooz logo
doitlive

6.7 6.7 L5 qbatch VS doitlive

Because sometimes you need to do it live

sloria logo
copier

6.6 9.5 qbatch VS copier

Library and command-line utility for rendering projects templates.

copier-org logo
bashplotlib

5.7 0.0 L3 qbatch VS bashplotlib

plotting in the terminal

glamp logo
Torrench

4.3 0.8 qbatch VS Torrench

DISCONTINUED. Command-line torrent search program (cross-platform)
try

3.9 0.0 L5 qbatch VS try

Dead simple CLI tool to try Python packages - It's never been easier! :package:

timofurrer logo
caniusepython3

3.3 0.0 L4 qbatch VS caniusepython3

DISCONTINUED. Can I Use Python 3?
PuePy

2.9 8.8 qbatch VS PuePy

Python+Webassembly Frontend Framework via PyScript

kkinder logo
SubGrab

2.0 0.0 qbatch VS SubGrab

SubGrab is a utility that allows you to automate subtitles downloading for your media files.

RafayGhafoor logo
geojson-shave

1.6 5.5 qbatch VS geojson-shave

A command-line tool for reducing the size of GeoJSON files.

ben-nour logo
OhCrab! 🦀

1.0 6.9 qbatch VS OhCrab! 🦀

Fix your terminal commands with the power of Rust

luizvbo logo
workedon

1.0 7.2 qbatch VS workedon

Work tracking from your shell.

viseshrp logo
Focus Phase

0.8 0.0 qbatch VS Focus Phase

A simple yet powerful timer and time tracker from the command line. https://ammar1y.github.io/Focus-Phase/

ammar1y logo
Marlin

0.7 0.0 qbatch VS Marlin

DISCONTINUED. Swim between bookmarks in the Windows terminal

wilfredinni logo
autopexpect

0.7 0.0 qbatch VS autopexpect

autoexpect for pexpect

ianmiell logo

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

Do you think we are missing an alternative of qbatch or a related project?

Add another 'Productivity Tools' Package

InfluxDB – Built for High-Performance Time Series Workloads

featured www.influxdata.com

Popular Comparisons

SaaSHub - Software Alternatives and Reviews

featured www.saashub.com

README

qbatch

Execute shell command lines in parallel on Slurm, S(on) of Grid Engine (SGE), PBS/Torque clusters

Travis CI build status

qbatch is a tool for executing commands in parallel across a compute cluster. It takes as input a list of commands (shell command lines or executable scripts) in a file or piped to qbatch. The list of commands are divided into arbitrarily sized chunks which are submitted as jobs to the cluster either as individual submissions or an array. Each job runs the commands in its chunk in parallel according to cores. Commands can also be run locally on systems with no cluster capability via gnu-paralel.

qbatch can also be used within python using the qbatch.qbatchParser and qbatch.qbatchDriver functions. qbatchParser will accept a list of command line options identical to the shell interface, parse, and submit jobs. The qbatchDriver interface will accept key-value pairs corresponding to the outputs of the argument parser, and additionally, the task_list option, providing a list of strings of commands to run.

Installation

$ pip install qbatch

Dependencies

qbatch requires python (>2.7) and GNU Parallel. For Torque/PBS and gridengine clusters, qbatch requires the qsub and qstat commands. For Slurm workload manager, qbatch requires the sbatch and squeue commands.

Environment variable defaults

qbatch supports several environment variables to customize defaults for your local system.

$ export QBATCH_PPJ=12 # requested processors per job
$ export QBATCH_CHUNKSIZE=$QBATCH_PPJ # commands to run per job
$ export QBATCH_CORES=$QBATCH_PPJ # commonds to run in parallel per job
$ export QBATCH_NODES=1 # number of compute nodes to request for the job, typically for MPI jobs
$ export QBATCH_MEM="0" # requested memory per job
$ export QBATCH_MEMVARS="mem" # memory request variable to set
$ export QBATCH_SYSTEM="pbs" # queuing system to use ("pbs", "sge","slurm", or "local")
$ export QBATCH_NODES=1 # (PBS-only) nodes to request per job
$ export QBATCH_SGE_PE="smp" # (SGE-only) parallel environment name
$ export QBATCH_QUEUE="1day" # Name of submission queue
$ export QBATCH_OPTIONS="" # Arbitrary cluster options to embed in all jobs
$ export QBATCH_SCRIPT_FOLDER=".qbatch/" # Location to generate jobfiles for submission
$ export QBATCH_SHELL="/bin/sh" # Shell to use to evaluate jobfile

Command line help

usage: qbatch [-h] [-w WALLTIME] [-c CHUNKSIZE] [-j CORES] [--ppj PPJ]
 [-N JOBNAME] [--mem MEM] [-q QUEUE] [-n] [-v] [--version]
 [--depend DEPEND] [-d WORKDIR] [--logdir LOGDIR] [-o OPTIONS]
 [--header HEADER] [--footer FOOTER] [--nodes NODES]
 [--sge-pe SGE_PE] [--memvars MEMVARS]
 [--pbs-nodes-spec PBS_NODES_SPEC] [-i]
 [-b {pbs,sge,slurm,local,container}] [--env {copied,batch,none}]
 [--shell SHELL]
 ...
Submits a list of commands to a queueing system. The list of commands can be
broken up into 'chunks' when submitted, so that the commands in each chunk run
in parallel (using GNU parallel). The job script(s) generated by qbatch are
stored in the folder .qbatch/
positional arguments:
 command_file An input file containing a list of shell commands to
 be submitted, - to read the command list from stdin or
 -- followed by a single command
optional arguments:
 -h, --help show this help message and exit
 -w WALLTIME, --walltime WALLTIME
 Maximum walltime for an array job element or
 individual job (default: None)
 -c CHUNKSIZE, --chunksize CHUNKSIZE
 Number of commands from the command list that are
 wrapped into each job (default: 1)
 -j CORES, --cores CORES
 Number of commands each job runs in parallel. If the
 chunk size (-c) is smaller than -j then only chunk
 size commands will run in parallel. This option can
 also be expressed as a percentage (e.g. 100%) of the
 total available cores (default: 1)
 --ppj PPJ Requested number of processors per job (aka ppn on
 PBS, slots on SGE, cpus per task on SLURM). Cores can
 be over subscribed if -j is larger than --ppj (useful
 to make use of hyper-threading on some systems)
 (default: 1)
 -N JOBNAME, --jobname JOBNAME
 Set job name (defaults to name of command file, or
 STDIN) (default: None)
 --mem MEM Memory required for each job (e.g. --mem 1G). This
 value will be set on each variable specified in
 --memvars. To not set any memory requirement, set this
 to 0 (default: 0)
 -q QUEUE, --queue QUEUE
 Name of queue to submit jobs to (defaults to no queue)
 (default: None)
 -n, --dryrun Dry run; Create jobfiles but do not submit or run any
 commands (default: False)
 -v, --verbose Verbose output (default: False)
 --version show program's version number and exit
advanced options:
 --depend DEPEND Wait for successful completion of job(s) with name
 matching given glob pattern or job id matching given
 job id(s) before starting (default: None)
 -d WORKDIR, --workdir WORKDIR
 Job working directory (default:
 current working directory)
 --logdir LOGDIR Directory to save store log files (default:
 {workdir}/logs)
 -o OPTIONS, --options OPTIONS
 Custom options passed directly to the queuing system
 (e.g --options "-l vf=8G". This option can be given
 multiple times (default: [])
 --header HEADER A line to insert verbatim at the start of the script,
 and will be run once per job. This option can be given
 multiple times (default: None)
 --footer FOOTER A line to insert verbatim at the end of the script,
 and will be run once per job. This option can be given
 multiple times (default: None)
 --nodes NODES (PBS and SLURM only) Nodes to request per job
 (default: 1)
 --sge-pe SGE_PE (SGE-only) The parallel environment to use if more
 than one processor per job is requested (default: smp)
 --memvars MEMVARS A comma-separated list of variables to set with the
 memory limit given by the --mem option (e.g.
 --memvars=h_vmem,vf) (default: mem)
 --pbs-nodes-spec PBS_NODES_SPEC
 (PBS-only) String to be inserted into nodes= line of
 job (default: None)
 -i, --individual Submit individual jobs instead of an array job
 (default: False)
 -b {pbs,sge,slurm,local,container}, --system {pbs,sge,slurm,local,container}
 The type of queueing system to use. 'pbs' and 'sge'
 both make calls to qsub to submit jobs. 'slurm' calls
 sbatch. 'local' runs the entire command list (without
 chunking) locally. 'container' creates a joblist and
 metadata file, to pass commands out of a container to
 a monitoring process for submission to a batch system.
 (default: local)
 --env {copied,batch,none}
 Determines how your environment is propagated when
 your job runs. "copied" records your environment
 settings in the job submission script, "batch" uses
 the cluster's mechanism for propagating your
 environment, and "none" does not propagate any
 environment variables. (default: copied)
 --shell SHELL Shell to use for spawning jobs and launching single
 commands (default: /bin/sh)

Some examples:

# Submit an array job from a list of commands (one per line)
# Generates a job script in ./.qbatch/ and job logs appear in ./logs/\
# All defaults are inherited from QBATCH_* environment variables
$ qbatch commands.txt
# Submit a single command to the cluster
$ qbatch -- echo hello
# Set the walltime for each job
$ qbatch -w 3:00:00 commands.txt
# Run 24 commands per job
$ qbatch -c24 commands.txt
# Pack 24 commands per job, run 12 in parallel at a time
$ qbatch -c24 -j12 commands.txt
# Start jobs after successful completion of existing jobs with names starting with "stage1_"
$ qbatch --afterok 'stage1_*' commands.txt
# Pipe a list of commands to qbatch
$ parallel echo process.sh {} ::: *.dat | qbatch -
# Run jobs locally with GNU Parallel, 12 commands in parallel
$ qbatch -b local -j12 commands.txt
# Many options don't make sense locally: chunking, individual vs array, nodes,
# ppj, highmem, and afterok are ignored

A python script example:

# Submit jobs to a cluster using the QBATCH_* environment defaults
import qbatch
task_list = ['echo hello', 'echo hello2']
qbatch.qbatchDriver(task_list = task_list)

Do not miss the trending, packages, news and articles with our weekly report.

Awesome Python is part of the LibHunt network. Terms. Privacy Policy.

(CC)

BY-SA

We recommend Spin The Wheel Of Names for a cryptographically secure random name picker.