SyncNet

Fork Notice: This is a fork of the original SyncNet repository by Joon Son Chung, maintained by Colossyan.

This repository contains the demo for the audio-to-video synchronisation network (SyncNet). This network can be used for audio-visual synchronisation tasks including:

  1. Removing temporal lags between the audio and visual streams in a video;
  2. Determining who is speaking amongst multiple faces in a video.

Please cite the paper below if you make use of the software.

Installation

Option 1: Install from GitHub (Recommended)

pip install git+https://github.com/colossyan/syncnet-python.git

Option 2: Install from source

git clone https://github.com/colossyan/syncnet-python.git
cd syncnet-python
pip install -e .

Dependencies

pip install -r requirements.txt

In addition, ffmpeg is required.
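
To verify the environment before running the demo, a minimal sketch such as the following can check that the ffmpeg binary is reachable (this snippet is illustrative and not part of the package; it assumes ffmpeg is expected on your PATH):

import shutil
import subprocess

# Illustrative environment check: confirm the ffmpeg binary is on PATH.
ffmpeg_path = shutil.which('ffmpeg')
if ffmpeg_path is None:
    raise RuntimeError('ffmpeg not found on PATH; please install it first')

# Print the installed ffmpeg version for reference.
subprocess.run([ffmpeg_path, '-version'], check=True)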

Usage

As a Python Package

from syncnet_python import SyncNetInstance

# Initialize SyncNet
syncnet = SyncNetInstance()

# Load pre-trained model
syncnet.loadParameters('path/to/pretrained_model.pth')

# Options expected by evaluate()
class Args:
    def __init__(self):
        self.tmp_dir = '/tmp/syncnet'    # working directory for temporary files
        self.reference = 'test_video'    # name used for intermediate files
        self.batch_size = 20
        self.vshift = 10                 # maximum shift (in frames) searched for the offset

opt = Args()

# Evaluate a video
offset, conf, dists = syncnet.evaluate(opt, 'path/to/video.mp4')
print(f"Audio-Video offset: {offset}")
print(f"Confidence: {conf}")

Command Line Usage

SyncNet demo:

python demo_syncnet.py --videofile data/example.avi --tmp_dir /path/to/temp/directory

Check that this script returns:

AV offset: 3 
Min dist: 5.353
Confidence: 10.021

Full pipeline:

sh download_model.sh
python run_pipeline.py --videofile /path/to/video.mp4 --reference name_of_video --data_dir /path/to/output
python run_syncnet.py --videofile /path/to/video.mp4 --reference name_of_video --data_dir /path/to/output
python run_visualise.py --videofile /path/to/video.mp4 --reference name_of_video --data_dir /path/to/output
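
If you prefer to drive the pipeline from Python, for example over a folder of videos, a minimal wrapper around the same three scripts might look like the sketch below (the script names and flags are exactly those shown above; the wrapper itself is illustrative):

import subprocess

def run_full_pipeline(videofile, reference, data_dir):
    # Hypothetical wrapper: run the three pipeline scripts in sequence
    # for a single video, failing fast if any step errors.
    common = ['--videofile', videofile, '--reference', reference, '--data_dir', data_dir]
    for script in ('run_pipeline.py', 'run_syncnet.py', 'run_visualise.py'):
        subprocess.run(['python', script] + common, check=True)

run_full_pipeline('/path/to/video.mp4', 'name_of_video', '/path/to/output')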

Outputs:

$DATA_DIR/pycrop/$REFERENCE/*.avi - cropped face tracks
$DATA_DIR/pywork/$REFERENCE/offsets.txt - audio-video offset values
$DATA_DIR/pyavi/$REFERENCE/video_out.avi - output video
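
As a small illustrative sketch (assuming only the output layout listed above), the results for a given reference can be located programmatically:

import glob
import os

data_dir = '/path/to/output'   # same --data_dir passed to the pipeline
reference = 'name_of_video'    # same --reference passed to the pipeline

# Cropped face tracks produced by the pipeline
face_tracks = glob.glob(os.path.join(data_dir, 'pycrop', reference, '*.avi'))
print('Face tracks:', face_tracks)

# Raw contents of the offsets file
offsets_file = os.path.join(data_dir, 'pywork', reference, 'offsets.txt')
with open(offsets_file) as f:
    print(f.read())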

Publications

@InProceedings{Chung16a,
  author    = "Chung, J.~S. and Zisserman, A.",
  title     = "Out of time: automated lip sync in the wild",
  booktitle = "Workshop on Multi-view Lip-reading, ACCV",
  year      = "2016",
}
