Yale-LILY/DYLE

DYLE

Source code for the ACL 2022 paper "DYLE: Dynamic Latent Extraction for Abstractive Long-Input Summarization"

Dependencies

Install dependencies via:

conda create -n dyle python=3.9.6
conda activate dyle
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge
pip install nltk==3.6.2 pyrouge==0.1.3 transformers==4.8.1 rouge==1.0.0 datasets==1.11.0
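
After installation, it can help to confirm that the environment matches the pinned versions. The following is a minimal sketch, not part of the repository: the helper names `installed_version` and `find_mismatches` are hypothetical, and it assumes the conda PyTorch build registers itself under the distribution name `torch`.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the install commands above.
# "torch" is the assumed distribution name of the conda pytorch package.
EXPECTED = {
    "torch": "1.8.0",
    "nltk": "3.6.2",
    "pyrouge": "0.1.3",
    "transformers": "4.8.1",
    "rouge": "1.0.0",
    "datasets": "1.11.0",
}


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def find_mismatches(expected, lookup=installed_version):
    """Map each package whose version differs from its pin to (wanted, found)."""
    mismatches = {}
    for package, wanted in expected.items():
        found = lookup(package)
        # Some builds carry a local suffix such as "1.8.0+cu111"; ignore it.
        if found is None or found.split("+")[0] != wanted:
            mismatches[package] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for package, (wanted, found) in find_mismatches(EXPECTED).items():
        print(f"{package}: expected {wanted}, found {found or 'not installed'}")
```

Run inside the activated dyle environment, the script prints nothing when every pin matches.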

Folder Structure

  • dataloaders: Python scripts that convert the original datasets to a uniform format
  • oracle: Scripts that generate extractive oracles
  • utils: Utility functions, such as text cleaning and ROUGE scoring
  • Experiment.py: Main file for our model
  • config.py: Model configuration
  • Modules: Implementation of our dynamic extraction module
  • test.py: Runs the model on the test set
  • train.py: Trains the model

Training and Evaluation

Download the Datasets and Models

Training the Model

  • After cleaning the datasets and processing the oracles, set up the paths in the scripts at dataloaders/*.py
  • Set the self.target_task flag in config.py to choose the target task
  • Train the model with the command:
python train.py
  • A reproduction of the code using the PyTorch Lightning library can be found at this link

Evaluation

  • First, download the checkpoint from Google Drive and place the folders under ./outputs/saved_model/
  • Set the self.target_task flag in config.py to choose the target task
  • Run the test set with the command:
python test.py
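
Before running test.py, it may be worth checking that the downloaded checkpoint folders are where the evaluation step expects them. The sketch below is an illustration only: `list_checkpoint_folders` is a hypothetical helper that is not part of the repository, and it assumes nothing about the names or contents of the checkpoint folders.

```python
from pathlib import Path

# Location named in the evaluation steps above.
SAVED_MODEL_DIR = Path("./outputs/saved_model")


def list_checkpoint_folders(root=SAVED_MODEL_DIR):
    """Return the sorted names of sub-folders under root (empty if root is missing)."""
    root = Path(root)
    if not root.is_dir():
        return []
    return sorted(p.name for p in root.iterdir() if p.is_dir())


if __name__ == "__main__":
    folders = list_checkpoint_folders()
    if folders:
        print("Found checkpoint folders:", ", ".join(folders))
    else:
        print(f"No checkpoint folders under {SAVED_MODEL_DIR}; download them before running test.py")
```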

Citation

@inproceedings{mao2021dyle,
 title={DYLE: Dynamic Latent Extraction for Abstractive Long-Input Summarization},
 author={Mao, Ziming and Wu, Chen Henry and Ni, Ansong and Zhang, Yusen and Zhang, Rui and Yu, Tao and Deb, Budhaditya and Zhu, Chenguang and Awadallah, Ahmed H and Radev, Dragomir},
 booktitle={ACL 2022},
 year={2022}
}
