This project is a lie detection system based on multimodal data (text, audio, and video). It extracts features from each modality and fuses them with deep learning models to judge the truthfulness of spoken content.


guotaogithub/TextAndImageAssignment


Multimodal Lie Detection System

This project is an assignment for the Artificial Intelligence major at Taylor's University.

Project Description

This project aims to detect whether a person is lying by analyzing three modalities: audio, video, and text. Features are extracted from each modality and combined through feature fusion strategies, and a deep learning model is trained on the fused features to classify a statement as truthful or deceptive.
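
The README does not say which fusion strategies the project uses, so the following is only a minimal sketch of two common options, written in plain Python so it runs without PyTorch. Early fusion concatenates the per-modality feature vectors before a single model sees them; late fusion combines the probabilities produced by separate per-modality models. The function names, the toy feature values, and the weights are all assumptions made for illustration and are not taken from the repository.

```python
from typing import Dict, List


def early_fusion(features: Dict[str, List[float]]) -> List[float]:
    """Concatenate per-modality feature vectors in sorted modality order.

    A fixed order matters: the downstream model expects each feature
    to sit at the same index for every sample.
    """
    fused: List[float] = []
    for modality in sorted(features):
        fused.extend(features[modality])
    return fused


def late_fusion(probabilities: Dict[str, float],
                weights: Dict[str, float]) -> float:
    """Weighted average of per-modality probabilities (hypothetical weights)."""
    total_weight = sum(weights[m] for m in probabilities)
    weighted_sum = sum(weights[m] * probabilities[m] for m in probabilities)
    return weighted_sum / total_weight


# Toy, made-up feature vectors for one sample (not real extractor output).
sample = {"audio": [0.2, 0.7], "text": [0.1], "video": [0.9, 0.4, 0.3]}
fused = early_fusion(sample)

# Toy per-modality probabilities that the statement is deceptive.
probs = {"audio": 0.8, "text": 0.6, "video": 0.4}
weights = {"audio": 1.0, "text": 1.0, "video": 2.0}
score = late_fusion(probs, weights)
```

In a PyTorch model, early fusion would typically be a tensor concatenation along the feature dimension, followed by shared layers; late fusion keeps one sub-network per modality and merges only their outputs.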


Key Features

Multimodal Data Loader
Audio Feature Extraction
Text Feature Extraction
Visual Feature Extraction
Model Training with Fusion Strategies
Feature Importance Analysis
Visualization of Results
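
The feature list above names feature importance analysis without stating the method. One widely used, model-agnostic option is permutation importance: shuffle one feature column at a time and measure how much accuracy drops. The sketch below is an assumption about how such an analysis could look, not the repository's implementation; `toy_predict` and the data are hypothetical stand-ins for a trained model and real features.

```python
import random
from typing import Callable, List, Sequence


def accuracy(predict: Callable[[Sequence[float]], int],
             rows: Sequence[Sequence[float]],
             labels: Sequence[int]) -> float:
    """Fraction of rows whose predicted label matches the true label."""
    correct = sum(1 for row, y in zip(rows, labels) if predict(row) == y)
    return correct / len(labels)


def permutation_importance(predict: Callable[[Sequence[float]], int],
                           rows: Sequence[Sequence[float]],
                           labels: Sequence[int],
                           n_repeats: int = 20,
                           seed: int = 0) -> List[float]:
    """Mean accuracy drop per feature when that feature's column is shuffled."""
    rng = random.Random(seed)
    baseline = accuracy(predict, rows, labels)
    importances: List[float] = []
    for j in range(len(rows[0])):
        drops = []
        for _ in range(n_repeats):
            column = [row[j] for row in rows]
            rng.shuffle(column)
            # Rebuild the rows with only column j replaced by shuffled values.
            permuted = [list(row[:j]) + [value] + list(row[j + 1:])
                        for row, value in zip(rows, column)]
            drops.append(baseline - accuracy(predict, permuted, labels))
        importances.append(sum(drops) / n_repeats)
    return importances


# Hypothetical stand-in for a trained model: it only looks at feature 0.
def toy_predict(row: Sequence[float]) -> int:
    return 1 if row[0] > 0.5 else 0


rows = [[0.9, 0.1], [0.8, 0.7], [0.2, 0.3], [0.1, 0.9], [0.7, 0.5], [0.3, 0.2]]
labels = [1, 1, 0, 0, 1, 0]
importances = permutation_importance(toy_predict, rows, labels)
```

Because `toy_predict` ignores feature 1, shuffling that column never changes a prediction and its importance is zero, while shuffling feature 0 lowers accuracy. Applied per modality (shuffling all features of one modality together), the same idea gives a rough view of which modality the fused model relies on.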

Tech Stack

  • Python 3.x
  • PyTorch – Deep Learning Framework
  • NumPy/Pandas – Data Processing
  • Matplotlib/Seaborn – Data Visualization
  • Git + GitHub – Version Control and Collaborative Development

📁 Key Files

Filename                     Description
DataConfig.py                Data path configuration class
Main.py                      Main program entry point
ModelTrainer.py              Model training and fusion logic
MultimodalDataLoader.py      Multimodal data loading and processing
TextFeatureExtractor.py      Text feature extraction module
VisualFeatureExtractor.py    Visual feature extraction module
AudioFeatureExtractor.py     Audio feature extraction module

📦 Dataset Structure

dataset/
├── Clips/              # Video clips
│   ├── false/
│   └── true/
├── Transcription/      # Text transcripts
│   ├── false/
│   └── true/
├── audio/              # Audio files
│   ├── false/
│   └── true/
└── Annotation/         # Annotation files
    └── annotation.csv
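
To make the layout concrete, here is a minimal sketch of how the three modality folders could be indexed into per-sample records. It assumes that the video, transcript, and audio files for one sample share the same file name stem, which the README does not state; `index_dataset` and `MODALITY_DIRS` are hypothetical names, not part of `MultimodalDataLoader.py`. The label is taken directly from the `true/` or `false/` folder name, and `annotation.csv` is ignored because its columns are not documented here.

```python
from pathlib import Path
from typing import Dict, List

# Folder names come from the dataset layout; the modality keys are illustrative.
MODALITY_DIRS = {"video": "Clips", "text": "Transcription", "audio": "audio"}
LABELS = ("true", "false")


def index_dataset(root: Path) -> List[Dict[str, object]]:
    """Return one record per sample that has a file in every modality folder.

    Assumption: files belonging to the same sample share a stem,
    for example Clips/true/x.* and audio/true/x.*.
    """
    root = Path(root)
    samples: List[Dict[str, object]] = []
    for label in LABELS:
        by_stem: Dict[str, Dict[str, Path]] = {}
        for modality, dirname in MODALITY_DIRS.items():
            folder = root / dirname / label
            if not folder.is_dir():
                continue  # tolerate a missing folder instead of raising
            for path in sorted(folder.iterdir()):
                if path.is_file():
                    by_stem.setdefault(path.stem, {})[modality] = path
        for stem in sorted(by_stem):
            files = by_stem[stem]
            if len(files) == len(MODALITY_DIRS):  # skip incomplete samples
                samples.append({"id": stem, "label": label, **files})
    return samples


samples = index_dataset(Path("dataset"))
```

Samples missing any modality are skipped rather than padded, which keeps the fused feature vectors the same length; a real loader might instead log or impute them.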

How to Use

  1. Install dependencies:

    pip install torch numpy pandas matplotlib seaborn
  2. Place your dataset in the dataset/ directory.

  3. Run the main program:

    python Main.py
  4. The program will automatically load multimodal data, train fusion models, and generate feature analysis charts.


Team Members

Name
Xiao Changhe
Guo Tao
Kan YiMing
Zhang ZhiAng
Zheng YaXin

License

Released under the MIT License. Commercial use is allowed; please retain the original author attribution.


Contact

For questions or collaboration opportunities, please contact:
guotao2beijing@gmail.com
