Final poster session



The poster session will be held at the AOERC -->from 12:15 PM to 3:15 PM on Tuesday, March 18th, 2025.

Previous Reports (Spring 2024)

Custom Projects

Project nameAuthors
AmzBERT: Enhanced Multi-Label Sentiment Classification for E-commerce Product ReviewsZack Seifert
Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic PromptsHoujun Liu
Patent Classification Using Large Language ModelsLuke Mizuhashi
Text as outcome: Topic models within a causal inference framework Juliette Coly
Mining Molecular Logics through Human Language: Predicting and Decoding Transcription Factor Logics on Gene Expression through LLM and transformerGyu (Gyuhyeon) Kim
From Infant to Toddler to Preschooler: Analyzing Language Acquisition in Language Models Yash Shah
Using Iterative Back-Translation to Improve Neural Poetry Translation Andrew Chen
Interpreting parking signs with lightweight large language models (LLMs) Uche Ochuba
Exploring Themes and Outliers in CFPB Consumer ComplaintsJonathan Hague
Intelligent Interactive Large Language Model Planner: Responsive Personalized HomeRobot Angel Zhang, Gadi Mark Sznaier Camps
Classification of clinical syndromes from patient-reported symptoms on social media Evan Maestri
Transfer learning in audio-based emotion detection: surprising generalizability and limitations Shunyu Yao
ReaL Stories: RL for Adaptive AI Storytelling Aditya Sood, Aniket Mahajan, Ayaan Chand
Tailor-Made or Off-the-Rack? Comparing Domain-Specific and General-Domain Language Models on a Financial NLP Task Irina Alexandra Marton
Cross attention for Text and Image Multimodal data fusion Dongyeong Kim
Comparative Analysis of Foundation Models for Hospital Integration Suhana Bedi, Miguel Fuentes
Active learning in DPO through gradient portfolio optimizationJosh Leib Kazdan, Ziang Song
GRAFT: Graph Retrieval Augmented Fine Tuning for Multi-Hop Query Summarization Sunny Yu, Natalia Kokoromyti, Sonya Shi Jin
Diverse LLM Approaches in Essay Scoring: A Comparative Exploration of Many-Shot Prompting, LLM Jury Panels, and Model Fine-Tuning Alexa Sparks, Matias Hoyl, Rizwaan Malik
Optimizing Large Language Models to Solve Crossword Puzzles Ishan Mehta, Andrew Lipschultz, Ohm Patel
KAN-based Distillation in Language Modeling Nick Mecklenburg
Handle With Care! A Mechanistic Case Study of DPO Out-of-Distribution Extrapolation Ryan Park
Punk or Funk: Understanding the Performance of RoBERTa on Music Genre Classification Andrew Bempong, Deveen Harischandra
Sparse Full-Rank MLPs for Increased Efficiency of Language Modeling Aaryan Singhal, Quinn McIntyre
How Important is the Truth? Rehaan Ahmad, Joseph Tan
Catch Me If You DAN: Outsmarting Prompt Injections and Jailbreak Schemes with Recollection Alice Guo, Grace Jin, Jenny Wei
Query based Multi-document Summarizer and Image SynthesizerGeeta Jakkamsetti
Finish Your Peas! Utilizing Multi-Label ImageClassification to Identify Food Items and Ingredients for Recipe Suggestions and Reducing Food Waste Arianna Damiani, Prashaant Ranganathan
Engagement-based response selection for open-domain dialogueMarcelo Peña
FlowState: Composing foundation models and retrieval for issue priority level prediction Alex Gilbert, Gustavs Zilgalvis
Beyond IID Constraints: A Novel Approach to Identity Preference Optimization Amirhossein Afsharrad
Intrinsic Systematicity Evaluation: Evaluating the intrinsic systematicity of LLMs Ayush Chakravarthy
Numerous Multi-Pivot and Chained Pivot NMT for Low-Resource Language Translation Cees Armstrong, Kevin Reso
Enhancing Language-Concordant Clinical Text Translation with Zero-shot NERIvan Lopez, Min Woo Sun
Better Call Sheared-LLaMA-2.7B: Optimized Summarization for Legal Documents Varun Madan, Arunima Srivastav
Adapting Listen, Attend, and Spell to Enhance Brain-Computer Interfaces for Speech Decoding Dylan Iskandar, Brian Ni, Vedant Singh
Narrative Detection Across Nations in Online Social Media Discourse Sungbin Kim, Khaled Messai, Vikram Srinivasan
The First Proteinbender: A Novel "Structure-based Protein Search Engine" Ethan Zhang, Saahil Sundaresan, Zane Chan
Investigating Language Model Cross-lingual Transfer for NLP Regression Tasks Through Contrastive Learning With LLM Augmentations Raghav Ganesh, Raj Palleti
Chinese Poem Generator with Prefix Control Yitong Lu
DeviceBERT: Applied Transfer Learning With Targeted Annotations and Vocabulary Enrichment to Identify Medical Device and Component Terminology in FDA Recall Summaries Miriam Farrington
L-LLM: Large Language LEGO Models Alex Wang, Calvin Laughlin
Adapting BERT to non-Western Dialects: A Case Study on Nigerian Pidgin English Slurs Sathvik Nori, Adrian Adegbesan
Words and Wins: Enhancing Game Play with LLM Fine-Tuning by RL Xuanzi Chen, Zhengjia Huang
From Preferences to Principles: Automated Principle Generation for Language Models William Fang, Vikram Sivashankar
From Lies to Insights: Expanding and Understanding the LIAR DatasetFelix Zhan
HieroLM: Egyptian Hieroglyph Recovery with Next Word Prediction Language Model Xuheng Cai, Erica Zhang
Analyzing Sophia's Gradient Distributions in Language Model PretrainingRaghav Kapoor
Investigating Improvement to English-Tigrinya Translation via Transfer Learning Over Varying Languages Abel Dagne, Sheden Andemicael
Quality or Quantity? Comparing Domain-Adaptive Pre-training Approaches for Language Models with Mathematical Understanding Christine Ye, Alexandre Acra
Knowledge-Enhanced Language Models: A Comparative Study of RAG and Embedding Methods Adarsh Ambati, Nikash Chhadia
Optimizing Language Models for Safe Online Discourse: Developing Metrics and Models for Detoxifying Internet Conversations Steven Li, Steven Le
Making Silicon Sing Kadija Ismail, Imen Kedir
Active Learning for Efficient NLP Training Daniel Lee, Thomas Yim, Ibrahim Dharhan
Character Understanding in Literary Texts: Leveraging TinyLlama for Advanced Character Analysis in the LiSCU Dataset Katherine Wong
arXivBot: A Large Language Model Chatbot That Has High Factuality and Coverage by Few-Shot Grounding on arXiv Xiaofeng Tang
SENTINEL: A Heterogeneous Ensemble Framework for Detecting AI-Generated Text in the Era of Advanced Language Models Natalie Cao, Haocheng Fan
Predicting Stock Market Trends from News Articles And Price Trends using Transformers Kasra Naftchi-Ardebili, Karanpartap Singh
Merging ‘Personas’ in Multi-Agent Systems of Language Models Andy Dai, Sriya Mantena
Critical Learning Periods for Second Language Acquisition in Neural Language ModelsDaniel Wurgaft, Jerome Han
Enhancing Practice Problem Retrieval with Deep Learning: A Rewriter-Retriever-Reranker Approach Charles Joyner, Ronny Junkins, Mack Smith
SceneGrounder: Natural Language Scene Descriptions and Retrieval Augmented Generation for 3D Visual TasksHuy Nguyen, James Brown
RubricEval: A Scalable Human-LLM Evaluation Framework for Open-Ended Tasks Vineel Bhat
Medical Named Entity Recognition and Relation Extraction from Clinical NotesAmeya Jadhav, Sreyana Kukadia
The Invisible Author: Mapping AI Penetration in News JournalismJun Wang, Andrew Zhang
Developing a GPT-Based Autonomous Agent With Novel Workflow Execution CapabilitiesKenny Lam, Vaishnav Garodia
Improving speech brain-computer interface with conversation context Brian Lee, Allison Tee
Negotiation Copilot: Exploring Ways to Build an AI Negotiation Assistant Winson Cheng, Abhinav Agarwal
AuRA (Automated Retrieval-Augmented Generation (RAG) System Development)Robby Manihani
KoWhisper: Efficient Bilingual Speech-to-Text for Edge DeploymentJason Park, Harshit Gupta
Enhancing AI Creativity: A Multi-Agent Approach to Flash Fiction Generation with Small Open-Source Models Alex Wang, Berwyn Berwyn, Jermaine Zhao
UltimateMedLLM-Llama3-8B: Fine-tuning Llama 3 for Medical Question-Answering Jayson Meribe, Sean Zhang
PROCEED: Performance Routing Optimization for Cost-Efficient and Effective Deployment Lichu Acuña, Odin Farkas
Improving Spanish-Mapudungun Translation through Transfer Learning Eban Ebssa
EDU-RAG: A RAG Benchmark with Web-enhanced Content in Education Domain. Will RAG Help AI Tutor?Xinxi Chen, Jingxu Gao
Mapping the Mind: Knowledge-Graph Augmented Retrieval Nicholas Vo
Learning Semantic Complexities of NYT Connections Emily Zhang, Yanan Jiang, Peixuan Ye
SuLaLoM: Structured Classification of Tabular Data with Large Language Models Su Kara
AdaVid: Adaptive Video-Language PretrainingChaitanya Patel
PragMaBERT: Analyzing Pragmatic Markers in Political Speech Matt Wise, Houda Nait El Barj
Robotic AssistEMT: An EMT Chatbot Aanika Atluri, Sarah Barragan, Anusheh Chaudry
Knowledge Distillation of Deep Language Models for Electrification Information Extraction from Building PermitsTony Liu
Shared Representation of Language in Broca’s Area and Large Language ModelsAlisa Levin, Benyamin Meschede-Krasa, Yun Hwang
ModelFusionJoong Kun Lee
PragMaBERT: Analyzing Pragmatic Markers in Political Speech Matt Wise, Houda Nait El Barj
Comparative study between addition of one MAMBA block to Wav2Vec2 Pretrained model and Vanilla Pretrained model Puchiss Panitpotjaman
Multi-Task Alignment Using Steering Vectors Charles Li, Nahum Maru
Project Oracle: Autoregressive Future Event Prediction with Sequential Modeling and Transformers Brian Wu, Katherine Wang, Ismail Mardin
DelT5: Dynamic Token Deletion for Efficient Byte-level Language Models Julie Kallini
Formally Verify Generated CodeLivia Sun
Fine-tuning Digital Agents with BAGEL Trajectories Alfred Yu, An Doan
Optimal Brain Projection: Neural Network Compression using Mixtures of Subspaces Daniel Garcia
Mistriply: Encoding Human Algorithmic Processes into LMs for Teaching and Computation Harviel Kyle Arcilla, Colette Do
Item Difficulty Modeling for a Sentence Reading Efficiency Task with Language Model Simulations Wanjing Anya Ma
Improving Speech-to-Text Brain-Computer Interface Performance with Neural Decoders and Large Language Models Laywood Fayne, Mohammad Rehan Ghori
Advancing Automated Content Moderation using Large Language ModelsHarshit Gupta, Sidhant Bansal, Sneha Jayaganthan
Leveraging Language Models for Multiclass Classification of Unfair Clauses in Terms of ServiceShaurnav (Joy) Ghosh, Shrish Janarthanan
Talk To Me, Your Virtual AI Therapist: Advancing AI-Driven Psychotherapeutic Engagement with Sentiment Analysis George Birikorang, Nathan Paek, Zoe Lynch
The impact of LLM pruning for fine-tuning Varun Shanker, Sarah Chung
Curriculum Learning with TinyStories Michail Christiaan Melonas
Beyond Single Commands: Evaluating LLMs on Multiple Instruction Sequences Sagnik Bhattacharya, Vaastav Arora, Prateek Varshney
FinRAG: A Retrieval-Based Financial Analyst Krrish Chawla, Allen Naliath
ClimateGrantLLM: Benchmarking grant recommendation engines for natural language descriptions of climate resilient infrastructure capital projects Bhumikorn Kongtaveelert, Auddithio Nag, Peter Li
JEDI: Justifiable End-dialogue Driven Interaction for NPC Entities in Role-Playing Games Willy Chan, Omar Abul-Hassan, Sokserey Sun
Efficient Translation of Natural Language to First-Order Logic Using Step-by-Step Distillation Aliyan Ishfaq, Shreyas Sharma
Enhancing Partisanship Prediction in Congressional Speeches Amelia Leon, JB Jong Beom Lim, Sherry Yang
Needle in a Haystack: Probing Transformer Capabilities to Recognize Non-Star-Free Languages Richard Gu, Sambhav Gupta, Andy Tang
Forticode: A Benchmark for Evaluating the Robustness of Code Generation Models Against Adversarial Syntax Preserving Mutations Amrit Baveja, Anant Singhal
Disarming Sleeper Agents: A Novel Approach Using Direct Preference Optimization Katherine Worden, Jeong Shin
Now You See Me: Vision-enhanced BERT for obfuscated text abuse detection Dylan Zhou
The Potential of Large Language Models in Assisting Data Augmentation for East Asian Digital Humanities Fengyi Lin
Expanding Horizons in RAG: Exploring and Extending the Limits of RAPTOR Alex Laitenberger
The Shades of Meaning: Investigating LLMs’ Cross-lingual Representation of Grounded Structures Pinlin [Calvin] Xu, Garbo Chung
FolioLLM: Constructing portfolio of ETFs using Large Language Models Andrey Popov, Oleg Roshka
Integrating Domain Knowledge for Financial QA: A Multi-Retriever RAG Approach with LLMs Yukun Zhang, Stefan Elbl Droguett, Samyak Jain
Integrating Extra Linguistic Meaning into the BERT Framework Riley Carlson, Bradley Moon, Ishaan Singh
A Contextual Approach Towards Financial Sentiment AnalysisEmma Sun
Enhancing Construction Project Management through a Cross-Modal Retrieval System Jayadev Rajan
Large language models for sustainable food designAnna Thomas
News to Numbers: NLP Stock Return Predictions Shree Reddy, Henrique B. N. Monteiro, Lucas Werneck
Apollo: A Large Multi-Modal Model Capable of Sampling Videos at 8fpsOrr Zohar
Analyzing the Effectiveness of Morphologically Motivated Tokenization on Machine Translation for Low-Resource Languages Abhishek Vangipuram, Emiyare Ikwut-Ukwa, William Huang
Hivemind: An Architecture to Amalgamate Fine-Tuned LLMs Matthew Mattei, Matt Hsu, Ramya Iyer
Leveraging Long Context for Customer SupportIan Lim
Course Recommendation Chatbot Naama Bejerano, Emma Troast
From Headlines to Bottom Lines: Leveraging Earning Releases and News Headlines to Predict Stock Price MovementAnanya Krishnan, Jinny Chung, Charles Shaviro
Leverage Augmented Large Language Models to build Hyper Personalized Recommendation Systems Viveak Ravichandiran
Retrieval Augmented Verilog Generation Joseph Rejive
Parsing FDA label data with LLMsJake Silberg
FAST: Finetuning Agents with Synthetic Trajectories Flor Lozano-Byrne
Diving Under the Hood: Exploring LLM Conceptual Understanding Through Latent Embeddings Kelvin Nguyen
Korean-English Neural Machine Translation with Language Style Control Jiwon Jeong, Hyejin Lee, Youjin Song
Using Segmented Novel Views, Depth, and BERT Embeddings for Training in Robot Learning Matt Strong
How Much Attention is "All You Need"? Ignacio Fernandez, Duru Irmak Unsal
A case for pre-training in Compositional Generalization tasks Ahmad Jabbar, Rhea Kapur
RubricEval - Scalable Human-LLM Evaluation of LLMs on Open-Ended Tasks Using Human-Written RubricsStella Zhang
MuRST: Multilingual Recursive Summarization Trees Tarini Mutreja, Saron Samuel, Humishka Zope
Simulating the Court: Legal Judgment Prediction through Relational Learning Ein Jun
An Exploration of Transferring Domain Expertise Jonathan Paul Hsu
Posetta: Language-Guided Protein Design Haotian Du, Jingjia Liu, Tianyu Lu
Self Reward Scaling Arjun Chandran
Optimizing Human-Agent Interaction: Evaluating Diverse Strategies for Human Input in the OptiMUS LLM Agent System Idil Defne Çekin, Isaiah Hall
A Neuro-Symbolic Integration of LLMs and SMT-solvers for Trustworthy Logical ReasoningHarun Khan
Experiments on Multi-Task Learning Framework over BERT for Performing Sentiment Analysis, Paraphrase Detection, and Semantic Textual Similarity Simultaneously Florence Chen
arXivBot Amr Sherif
Robust DPO with Convex-NN on Single GPU Miria Feng
DNACLIP: Contrastive representation learning for joint embedding of DNA and natural languageBrian Kang
Taming Guidelines in the Wild Anuj Iravane
Long Horizon Robotic Manipulation through Closed-Loop Mark-Based Visual PromptingDavid Ihim
Context-Aware Gesture Interpretation in Augmented and Virtual RealityTrishia El Chemaly
Reading Between the Minds: Context-Aware Brain-to-Text Decoding Ellie Tanimura, Sarosh Khan
Clinical Text Summarization with LLM-Based Evaluation Daphne Barretto, Matthew Jin, Bora Oztekin
Beauty and a Beat: Comparing and Combining the Utility of Lyrical and Acoustic Features to Identify Genuine PlaylistsNaomi Eigbe
Tracing the Development of Word Meaning During Training Shenghua Liu, Yiheng Ye
MoonSpeech - Training a tiny multi-modal LLMKrishna Dusad
Integrating Clinical Note Synthesis with Synthetic EHR Data for Enhanced Healthcare AnalysisJessica Yang, Riya Karumanchi
Automated Extraction and Detection of Selective Reporting in Publications of Landmark Cancer TrialsMaximilian Schuessler, Amanda Rodriguez, Selina Pi
An LLM-Based Recommender System for Scientific PapersVijay Josephs, Aaron Reed
Actions versus Objects: Understanding Gendering of Jobs through LanguageEcho Yan Zhou

Default Projects

Project nameAuthors
Enhancing minBERT for Multi-Task NLP: Architectural and Training Innovations Xinxie Wu
Enhancing multi-task fine-tuning on BERT-based model Xiaochen Xiong
Task-Specific Parameter Efficient Fine-Tuning for Improving Multitask BERT Brian K. Ryu
Adapt BERT on Multiple Downstream Tasks Ran Li
Post-Op BERT: Improving Gradient-Surgery on Imbalanced Data Giancarlo Ricci
Improving Semantic Meaning of BERT Sentence Embeddings Timothy Yao
Improving the Performance of BERT Using Contrastive Learning and Meta-Learning Akash Gupta, Justin Shen, Peter Westbrook
Fine-Tuning BERT for Multi-Task Prediction Uma Dayal
Robust Adaptation of BERT using SMARTOzgur Cetin
Improving minBERT with Conditional Layer Normalization Matan Abrams
Multi-task Learning with minBERT Praneet Bhoj
optiBERT: Fine-Tuning BERT for Optimal PerformanceJirah Taylor
Multitask Finetuning for MinBERT Ethan Boneh
ConCATenation Curiosity: Evaluating Multitask Performance of minBERT Prithvi Krishnarao, Emily Redmond
Parameter Efficient Fine-Tuning for Multi-Task BERT Zhen Wu, Genghan Zhang, Alexa Hu
Multi-task BERT Fine-Tuning with Gradient Tricks Henry Ang
PowerBERT: Improving BERT with a Power Set Ensemble of Fine-Tuned Single and Multitask Models Eric Lee, Kevin Song, Jeanette Han
PALs of MTL: Investigating Task Scheduling Algorithms in the Presence of PALs Kris Jeong, Pauline Arnoud
BERT with LORA: Low-rank Adaptation Of Large Language Models Tolu Oyeniyi
From BERT to Brilliance: An Analytical Approach to Advancing Multitask Learning for NLP Akea Pavel, Adrian Mendoza-Perez
Comparing BERT Fine-Tuning Methods Adrian Stoll, Jennifer Ho, Daniel Tyshler
Dynamic Weight Adjustment for Multitask BERT: An Approach to Sentiment, Paraphrase, and Similarity TasksMarco Pizarro
Multitask training BERT for Sentiment Analysis, Semantic Textual Similarity, and Paraphrase DetectionFeiyang Kuang
BERT Multitask Methods for Low-Parameter Fine-Tuning Elton Manchester
miniBert Unleashed John Cao, David Kwentua
Implementing and Enhancing minBERT for Optimized Performance on Multiple Downstream Classification Tasks Priti Rangnekar
Multi-BERT: Investigating Methods for BERT Multitask Learning Zach Benton
Expanding minBERT’s Scope: Integrating SimCSE, Ensemble Learning, and PANDA Yasmina Abukhadra, Samantha Liu, Hannah Norman
Parameter Efficient BERT Fine-tuning Ang Li
Hybrid BERT: Sharing Layers for Multitask Performance with Fewer Parameters Jake North, Jared Weissberg
Rhapsody on a Theme of Gradient Surgery: Variations to Improve minBERT for Multi-Task Learning Christian Femrite
minBERT Multi-Extended: Fine Tuning minBERT for Downstream Tasks Jayna Huang, Isabella Lee, Sophie Zhang
BEES: Bi-Encoder Ensembles with Simple Contrastive Learning and Smoothness Induced Adversarial Regularization Nithish Kaviyan Dhayananda Ganesh
minBERT and Downstream Tasks Bay Foley-Cox, David Wendt
Too SMART for your own good: Multitask Fine-Tuning pre-trained minBERT through Regularized Optimization and hyper-parameter optimization Proud Mpala, Wayne Chinganga
MathBERT: Increasing mathematical reasoning through Domain-Specific Fine-tuning and Optimization John Founds, Carlos Santana
Implementing and Fine-Tuning BERT for Sentiment Classification, Paraphrase Detection and Semantic Similarity Analysis Anqi Zhu, Antonio Torres Skillicorn, Kyra Sophie Kraft
BERT and Multitask Learning Sureen Heer, Collin Jung, Adrian Molofsky
A Better Multitask BERT: Improving on Fine-Tuning Andrea Hurtado, Sarah Teaw
Optimizing Multitask BERT: A Study of Sampling Methods and Advanced Training Techniques Andrew Wu, Wesley Larlarb
YourBERT: Tailoring BERT for Precision Paolo Tayag, Jack Walter
Utilizing Enhanced Deep Contextualized Word Embeddings for Downstream Tasks Renaldo Venegas, Ethan Yuen
BERT on Multitask Training: Bimodality, Ensemble, Round-robin, Text-encoding, and More Jiaxiang Ma, Yuchen Deng
Untitled Betty Wu
Cooking A Multitude of Optimizations for BERT Multi-Task Mastery Michael Cho, Michael Peter Hong, Michael Marcotte
Multi-Dimensional BERT: Bridging Versatility and Specialization in Multi-Task Learning Emma Casey, Luke Moberly
Fine-tuning BERT for Multi-task Learning Yutai Luo
Multitask and Task-specific Optimizations for minBERT James Chen, Krish Parikh
Utilizing minBERT for Multiple Sentence-level Tasks Qianhui Zheng
(Multi-gate) Mixture of ExBerts O Sub Kwon
minBERT and Downstream TasksZhihua Cai
Fine BERT Xavier Millan, Yuvraj Baheti, Andrew Nguyen
UnBERTlievable: How Extensions to BERT Perform on Downstream NLP Tasks Sophie Andrews, Naomi Boneh
PALs and MNRL: Adaptations for Multi-Task BERT Lei Yin
Multitask BERT Fine-Tuning and Generative Adversarial Learning for Auxiliary Classification Christopher Sun, Abishek Satish
SMART Fine-tuning Zikui Wang
SLOTH: Semantic Learning Optimization and Tuning Heuristics for Enhanced NLP with minBERT Phillip Miao, Cici Hou
Multitask Learning with BERT Sanjaye Elayattu
An Exploration of Multi-Task Learning over minBERT Chunming Peng, Max Yuan, Annie Wang
minBert with Cosine Similarity and PCGradXiyuan Wu, Alan Zhang
BERT: Battling Overfitting with Multitask Learning and Ensembling Javier Nieto, Annabelle Jayadinata
Strategies for Building Semantic Classification Dataset with LLM and Active Learning Jerry Chan
BERT Goes to School: Improving BERT Embeddings Through Curriculum-Based Contrastive Learning and Synonym-Based Data Augmentation Arnav Gangal, Martin Pollack, Russell Tran
Bagging the Singular Value Decomposition - A Joint Implementation of LoRA and Bootstrap Aggregating as a Fine-tuning Regime Andri Vidarsson, Jacob Thornton, Raphaëlle Ramanantsoa
Research on the Application of Deep Learning-based BERT Model with Additional Pretraining and Multitask Fine-Tuning Muran Yu, Ricky Liu
My PAL BERT: Using Projected Attention Layers and Additional Fine-Tuning Strategies to Improve BERT’s Performance on Downstream Tasks Colin Michael Sullivan, Abhishek Kumar
Enhancing Multi-Task Learning with BERT Josiah Griggs
minBERT-based Multitask Model using PAL-LoRA Xian Wu
Enhacing minBERT by Leveraging CosineEmbeddingLoss Fine-Tuning and Multi-Task Learning with Gradient Surgery Peter De La Cruz, Mohamed Musa, Yahaya Ndutu
Orthogonal Projection Loss for Multi-Headed Attention Kyle McGrath
Enhancing Dev Accuracy with DoRA Hayden Kim, Nilson Rodriguez Cadenas
SMARTer Multi-task Fine-tuning of BERT Disha Ghandwani, Aditya Ghosh, Rahul Kanekar
Tuning Up BERT: A Symphony of Strategies for Downstream Tasks Nick Soulounias
AllBERT: Mastering Multiple-Tasks Efficiently Thierry Rietsch, Joe Serrano
Beyond BERT: a Multi-Tasking Journey Through Sampling, Loss Functions, Parameter-Efficient Methods, Ensembling, and More Alycia Lee, Amanda Li
Enhancing BERT’s Performance on Downstream Tasks via Multitask Fine-Tuning and Ensembling Lianfa Li
minBert and Downstream Tasks Zhimin Tang
Multitask Contrastive Learning for Sentence RepresentationAbdulaziz Alharbi
BERT’s Got Talent: Advanced Fine-Tuning Strategies for Better BERT Generalization Grace Luo, Danny Lin
Separating Meaning From Weights in Sentence EmbeddingsHaibib Kerim
A multi-objective approach to improving accuracy and efficiency in multitask BERT Arpit Singh, Amitai Porat, Lin Ma
Supercharging MinBERT with Contrastive Learning & Self-Distillation Cécile Logé Baccari
Parameter-Efficient Adaptation of BERT using LoRA and MoE Li-Heng Lin, Yi-Ting Wu
Exploring Multi-Task Learning with Unbalanced Datasets and Gradient Surgery Julien Darve
Multitask minBERT with Parameter-efficient Fine-tuning Zhuoqi (Charlie) Zhang
Enhanced BERT Adaptation: Ensembling LoRA Models for Improved Fine-Tuning Denis Tolkunov
Parameter-Efficient Learning Strategies for Multi-Task Applications of BERT Irmak Sivgin, Mahmut Yurt
Efficient multi-task learning strategies for single BERTJiamin Sun, Xingjian Zhang
Exploring Transfer Learning and Multi-Task Learning: An Experimental Analysis of Diverse Architectures Sayali Sonawane
Improving Multi-Task BERT Fine-Tuning: Effective Methods and Practices Amy Wang, Haopeng Xue, Xinling Li
minBERT and Downstream Tasks Ziyang Ding, Daniel Zou
BERT Multitask Learning in a Semi-Supervised Learning Setup Danhua Yan
BERTille, a multitask BERT model made in France Alexis Bonnafont, Malo Sommers, Salma Zainana
Enhancing BERT:The Effects of Additional Pretraining Using Downstream Task Relevant Datasets Chase Nwamu
Strategies for Optimization of minBERT for Multi-Task Learning Ankur Jai Sood, Cameron Heskett, Shinwoo Lee
How to make your BERT model an xBERT in multitask learning? Xingshuo Xiao
ComBERT: Improving Multitask-BERT with Combinations of Scale-invariant Optimization and Parameter-efficient Fine-tuning Harper Hua, Ruiquan Gao , Xinyi(Jojo) Zhao
Fine-tune miniBERT for multi-task learning Geo Zhang
Hybrid Multitask Learning with BERT Anna Mattinger
Improving minBERT on Downstream Tasks Through Combinatorial Extensions Yichen Jiang, Ria Calcagno, Senyang Jiang
StudentBERT: Multitask Training with Teacher Models Shang Jiang, Shengtong Zhang
Balancing Act: Evaluating the Interactions Between Multi-Task Learning, Cosine Similarity, and Adversarial RegularizationLeo Glikbarg, Dhafer Faishal
Build, Extend, Repeat, Triumph: Extensions to a BERT Multitask ModelOliver Lee, Ethan Hsu, Manat Kaur
Fasting NLP:Slimming Down Models with QLoRA Ricardo Carrillo
BEST-FiT: Balancing Effective Strategies for Tri-task Fine Tuning Elijah Kim, Rohan Sanda
Extending minBERT in a Multi-Task Setting Jean-Philippe Lemay, Veljko Skarich, Joe Wang
Multi-task Learning for BERT Xinyan He, Wei Zhao
Enhancing BERT with Adapter Fusion: A Strategy for Multi-Task Knowledge Transfer Hao Xu
Combining Contrastive Learning with Layer Utility Analysis and Experimental Multi-Task Finetuning to Improve mini-BERT Performance Ellie Vela, Vionna Atef, Ethan Bogle
minBERT Bryant Mendez Melchor, Gustavo Martinez
minBERT and Downstream Tasks Jiajing Luo
Parameter Efficient Fine Tuning in Multi-Task Learning with minBERTHavin E. Hosgur
BERTolomeu: Exploring Methods to Improve Downstream Task Performance with BERT Flora Yuan, Jack Zhang
Multi-Task Learning for Language Model Fine-Tuning Anonto Zaman
PradrewBert: The Efficient One Pradyumna Saligram, Andrew Lanpouthakoun
Smarter BERT with Better Understanding and GeneralizationYujie Gao
Multitask BERT for Sentiment Analysis, Paraphrase Detection, and Semantic Textual SimilarityBrandon Ring
Triple Threat: Exploring Multi-task Training Strategies for BERT Sally Zhu, Akaash Kolluri
NeuSemble: Neural Ensemble of Multitask Learning with minBERTYouzhi (Yousef) Liang
Bidirectional Encoder Representations from Transformers (BERT) with Mixture of Experts for Sentiment Analysis, Paraphrase Detection, and Semantic Textual SimilarityDavid D. Wu
Fine-tuning BERT for Multiple Downstream Tasks Jianhao Cai
Parts of MinBERT Monica Hicks, Megan Liu
Improving the performance of miniBERT and BERTaaR (BERT as a Recruiter) Prabhjot Singh Rai, Jared Isobe
Applying Multitask Fine-Tuning to Sentence-BERT Alvin Ayuyo
Investigation of BERT Fine-tuning Strategies Sheena Lai
BERT-Based Multi-Task Learning for Natural Language Understanding Ray Ortigas
Fine-tuning and Gender Fairness of BERT Models Kevin Rizk
Improving minBert using Multi-Task Fine-Tuning with cosine-similarityEun Sun Song
Exploring Efficient Learning of Small BERT Networks with LoRA and DoRA Aditri Bhagirath, Moritz Bolling, Daniel Frees
Downsizing minBERT without hurting performanceNazar Khan
Leveraging PEFT Strategies for Improved Performance and Efficiency in minBERT Mateo Quiros Bloch, Lara Seyahi, Susan Ahmed
Mastering minBERT: A True Balancing Act Emma Escandon, Alejandro Rivas, Daniela Uribe
Minh-BERT Taran Kota

AltStyle によって変換されたページ (->オリジナル) /