Fine-Tuning - Tag - DataSci Ocean

Fine-Tuning

2025

SENSE Explained: How Strong and Weak LLMs Achieve State-of-the-Art Text-to-SQL (ACL 2024) 09-15

2024

Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning 07-08

GAIA: A Benchmark for General AI Assistants 06-27

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM 04-24

Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints 04-10

The 3 Stages of LLM Training: A Deep Dive into Reinforcement Learning from Human Feedback (RLHF) 02-27

DPO: Direct Preference Optimization 02-27

2023

VIME Explained: Supercharge Your Tabular Models with Self-Supervised Learning 04-15