Papers

12094 papers

ICLR2025

GOttack: Universal Adversarial Attacks on Graph Neural Networks via Graph Orbits Learning

Summary pending...

graphletorbitadversarial machine learning
ICLR2025

NutriBench: A Dataset for Evaluating Large Language Models in Nutrition Estimation from Meal Descriptions

Summary pending...

Large Language ModelsNutrition EstimationDataset and Benchmark
ICLR2025

The Value of Sensory Information to a Robot

Summary pending...

roboticslimited sensingperception
ICLR2025

Fitting Networks with a Cancellation Trick

Summary pending...

Network analysisDCBMlogit-DCBM
ICLR2025

Holistically Evaluating the Environmental Impact of Creating Language Models

Summary pending...

machine learningartificial intelligencelanguage model
ICLR2025

When does compositional structure yield compositional generalization? A kernel theory.

Summary pending...

compositional generalizationrule learningkernel regression
ICLR2025

Chunk-Distilled Language Modeling

Summary pending...

language modelingtext generationretrieval-augmented generation
ICLR2025

TorchTitan: One-stop PyTorch native solution for production ready LLM pretraining

Summary pending...

large language modelsdistributed trainingpre-training
ICLR2025

Solving hidden monotone variational inequalities with surrogate losses

Summary pending...

Variational InequalityOptimizationSurrogate
ICLR2025

CTSyn: A Foundation Model for Cross Tabular Data Generation

Summary pending...

Foundation ModelTabular DataSynthetic Data Generation
ICLR2025

Provable Convergence Bounds for Hybrid Dynamical Sampling and Optimization

Summary pending...

langevinacceleratorssampling
ICLR2025

Context-Parametric Inversion: Why Instruction Finetuning May Not Actually Improve Context Reliance

Summary pending...

Instruction finetuningcontext-vs-parametric reliance
ICLR2025

Differentiable Optimization of Similarity Scores Between Models and Brains

Summary pending...

similarity measuresrepresentational alignmentprocrustes distance
ICLR2025

Aioli: A Unified Optimization Framework for Language Model Data Mixing

Summary pending...

data mixinglanguage modelsdata curation
ICLR2025

SFS: Smarter Code Space Search improves LLM Inference Scaling

Summary pending...

LLMcode generationoptimization
ICLR2025

Provable Uncertainty Decomposition via Higher-Order Calibration

Summary pending...

uncertainty quantificationcalibrationtrustworthy ML
ICLR2025

Scaling Laws for Precision

Summary pending...

quantizationscaling lawsprecision
ICLR2025

Answer, Assemble, Ace: Understanding How LMs Answer Multiple Choice Questions

Summary pending...

interpretability; multiple-choice question answering
ICLR2025

Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo

Summary pending...

Sequential Monte CarloLanguage ModelsSemantic parsing
ICLR2025

APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding

Summary pending...

Parallel Encoding; Context-Augmented LLM; Efficient Inference; Length Extrapolation