Papers

12094 papers

ICLR2025

Deconstructing What Makes a Good Optimizer for Autoregressive Language Models

optimization, LLMs, language models
ICLR2025

Brain-inspired $L_p$-Convolution benefits large kernels and aligns better with visual cortex

Lp-Convolution, Receptive Field, Multivariate p-generalized normal distribution
ICLR2025

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Multilingual, Multimodal, LLMs
ICLR2025

Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?

llms, translation, low-resource
ICLR2025

Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning

multi-agent reinforcement learning, communication
ICLR2025

Aligning Visual Contrastive learning models via Preference Optimization

contrastive learning, preference optimization, alignment
ICLR2025

MANTRA: The Manifold Triangulations Assemblage

simplicial complex, topological deep learning, high-order
ICLR2025

EqNIO: Subequivariant Neural Inertial Odometry

equivariance, inertial odometry, subequivariance
ICLR2025

Weighted Point Set Embedding for Multimodal Contrastive Learning Toward Optimal Similarity Metric

contrastive learning, representation learning, multimodal representation learning
ICLR2025

LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation models

3D foundation model, model specialization, robust optimization
ICLR2025

Small-to-Large Generalization: Training Data Influences Models Consistently Across Scale

data attribution
ICLR2025

Large Language Models Assume People are More Rational than We Really are

Large Language Models, Rationality, Cognitive Models
ICLR2025

Can Transformers Do Enumerative Geometry?

AI for Mathematics, Algebraic Geometry, Theorem Discovery
ICLR2025

Robustness Auditing for Linear Regression: To Singularity and Beyond

Robust machine learning, linear regression, robustness auditing
ICLR2025

MamKO: Mamba-based Koopman operator for modeling and predictive control

Mamba; Koopman operator; model predictive control; nonlinear systems
ICLR2025

Local Steps Speed Up Local GD for Heterogeneous Distributed Logistic Regression

optimization, convex optimization, distributed optimization
ICLR2025

Selective Unlearning via Representation Erasure Using Domain Adversarial Training

approximate unlearning, adversarial training
ICLR2025

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

hallucinations, truthfulness, interpretability
ICLR2025

Scaling up Masked Diffusion Models on Text

Masked Diffusion Models, Scaling Laws, Conditional Generation
ICLR2025

Union-over-Intersections: Object Detection beyond Winner-Takes-All

localization based feature representation, intersection over union, object detection