Papers
12094 papers
ICLR2025
Deconstructing What Makes a Good Optimizer for Autoregressive Language Models
optimization; LLMs; language models
ICLR2025
Brain-inspired $L_p$-Convolution benefits large kernels and aligns better with visual cortex
Lp-Convolution; Receptive Field; Multivariate p-generalized normal distribution
ICLR2025
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
Multilingual; Multimodal; LLMs
ICLR2025
Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?
llms; translation; low-resource
ICLR2025
Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning
multi-agent reinforcement learning; communication
ICLR2025
Aligning Visual Contrastive learning models via Preference Optimization
contrastive learning; preference optimization; alignment
ICLR2025
MANTRA: The Manifold Triangulations Assemblage
simplicial complex; topological deep learning; high-order
ICLR2025
EqNIO: Subequivariant Neural Inertial Odometry
equivariance; inertial odometry; subequivariance
ICLR2025
Weighted Point Set Embedding for Multimodal Contrastive Learning Toward Optimal Similarity Metric
contrastive learning; representation learning; multimodal representation learning
ICLR2025
LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation models
3D foundation model; model specialization; robust optimization
ICLR2025
Small-to-Large Generalization: Training Data Influences Models Consistently Across Scale
data attribution
ICLR2025
Large Language Models Assume People are More Rational than We Really are
Large Language Models; Rationality; Cognitive Models
ICLR2025
Can Transformers Do Enumerative Geometry?
AI for Mathematics; Algebraic Geometry; Theorem Discovery
ICLR2025
Robustness Auditing for Linear Regression: To Singularity and Beyond
Robust machine learning; linear regression; robustness auditing
ICLR2025
MamKO: Mamba-based Koopman operator for modeling and predictive control
Mamba; Koopman operator; model predictive control; nonlinear systems
ICLR2025
Local Steps Speed Up Local GD for Heterogeneous Distributed Logistic Regression
optimization; convex optimization; distributed optimization
ICLR2025
Selective Unlearning via Representation Erasure Using Domain Adversarial Training
approximate unlearning; adversarial training
ICLR2025
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
hallucinations; truthfulness; interpretability
ICLR2025
Scaling up Masked Diffusion Models on Text
Masked Diffusion Models; Scaling Laws; Conditional Generation
ICLR2025
Union-over-Intersections: Object Detection beyond Winner-Takes-All
localization based feature representation; intersection over union; object detection