Papers

12094 papers

ICLR2025

Effective post-training embedding compression via temperature control in contrastive training

Summary pending...

representation learning, embeddings, text retrieval
ICLR2025

Large (Vision) Language Models are Unsupervised In-Context Learners

Summary pending...

llm, unsupervised, in-context learning
ICLR2025

Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning

Summary pending...

Safe RL, Offline RL, Variational Autoencoders
ICLR2025

Mitigating Memorization in Language Models

Summary pending...

language models, memorization, machine unlearning
ICLR2025

Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling

Summary pending...

reinforcement learning, model-based reinforcement learning, optimistic exploration
ICLR2025

Reducing Hallucinations in Large Vision-Language Models via Latent Space Steering

Summary pending...

Large Vision-Language Models, Multimodal large language model, Hallucination
ICLR2025

Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning

Summary pending...

large language model, hierarchical attention, contrastive learning
ICLR2025

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Summary pending...

Large Language Models, Length Extrapolation, Efficiency, Hybrid State Space Models
ICLR2025

Palu: KV-Cache Compression with Low-Rank Projection

Summary pending...

KV-Cache, Low-Rank Compression, Large Language Model
ICLR2025

Bounds on $L_p$ Errors in Density Ratio Estimation via $f$-Divergence Loss Functions

Summary pending...

density ratio estimation, variational divergence optimization, Kullback–Leibler divergence
ICLR2025

Shared-AE: Automatic Identification of Shared Subspaces in High-dimensional Neural and Behavioral Activity

Summary pending...

Computational neuroscience, Multimodal, Social behavior
ICLR2025

HelpSteer2-Preference: Complementing Ratings with Preferences

Summary pending...

reward modelling, rlhf, model alignment
ICLR2025

The 3D-PC: a benchmark for visual perspective taking in humans and machines

Summary pending...

3D vision, visual cognition, developmental psychology
ICLR2025

Controllable Context Sensitivity and the Knob Behind It

Summary pending...

analysis, interpretability, mechanistic interpretability
ICLR2025

Explore Theory of Mind: program-guided adversarial data generation for theory of mind reasoning

Summary pending...

theory of mind reasoning, adversarial data generation, program-guided data generation
ICLR2025

Optimizing 4D Gaussians for Dynamic Scene Video from Single Landscape Images

Summary pending...

Dynamic Scene Video, 4D Gaussian
ICLR2025

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Summary pending...

Multimodal Large Language Model, Federated Prompt Learning, Personalization
ICLR2025

Sparse components distinguish visual pathways & their alignment to neural networks

Summary pending...

visual representations, alignment, sparse decomposition
ICLR2025

NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics

Summary pending...

audio-language foundation models, multimodal large language models (llms), bioacoustics
ICLR2025

GOttack: Universal Adversarial Attacks on Graph Neural Networks via Graph Orbits Learning

Summary pending...

graphlet, orbit, adversarial machine learning