Papers

12094 papers

ICLR2024

Towards Best Practices of Activation Patching in Language Models: Metrics and Methods

Summary pending...

language model interpretabilityinterpretabilitymechanistic interpretability
ICLR2024

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

Summary pending...

Table UnderstandingIn-context LearningLarge Language Model
ICLR2024

What does the Knowledge Neuron Thesis Have to do with Knowledge?

Summary pending...

language modelknowledge neuronmodel editing
ICLR2024

Revisiting Link Prediction: a data perspective

Summary pending...

Link Prediction;Graph Neural Network
ICLR2024

Semantic Flow: Learning Semantic Fields of Dynamic Scenes from Monocular Videos

Summary pending...

3D visionNeRFsemantic understanding
ICLR2024

The Generative AI Paradox: “What It Can Create, It May Not Understand”

Summary pending...

LMslanguage modelsvision models
ICLR2024

On the Over-Memorization During Natural, Robust and Catastrophic Overfitting

Summary pending...

overfittingnatural overfittingrobust overfitting
ICLR2024

Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL

Summary pending...

Reinforcement learningadversarial policies
ICLR2024

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts

Summary pending...

Text-to-3D Creation3D Content EditingAttribute Mismatching
ICLR2024

Jointly Training Large Autoregressive Multimodal Models

Summary pending...

Large Multimodal Models; Joint Training; Interleaved Image-Text Generation; Autoregressive Models
ICLR2024

Threshold-Consistent Margin Loss for Open-World Deep Metric Learning

Summary pending...

Deep metric learningOpen-world visual recognitionThreshold consistency
ICLR2024

On the Analysis of GAN-based Image-to-Image Translation with Gaussian Noise Injection

Summary pending...

Image to Image translationNoise robustnessf-divergence
ICLR2024

Stable Neural Stochastic Differential Equations in Analyzing Irregular Time Series Data

Summary pending...

Neural Ordinary Differential EquationsNeural Stochastic Differential EquationsIrregular time series data
ICLR2024

Stable Anisotropic Regularization

Summary pending...

isotropyLLMsoutlier dimensions
ICLR2024

Outliers with Opposing Signals Have an Outsized Effect on Neural Network Optimization

Summary pending...

neural network optimizationprogressive sharpeningedge of stability
ICLR2024

LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models

Summary pending...

Neural Network QuantizationModel CompressionGenerative Language Model
ICLR2024

Tree Cross Attention

Summary pending...

AttentionRetrievalTree
ICLR2024

Latent Trajectory Learning for Limited Timestamps under Distribution Shift over Time

Summary pending...

Distribution ShiftTemporal Distribution Shift
ICLR2024

Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting

Summary pending...

Time seriesTransformerMulti-scale
ICLR2024

PTaRL: Prototype-based Tabular Representation Learning via Space Calibration

Summary pending...

Tabular dataDeep neural networksTabular representation learning