diffusion

Title: Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion. (arXiv:2310.02279v1 [cs.LG])

Title: FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models. (arXiv:2310.02401v1 [cs.CV])

Title: EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods. (arXiv:2310.02426v1 [cs.CV])

Title: Generalization in diffusion models arises from geometry-adaptive harmonic representation. (arXiv:2310.02557v1 [cs.CV])

Title: SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D. (arXiv:2310.02596v1 [cs.CV])

Title: MagicDrive: Street View Generation with Diverse 3D Geometry Control. (arXiv:2310.02601v1 [cs.CV])

Title: On Memorization in Diffusion Models. (arXiv:2310.02664v1 [cs.LG])

Title: ED-NeRF: Efficient Text-Guided Editing of 3D Scene using Latent Space NeRF. (arXiv:2310.02712v1 [cs.CV])

Title: Magicremover: Tuning-free Text-guided Image inpainting with Diffusion Models. (arXiv:2310.02848v1 [cs.CV])

Title: Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts. (arXiv:2310.02906v1 [cs.CV])

Title: T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation. (arXiv:2310.02977v1 [cs.CV])

Title: Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples. (arXiv:2310.02988v1 [cs.CV])

Title: Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day. (arXiv:2310.03015v1 [cs.CV])

Title: Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models. (arXiv:2310.03020v1 [cs.CV])

Title: Stochastic force inference via density estimation. (arXiv:2310.02366v1 [cs.LG])

Title: SE(3)-Stochastic Flow Matching for Protein Backbone Generation. (arXiv:2310.02391v1 [cs.LG])

Title: Learning to Reach Goals via Diffusion. (arXiv:2310.02505v1 [cs.LG])

Title: Ophiuchus: Scalable Modeling of Protein Structures through Hierarchical Coarse-graining SO(3)-Equivariant Autoencoders. (arXiv:2310.02508v1 [cs.LG])

Title: MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data Augmentation. (arXiv:2310.02520v1 [cs.LG])

Title: Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory optimization. (arXiv:2310.02679v1 [cs.LG])

Title: Fast, Expressive SE$(n)$ Equivariant Networks through Weight-Sharing in Position-Orientation Space. (arXiv:2310.02970v1 [cs.LG])

self-supervised

Title: FroSSL: Frobenius Norm Minimization for Self-Supervised Learning. (arXiv:2310.02903v1 [cs.LG])

Title: Dual Conic Proxies for AC Optimal Power Flow. (arXiv:2310.02969v1 [cs.LG])

foundation model

Title: EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations. (arXiv:2310.02428v1 [cs.LG])

Title: scHyena: Foundation Model for Full-Length Single-Cell RNA-Seq Analysis in Brain. (arXiv:2310.02713v1 [cs.LG])

Title: Multiple Physics Pretraining for Physical Surrogate Models. (arXiv:2310.02994v1 [cs.LG])

generative

Title: Improving Automatic VQA Evaluation Using Large Language Models. (arXiv:2310.02567v1 [cs.CV])

Title: Analyzing and Improving OT-based Adversarial Networks. (arXiv:2310.02611v1 [cs.LG])

Title: GETAvatar: Generative Textured Meshes for Animatable Human Avatars. (arXiv:2310.02714v1 [cs.CV])

Title: From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference. (arXiv:2310.03003v1 [cs.CL])

In this paper, we describe experiments conducted to study the computational and energy utilization of inference with LLMs. We benchmark and conduct a preliminary analysis of the inference performance and inference energy costs of different sizes of LLaMA -- a recent state-of-the-art LLM -- developed by Meta AI on two generations of popular GPUs (NVIDIA V100 \& A100) and two datasets (Alpaca and GSM8K) to reflect the diverse set of tasks/benchmarks for LLMs in research and practice. We present the results of multi-node, multi-GPU inference using model sharding across up to 32 GPUs. To our knowledge, our work is the one of the first to study LLM inference performance from the perspective of computational and energy resources at this scale.

Title: Delta-AI: Local objectives for amortized inference in sparse graphical models. (arXiv:2310.02423v1 [cs.LG])

Title: GenCO: Generating Diverse Solutions to Design Problems with Combinatorial Nature. (arXiv:2310.02442v1 [cs.LG])

Title: Dual-stage Flows-based Generative Modeling for Traceable Urban Planning. (arXiv:2310.02453v1 [cs.LG])

Title: A Recipe for Improved Certifiable Robustness: Capacity and Data. (arXiv:2310.02513v1 [cs.LG])

Title: Generative Modeling of Regular and Irregular Time Series Data via Koopman VAEs. (arXiv:2310.02619v1 [cs.LG])

Title: Local Search GFlowNets. (arXiv:2310.02710v1 [cs.LG])

Title: Expected flow networks in stochastic environments and two-player zero-sum games. (arXiv:2310.02779v1 [cs.LG])

Title: A Deep Instance Generative Framework for MILP Solvers Under Limited Data Availability. (arXiv:2310.02807v1 [cs.LG])

anomaly

Title: A Prototype-Based Neural Network for Image Anomaly Detection and Localization. (arXiv:2310.02576v1 [cs.CV])

Title: Improving Vision Anomaly Detection with the Guidance of Language Modality. (arXiv:2310.02821v1 [cs.CV])

Title: Delving into CLIP latent space for Video Anomaly Recognition. (arXiv:2310.02835v1 [cs.CV])

Title: ARRQP: Anomaly Resilient Real-time QoS Prediction Framework with Graph Convolution. (arXiv:2310.02269v1 [cs.LG])

Title: Expert enhanced dynamic time warping based anomaly detection. (arXiv:2310.02280v1 [cs.LG])

Title: Rayleigh Quotient Graph Neural Networks for Graph-level Anomaly Detection. (arXiv:2310.02861v1 [cs.LG])

Title: ELUQuant: Event-Level Uncertainty Quantification in Deep Inelastic Scattering. (arXiv:2310.02913v1 [cs.LG])

in-context

Title: DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning. (arXiv:2310.02954v1 [cs.CL])

Title: Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions. (arXiv:2310.03016v1 [cs.LG])

memory

Title: Land-cover change detection using paired OpenStreetMap data and optical high-resolution imagery via object-guided Transformer. (arXiv:2310.02674v1 [cs.CV])

Title: Dynamic Shuffle: An Efficient Channel Mixture Method. (arXiv:2310.02776v1 [cs.CV])

Title: Unsupervised Speech Recognition with N-Skipgram and Positional Unigram Matching. (arXiv:2310.02382v1 [cs.CL])

Title: Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness. (arXiv:2310.02410v1 [cs.LG])

Title: ResidualTransformer: Residual Low-rank Learning with Weight-sharing for Transformer Layers. (arXiv:2310.02489v1 [cs.CL])

Title: DON-LSTM: Multi-Resolution Learning with DeepONets and Long Short-Term Memory Neural Networks. (arXiv:2310.02491v1 [cs.LG])

few-shot

Title: NOLA: Networks as Linear Combination of Low Rank Random Basis. (arXiv:2310.02556v1 [cs.CL])

Title: SHOT: Suppressing the Hessian along the Optimization Trajectory for Gradient-Based Meta-Learning. (arXiv:2310.02751v1 [cs.LG])

Title: Multimodal Question Answering for Unified Information Extraction. (arXiv:2310.03017v1 [cs.CL])