diffusion

Title: On Manipulating Scene Text in the Wild with Diffusion Models. (arXiv:2311.00734v1 [cs.CV])

Title: Towards High-quality HDR Deghosting with Conditional Diffusion Models. (arXiv:2311.00932v1 [cs.CV])

Title: Bridging the Gap: Addressing Discrepancies in Diffusion Model Training for Classifier-Free Guidance. (arXiv:2311.00938v1 [cs.LG])

Title: Gaussian Mixture Solvers for Diffusion Models. (arXiv:2311.00941v1 [cs.LG])

Title: Optimal Noise pursuit for Augmenting Text-to-Video Generation. (arXiv:2311.00949v1 [cs.CV])

Title: VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning. (arXiv:2311.00990v1 [cs.CV])

Title: Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs. (arXiv:2311.01015v1 [cs.CV])

Title: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion. (arXiv:2311.01017v1 [cs.CV])

Title: Expanding Expressiveness of Diffusion Models with Limited Data via Self-Distillation based Fine-Tuning. (arXiv:2311.01018v1 [cs.CV])

Title: Infusion: Internal Diffusion for Video Inpainting. (arXiv:2311.01090v1 [cs.CV])

Title: Optimal Transport-Guided Conditional Score-Based Diffusion Models. (arXiv:2311.01226v1 [cs.CV])

Title: DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning. (arXiv:2311.01295v1 [cs.LG])

Title: The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing. (arXiv:2311.01410v1 [cs.CV])

Title: Tipping Points of Evolving Epidemiological Networks: Machine Learning-Assisted, Data-Driven Effective Modeling. (arXiv:2311.00797v1 [cs.LG])

Title: Non-Autoregressive Diffusion-based Temporal Point Processes for Continuous-Time Long-Term Event Prediction. (arXiv:2311.01033v1 [cs.LG])

Title: Add and Thin: Diffusion for Temporal Point Processes. (arXiv:2311.01139v1 [cs.LG])

Title: Diffusion Models for Reinforcement Learning: A Survey. (arXiv:2311.01223v1 [cs.LG])

self-supervised

Title: Are These the Same Apple? Comparing Images Based on Object Intrinsics. (arXiv:2311.00750v1 [cs.CV])

Title: Concatenated Masked Autoencoders as Spatial-Temporal Learner. (arXiv:2311.00961v1 [cs.CV])

Title: Terrain-Informed Self-Supervised Learning: Enhancing Building Footprint Extraction from LiDAR Data with Limited Annotations. (arXiv:2311.01188v1 [cs.CV])

Title: Language Model Training Paradigms for Clinical Feature Embeddings. (arXiv:2311.00768v1 [cs.LG])

Title: COSTAR: Improved Temporal Counterfactual Estimation with Self-Supervised Learning. (arXiv:2311.00886v1 [cs.LG])

Title: VIGraph: Self-supervised Learning for Class-Imbalanced Node Classification. (arXiv:2311.01191v1 [cs.LG])

Title: Combating Bilateral Edge Noise for Robust Link Prediction. (arXiv:2311.01196v1 [cs.LG])

Title: Unreading Race: Purging Protected Features from Chest X-ray Embeddings. (arXiv:2311.01349v1 [cs.LG])

Materials and Methods: Orthogonalization is used to remove the influence of protected features (e.g., age, sex, race) from chest radiograph embeddings, so that downstream results are independent of those features. To validate the efficacy of the approach, we retrospectively study the MIMIC and CheXpert datasets using three pre-trained models: a supervised contrastive model, a self-supervised contrastive model, and a baseline classifier. Our statistical analysis compares the original and the orthogonalized embeddings by estimating the influence of protected features and by evaluating how well race, age, or sex can be predicted from each type of embedding.

Results: Our experiments reveal a significant influence of protected features on predictions of pathologies. Applying orthogonalization removes these feature effects. Besides removing this influence on pathology classification while maintaining competitive predictive performance, orthogonalized embeddings make it infeasible to directly predict protected attributes and mitigate subgroup disparities.

Conclusion: The presented work demonstrates the successful application and evaluation of the orthogonalization technique in the domain of chest X-ray classification.
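
The abstract above does not spell out the orthogonalization procedure, so the following is only a minimal sketch of one common reading: regress each embedding dimension on the protected features by least squares and keep the residuals, which are then linearly uncorrelated with those features. The function name `orthogonalize`, the added intercept column, and the synthetic data are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def orthogonalize(embeddings, protected):
    """Return embeddings with the linear effect of protected features removed.

    Hypothetical helper: a least-squares residualization, assumed here as a
    stand-in for the orthogonalization described in the abstract.
    """
    # Design matrix: an intercept column plus the protected features.
    design = np.column_stack([np.ones(len(protected)), protected])
    # Least-squares fit of every embedding dimension on the design matrix.
    coef, *_ = np.linalg.lstsq(design, embeddings, rcond=None)
    # The residuals are orthogonal to every column of the design matrix.
    return embeddings - design @ coef


# Synthetic stand-ins for real data (assumption): 200 samples,
# 3 protected features, 16-dimensional embeddings that leak those features.
rng = np.random.default_rng(0)
protected = rng.normal(size=(200, 3))
embeddings = protected @ rng.normal(size=(3, 16)) + rng.normal(size=(200, 16))

cleaned = orthogonalize(embeddings, protected)

# Largest inner product between any protected feature and any cleaned
# embedding dimension; it should be zero up to floating-point error.
max_inner = np.abs(protected.T @ cleaned).max()
```

A projection of this kind removes only linear association with the protected features. Whether the authors' method goes further is not stated in the abstract.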

foundation model

Title: Multimodal Foundation Models for Zero-shot Animal Species Recognition in Camera Trap Images. (arXiv:2311.01064v1 [cs.CV])

Title: Recognize Any Regions. (arXiv:2311.01373v1 [cs.CV])

Title: Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models. (arXiv:2311.01441v1 [cs.LG])

Title: Generating QM1B with PySCF$_{\text{IPU}}$. (arXiv:2311.01135v1 [cs.LG])

generative

Title: PET Tracer Conversion among Brain PET via Variable Augmented Invertible Network. (arXiv:2311.00735v1 [cs.LG])

Title: Detecting Generated Images by Real Images Only. (arXiv:2311.00962v1 [cs.CV])

Title: A Chronological Survey of Theoretical Advancements in Generative Adversarial Networks for Computer Vision. (arXiv:2311.00995v1 [cs.CV])

Title: Novel View Synthesis from a Single RGBD Image for Indoor Scenes. (arXiv:2311.01065v1 [cs.CV])

Title: Semantic Scene Graph Generation Based on an Edge Dual Scene Graph and Message Passing Neural Network. (arXiv:2311.01192v1 [cs.CV])

Title: Robust Identity Perceptual Watermark Against Deepfake Face Swapping. (arXiv:2311.01357v1 [cs.CV])

Title: Multi-dimensional data refining strategy for effective fine-tuning LLMs. (arXiv:2311.01049v1 [cs.CL])

Title: Generative Input: Towards Next-Generation Input Methods Paradigm. (arXiv:2311.01166v1 [cs.CL])

Title: People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection. (arXiv:2311.01270v1 [cs.CL])

Title: Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information. (arXiv:2311.01326v1 [cs.CL])

Title: Monotone Generative Modeling via a Gromov-Monge Embedding. (arXiv:2311.01375v1 [cs.LG])

Title: Identifying Alzheimer Disease Dementia Levels Using Machine Learning Methods. (arXiv:2311.01428v1 [cs.LG])

anomaly

Title: Cheating Depth: Enhancing 3D Surface Anomaly Detection via Depth Simulation. (arXiv:2311.01117v1 [cs.CV])

in-context

Title: Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models. (arXiv:2311.00871v1 [cs.LG])

memory

Title: Like an Open Book? Read Neural Network Architecture with Simple Power Analysis on 32-bit Microcontrollers. (arXiv:2311.01344v1 [cs.CR])

Title: Zero Coordinate Shift: Whetted Automatic Differentiation for Physics-informed Operator Learning. (arXiv:2311.00860v1 [cs.LG])

few-shot

Title: Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation. (arXiv:2311.01034v1 [cs.CV])

Title: Multi-view Relation Learning for Cross-domain Few-shot Hyperspectral Image Classification. (arXiv:2311.01212v1 [cs.CV])

Title: Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing. (arXiv:2311.00835v1 [cs.CL])