diffusion

Title: DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models. (arXiv:2308.00122v1 [cs.CV])

Title: InFusion: Inject and Attention Fusion for Multi Concept Zero Shot Text based Video Editing. (arXiv:2308.00135v1 [cs.CV])

Title: Diffusion Model for Camouflaged Object Detection. (arXiv:2308.00303v1 [cs.CV])

Title: Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models. (arXiv:2308.00675v1 [cs.CL])

Title: DiffusAL: Coupling Active Learning with Graph Diffusion for Label-Efficient Node Classification. (arXiv:2308.00146v1 [cs.LG])

self-supervised

Title: Visual Geo-localization with Self-supervised Representation Learning. (arXiv:2308.00090v1 [cs.CV])

Title: Relational Contrastive Learning for Scene Text Recognition. (arXiv:2308.00508v1 [cs.CV])

Title: Predicting masked tokens in stochastic locations improves masked image modeling. (arXiv:2308.00566v1 [cs.CV])

Title: AnyLoc: Towards Universal Visual Place Recognition. (arXiv:2308.00688v1 [cs.CV])

Title: EEG-based Cognitive Load Classification using Feature Masked Autoencoding and Emotion Transfer Learning. (arXiv:2308.00246v1 [cs.LG])

foundation model

Title: Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding. (arXiv:2308.00353v1 [cs.CV])

generative

Title: Controlling Geometric Abstraction and Texture for Artistic Images. (arXiv:2308.00148v1 [cs.CV])

Title: Domain Adaptation based on Human Feedback for Enhancing Generative Model Denoising Abilities. (arXiv:2308.00307v1 [cs.CV])

Title: Generative Models as a Complex Systems Science: How can we make sense of large language model behavior?. (arXiv:2308.00189v1 [cs.LG])

Despite the ever increasing number of benchmarks that measure task performance, we lack explanations of what behaviors language models exhibit that allow them to complete these tasks in the first place. We argue for a systematic effort to decompose language model behavior into categories that explain cross-task performance, to guide mechanistic explanations and help future-proof analytic research.

Title: ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation. (arXiv:2308.00400v1 [cs.CL])

Title: Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges. (arXiv:2308.00031v1 [cs.LG])

Title: Graph Contrastive Learning with Generative Adversarial Network. (arXiv:2308.00535v1 [cs.LG])

anomaly

Title: Patch-wise Auto-Encoder for Visual Anomaly Detection. (arXiv:2308.00429v1 [cs.CV])

Title: PressureTransferNet: Human Attribute Guided Dynamic Ground Pressure Profile Transfer using 3D simulated Pressure Maps. (arXiv:2308.00538v1 [cs.CV])

Title: Using Kernel SHAP XAI Method to optimize the Network Anomaly Detection Model. (arXiv:2308.00074v1 [cs.LG])

Title: A Survey of Time Series Anomaly Detection Methods in the AIOps Domain. (arXiv:2308.00393v1 [cs.LG])

in-context

Title: Reasoning before Responding: Integrating Commonsense-based Causality Explanation for Empathetic Response Generation. (arXiv:2308.00085v1 [cs.CL])

Title: Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models. (arXiv:2308.00304v1 [cs.CL])