Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Berrak Sisman | Lattice

Berrak Sisman

Papers on Lattice

4

Total citations

0

Topics

4

h-index

26

Research focus

Speech & Audio (4)Natural Language Processing (3)Red-Teaming & Adversarial Robustness (1)Interpretability & Mechanistic Interp (1)

Frequent co-authors

Aurosweta Mahapatra (2)Ismail Rasim Ulgen (2)Nicholas Andrews (2)Hsiang Yeh (1)

Papers (4)

Apr 15, 2026

Hsiang Yeh +5Apr 15, 2026

Who is Speaking or Who is Depressed? A Controlled Study of Speaker Leakage in Speech-Based Depression Detection

Depression detection models may be learning *who* is speaking, not *how* depression manifests in speech, inflating reported accuracy.

Hsiang Yeh, Luqi Sun, Aurosweta Mahapatra +3

Natural Language Processing Speech & Audio

Apr 14, 2026

Aurosweta Mahapatra +7Apr 14, 2026·also JHU

ProSDD: Learning Prosodic Representations for Speech Deepfake Detection against Expressive and Emotional Attacks

Training on real speech prosody alone can cut speech deepfake error rates by over 70% on emotional attacks, a blindspot for current detectors.

Aurosweta Mahapatra, Aurosweta Mahapatra, Ismail Rasim Ulgen +5

Red-Teaming & Adversarial Robustness Speech & Audio

Mar 18, 2026

Xiutian Zhao +4Mar 18, 2026

Neuron-Level Emotion Control in Speech-Generative Large Audio-Language Models

Control the emotional tone of generated speech without any training by directly manipulating specific neurons within large audio-language models.

Xiutian Zhao, Ismail Rasim Ulgen, Philipp Koehn +2

Interpretability & Mechanistic Interp Natural Language Processing Speech & Audio

Mar 9, 2026

Mar 9, 2026

Universal Speech Content Factorization

Achieve zero-shot voice conversion competitive with methods requiring more data or training, using a simple, invertible linear method to disentangle speech content from speaker timbre.

Henry Li Xinyuan, Zexin Cai, Leibny Paola Garc'ia-Perera +4

Natural Language Processing Speech & Audio

Luqi Sun (1)

Aurosweta Mahapatra (1)