|
|
 |
 |
|
IEEE Transactions on
AUDIO, SPEECH & LANGUAGE PROCESSING
|
 |
 |
|
EDICs Categories
SPEECH PROCESSING
- SPE-SPRD Speech Production
Physical models of the vocal production system;
bioacoustics and medical acoustics;
singing and properties of the musical voice.
- SPE-SPER Speech Perception and Psychoacoustics
Models of Speech Perception;
hearing and psychoacoustics;
physiological models and applications thereof;
audiology applications.
- SPE-ANLS Speech Analysis
Spectral and other time-frequency analysis techniques;
segmental and suprasegmental analysis;
distortion measures;
extraction of non-linguistic information (e.g., gender, stress, etc);
voice/speech disorders;
speaker localization (space) (e.g., in meetings);
speaker diarization (time) (e.g., in meetings);
speaker clustering (e.g., in Broadcast news).
- SPE-SYNT Speech Synthesis and Generation
Segmental-level and/or concatenative synthesis;
signal processing/statistical model for synthesis;
articulatory synthesis;
parametric synthesis;
prosody, emotional, and expressive synthesis;
text-to-phoneme conversion;
voice quality/morphing;
audio/visual speech synthesis;
multilingual synthesis;
quality assessent/evaluation metrics in synthesis;
tools and data for speech synthesis;
text processing for speech synthesis (text normalization, syntactic and semantic analysis).
- SPE-CODI Speech Coding
Narrow-band and wide-band speech coding;
theory and techniques for signal coding (e.g., waveform, transform);
modulation and source/channel coding;
quantization and compression;
robust coding for noisy channels;
coding for Voice Over IP (VOIP);
quality assessent/evaluation metrics (e.g., PESQ) in coding;
new applications of VOIP.
- SPE-ENHA Speech Enhancement
Control and reduction of channel noise (e.g., reverb, room response);
perceptual enhancement of non-noisy speech;
speech enhancement for humans with hearing impairments;
non-acoustic microphones for enhancement;
bandwidth expansion;
noise reduction.
- SPE-RECO Acoustic Modeling for Automatic Speech Recognition
Feature extraction;
low-level feature modeling - Gaussians & beyond;
pronunciation modeling;
state clustering and novel state definitions;
prosody and other speech characteristics;
dialect, accent, and idiolect at the acoustic level;
discriminative acoustic training methods for ASR;
articulatory and physiological modeling;
non-acoustic microphones for ASR;
feature transformation and normalization.
- SPE-ROBU Robust Speech Recognition
Features specifically for robust ASR (noise, channel, etc);
model/backend based robust ASR;
confidence measures and rejection;
speech activity/end-point detection;
barge-in.
- SPE-ADAP Speech Adaptation/Normalization
Speaker adaptation and normalization (e.g., VTLN);
speaker adapted training methods;
environmental/channel adaptation;
idiolect adaptation;
register and/or dialect adaptation.
- SPE-GASR General Topics in Speech Recognition
Distributed Speech Recognition - Client/Server methods;
alternative Statistical/Machine Learning Methods (e.g., no HMMs);
word spotting;
metadata (e.g., emotion, speaker, accent) extraction from acoustics;
new algorithms, computational strategies, data-structures for ASR;
multi-modal (such as audio-visual) speech recognition;
corpora, annotation, and other resources;
algorithm approximation methods in ASR;
structured classification approaches.
- SPE-MULT Multilingual Recognition and Identification
Language and dialect identification;
multilingual Speech recognition;
processing of non-native accents.
- SPE-LEXI Lexical Modeling and Access
Pronunciation modeling at the lexical level;
dialect, accent, and idiolect at the lexical level;
multilingual aspects (e.g., unit selection);
automatic lexicon learning.
- SPE-LVCR Large Vocabulary Continuous Recognition/Search
Decoding algorithms and implementation;
lattices;
multi-pass strategies;
miscellaneous Topics.
- SPE-SPKR Speaker Recognition and Characterization
Features and characteristics for speaker recognition;
robustness to variable and degraded channels;
verification, identification, segmentation, and clustering;
speaker characterization and adaptation;
speaker recognition with speech recognition;
speaker confidence estimation;
multimodal and multimedia human speaker recognition;
corpora, annotation, evaluation, and other resources;
higher-level knowledge in speaker recognition.
- SPE-RCSR Resource Constrained Speech Recognition
Low-power speech recognition;
reduced computation speech recognition;
ASR techniques for highly portable/mobile devices.
SPOKEN LANGUAGE PROCESSING
- SLP-UNDE Spoken Language Understanding
Paralinguistic (emotion, age, gender, rate, etc.) information;
nonlinguistic (meaning external to language) information, gestures, etc.;
semantic classification;
question/answering from speech;
entity extraction from speech;
spoken document summarization;
detecting linguistic/discourse structure (e.g., disfluencies, sentence/topic boundaries, speech acts);
relation to and interpretation of sign language.
- SLP-LADL Human Language Acquisition, Development and Learning
Language acquisition, development, and learning models;
computer aids for language learning;
attributes and modeling techniques for assessment of language fluency.
- SLP-SMMD Spoken and Multimodal Dialog Systems
Spoken and multimodal dialog systems, applications, and architectures;
stochastic Learning for dialog modeling;
response Generation;
technologies for the aged;
evaluations and standardizations;
speech/voice-based human-computer interfaces (HCI);
speech HCI for individuals with impairments and universal access (UA);
other applications.
- SLP-SMIR Speech Data Mining and Document Retrieval
Analysis and Evaluations for mining spoken data;
search/retrieval of speech documents;
mining heterogeneous speech and multimedia data;
speech data mining theory, algorithms, and methods;
core machine learning algorithms for data mining;
topic spotting and classification;
pattern discovery and prediction from data;
applications and tools for speech data mining.
- SLP-SSMT Machine Translation of Speech
Semi-automatic and data driven methods;
speech processing for MTS;
corpora, annotation, and other resources;
interlingua and transfer approaches;
integration of speech and linguistic processing;
machine transliteration for named entity;
evaluation metrics (e.g., BLEU);
systems and applications for MTS.
- SLP-LANG Language Modeling (for Speech and SLP)
N-grams, their generalizations and smoothing methods;
language model adaptation;
grammar based language modeling;
maxent and feature based language modeling;
dialect, accent, and idiolect at the language level;
discriminative LM training methods;
other approaches to LMs;
structured classification approaches.
- SLP-REAN Spoken Language Resources and Annotation
General corpora, annotation, and other resources
AUDIO AND ELECTROACOUSTICS
- AUD-ROOM Room Acoustics and Acoustic System Modeling
Room acoustics and acoustic system modeling;
room response measurement, modeling, simulation and compensation;
architectural and physical acoustics;
physical modeling of musical instruments;
room acoustics for music performance and reproduction.
- AUD-TRAN Transducers
Transducer modeling and design;
transducer calibration and compensation;
novel transducers.
- AUD-LMAP Loudspeaker and Microphone Array Signal Processing
Far-field and near-field beamforming and array processing;
source localization and tracking;
time-delay estimation;
audio enhancement using transducer arrays;
wavefield synthesis;
sound field analysis and synthesis.
- AUD-ANCO Active Noise Control
Acoustic noise cancellation and suppression;
adaptive techniques for feedforward control;
feedback control algorithms;
multichannel systems.
- AUD-ECHO Echo Cancellation
Single-channel and multichannel acoustic echo cancellation;
echo path estimation and modeling;
echo suppression and dereverberation;
double-talk detection;
adaptive filter theory for audio applications.
- AUD-AUDI Auditory Modeling and Hearing Aids
Human audition and psychoacoustics;
computational auditory scene analysis;
perceptual and psychophysical models of audio algorithms and systems;
perceptual measures of audio quality;
aids for the handicapped;
medical aids (cochlear implants, hearing aids);
binaural hearing.
- AUD-SSEN Audio Source Separation and Enhancement
Single-channel and multichannel source separation;
blind deconvolution;
noise reduction, compensation, and equalization;
audio denoising and restoration.
- AUD-SMCA Spatial and Multichannel Audio
Spatial sound analysis and reproduction;
spatialization and virtualization;
measurement, modeling, and use of head-related transfer functions;
crosstalk cancellation and binaural synthesis;
artificial reverberation algorithms.
- AUD-ACOD Audio Coding
Low bit-rate and high-quality audio coding;
scalable and lossless audio coding;
spatial audio coding;
joint source-channel coding;
signal representations for coding;
parametric and structured audio coding;
psychoacoustic models for coding;
objective and subjective quality assessment;
error detection, correction, and concealment.
- AUD-ANSY Audio Analysis and Synthesis
Music analysis, modification, and synthesis;
models and representations for musical signals;
pitch and multi-pitch estimation;
audio feature analysis and extraction;
melody, note, chord, key, and rhythm estimation and detection;
automatic transcription.
- AUD-CONT Content-Based Music Processing
Audio identification, indexing, and retrieval;
music semantics;
segmentation and speech/music discrimination;
audio characterization, classification, and categorization;
music thumbnailing and automatic summarization;
music fingerprinting;
music information retrieval;
data mining.
- AUD-AUMM Audio for Multimedia
Audio watermarking and data hiding;
data encryption, security, and privacy;
digital rights management;
joint processing of audio and video;
human-machine audio interfaces;
auditory displays;
distant learning and virtual reality.
- AUD-NWAU Network Audio
Audio processing for network distribution;
packet loss detection, correction, and concealment;
network audio quality assessment;
distributed processing of audio.
- AUD-SYST Audio Processing Systems
Hardware and software systems and implementations;
consumer and professional audio.
|
 |
 |
|