1. Speech Analysis (SPE-ANLS)
  Spectral and other time-frequency analysis techniques
  Segmental and suprasegmental analysis
  Distortion measures
  Extraction of non-linguistic information (e.g., gender, stress, etc)
  Voice/speech disorders
  Speaker localization (space) (e.g., in meetings)
  Speaker diarization (time) (e.g., in meetings)
  Speaker clustering (e.g., in Broadcast news)

2. Speech Enhancement (SPE-ENHA)
  Control and reduction of channel noise (e.g., reverb, room response)
  Perceptual enhancement of non-noisy speech
  Speech enhancement for humans with hearing impairments
  Non-acoustic microphones for enhancement
  Bandwidth expansion
  Noise Reduction

3. Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)
  Feature Extraction
  Low-level feature modeling - Gaussians & beyond
  Pronunciation modeling at the acoustic level
  State clustering and novel state definitions
  Prosody and other speech characteristics
  Dialect, accent, and idiolect at the acoustic level
  Discriminative Acoustic Training Methods for ASR
  Articulatory and physiological modeling
  Feature Transformation and Normalization

4. Robust Speech Recognition (SPE-ROBU)
  Features specifically for robust ASR (noise, channel, etc)
  Model/backend based robust ASR
  Confidence measures and rejection
  Speech Activity/End-point detection
  Barge-in
  Non-acoustic microphones for ASR

5. Speech Adaptation/Normalization (SPE-ADAP)
  Speaker adaptation and normalization (e.g., VTLN)
  Speaker adapted training methods
  Environmental/Channel adaptation
  Idiolect adaptation
  Register and/or dialect adaptation

6. General Topics in Speech Recognition (SPE-GASR)
 Distributed Speech Recognition - Client/Server methods
 Alternative Statistical/Machine Learning Methods (e.g., no HMMs)
 Word spotting
 Metadata (e.g., emotion, speaker, accent) extraction from acoustics
 New algorithms, computational strategies, data-structures for ASR
 Multi-modal (such as audio-visual) speech recognition
 Corpora, annotation, and other resources
 Algorithm approximation methods in ASR
 Structured classification approaches

7. Multilingual Recognition and Identification (SPE-MULT)
 Language (LID) and dialect (DID) identification
 Multilingual Speech recognition
 Processing of non-native accents

8. Lexical Modeling and Access (SPE-LEXI)
 Pronunciation modeling at the lexical level
 Dialect, accent, and idiolect at the lexical level
 Multilingual aspects (e.g., unit selection)
 Automatic lexicon learning

9. Large Vocabulary Continuous Recognition/Search (SPE-LVCR)
 Decoding algorithms and implementation
 Lattices
 Multi-pass strategies
 Miscellaneous Topics

10. Speaker Recognition and Characterization (SPE-SPKR)
 Features and characteristics for speaker recognition
 Robustness to variable and degraded channels
 Verification, identification, segmentation, and clustering
 Speaker characterization and adaptation
 Speaker recognition with speech recognition
 Speaker confidence estimation
 Multimodal and multimedia human speaker recognition
 Corpora, annotation, evaluation, and other resources
 Higher-level knowledge in speaker recognition

11. Resource constrained speech recognition (SPE-RCSR)
 Low-power speech recognition
 Reduced computation speech recognition
 ASR techniques for highly portable/mobile devices

12. Spoken Language Understanding (SLP-UNDE)
 Paralinguistic (emotion, age, gender, rate, etc.) information
 Nonlinguistic (meaning external to language) information, gestures, etc.
 Semantic classification
 Question/answering from speech
 Entity extraction from speech
 Spoken document summarization
 Detecting linguistic/discourse structure (e.g., disfluencies, sentence/topic boundaries, speech acts)
 Relation to and interpretation of sign language

13. Human Spoken Language Acquisition, Development and Learning (SLP-LADL)
 Language acquisition, development, and learning models
 Computer aids for language learning
 Attributes and modeling techniques for assessment of language fluency

14. Spoken and Multimodal Dialog Systems and Applications (SLP-SMMD)
 Spoken and multimodal dialog systems, applications, and architectures
 Stochastic Learning for dialog modeling
 Response Generation
 Technologies for the aged
 Evaluation metrics and standards
 Speech/voice-based human-computer interfaces (HCI)
 Speech HCI for individuals with impairments (blindness, etc.) and universal access (UA)
 other applications

15. Speech data mining and Document Retrieval (SLP-SMIR)
 Analysis and Evaluations for mining spoken data
 Search/retrieval of speech documents
 Mining heterogeneous speech and multimedia data
 Speech data mining theory, algorithms, and methods
 Core machine learning algorithms for data mining
 Topic spotting and classification
 Pattern discovery and prediction from data
 Applications and tools for speech data mining

16. Machine Translation of Speech (SLP-SSMT)
 Semi-automatic and data driven methods
 Speech processing for MTS
 Corpora, annotation, and other resources
 Interlingua and transfer approaches
 Integration of speech and linguistic processing
 Machine transliteration for named entities
 Evaluation metrics (e.g., BLEU)
 Systems and applications for MTS

17. Language Modeling, for Speech and SLP (SLP-LANG)
 N-grams, their generalizations and smoothing methods.
 Language Model Adaptation
 Grammar based language modeling
 Maxent and feature based language modeling
 Dialect, accent, and idiolect at the language level
 Discriminative LM Training Methods
 Other approaches to LMs
 Structured classification approaches

18. Spoken language resources and annotation (SLP-REAN)
 General corpora, annotation, and other resources

19. Other Applications of Speech Recognition and Understanding