Department of Electrical and Computer Engineering

MENUMENU
  • Department
      • Profile
      • Faculty
      • Evaluation
      • Administration
      • Staff
  • Studies
    • Subject Areas
    • Undergraduate Studies
    • Postgraduate Studies
      • MSc Studies in “Science and Technology of ECE”
      • MSc Studies in “Smart Grid Energy Systems”
      • MSc Studies in “Applied Informatics”
    • PhD Studies
    • Course List
      • Undergraduate Courses
      • Postgraduate Courses
        • Science and Technology of ECE
        • Smart Grid Energy Systems
        • Applied Informatics
      • Erasmus
    • ECTS
    • Career Opportunities
    • Practise Training
  • Research
    • Labs
    • Research Projects
    • Postdoc Research
    • Ph.D. Candidates
    • Theses – Technical Reports
    • Active Research Projects

      MLSysOps: Machine Learning for Autonomic System Operation in the Heterogeneous Edge-Cloud Continuum

      Scientific Responsible

      Spyros LalisSpyros Lalis, Professor
      E-mail: lalis@e-ce.uth.gr

      TitleMLSysOps: Machine Learning for Autonomic System Operation in the Heterogeneous Edge-Cloud Continuum
      Duration2023 – 2025
      Sitehttps://csl.e-ce.uth.gr/projects/mlsysops

      Read More

  • Alumni
    • Ph.D. Graduates
  • Service Offices
    • Secretariat
    • Technical support
  • Announcements
    • General Announcements
    • Academic News
  • Contact
    • Department of Electrical and Computer Engineering
      • Sekeri – Cheiden Str
        Pedion Areos, ECE Building
        383 34 Volos – Greece
      Tel.+30 24210 74967, +30 24210 74934
      e-mailgece ΑΤ e-ce.uth.gr
      PGS Tel.+30 24210 74933
      PGS e-mailpgsec ΑΤ e-ce.uth.gr
      URLhttps://www.e-ce.uth.gr/contact-info/?lang=en
  • Login

ECE443 Speech and Audio Processing

Home » Studies » Undergraduate Studies » Undergraduate Courses » ECE443 Speech and Audio Processing
Subject AreaSignals, Communications, and Networking
SemesterSemester 7 – Fall
TypeElective
Teaching Hours4
ECTS6
Prerequisites
  • ECE218 Signals and Systems
Recommended Courses
  • ECE334 Pattern Recognition
Course Director

Gerasimos PotamianosGerasimos Potamianos, Associate Professor
E-mail: gpotamianos@e-ce.uth.gr

Course Instructor
  • Aikaterini Papadimitriou, Academic Teaching Experience
    E-mail: aipapadimitriou@uth.gr
  • Description
  • Learning Outcomes

The course covers basic concepts in speech and audio processing, with its main focus being human speech, in particular its production, perception, representation, coding, synthesis, and recognition. In addition, processing of audio signals, in particular of music signals, is also covered. In summary, the course covers the following topics:

  • Introduction to digital speech processing.
  • A brief review of fundamentals of digital signal processing.
  • Fundamentals of human speech production and sound propagation in the human vocal tract.
  • Hearing, auditory models, and speech perception.
  • Time-domain methods for speech processing.
  • Frequency domain representation.
  • Homomorphic speech processing and cepstrum.
  • Linear predictive analysis of speech signals.
  • Algorithms for estimating speech parameters.
  • Digital coding of speech signals.
  • Frequency domain coding of speech and audio.
  • Text-to-speech synthesis.
  • Automatic speech recognition using hidden Markov models.
  • Feature extraction and recognition of music signals.
  • Basic computational tools in Matlab corresponding to the above (including the MIR and OpenSMILE toolboxes).
  • Brief introduction to the hidden Markov model toolkit (HTK).

This course introduces students to the basic concepts and algorithms in speech and audio processing, with its main focus being human speech, but also covering more general audio signals, in particular music ones. The course also provides numerous examples to allow student familiarization with the above, as well as practical computational tools within the Matlab and HTK software frameworks, further demonstrating these.

The course provides further specialization to the students, as a continuation of the digital signal processing and pattern recognition courses, allowing them to further delve into the study of the specific signals (speech, audio).

Students successfully completing this class will have mastered the main concepts, algorithms, and tools in the processing and recognition of speech and more general audio signals. For example, they will be able to:

  • Understand the process of human speech production and perception.
  • Extract appropriate features from speech signals in various domains and select the most suitable among them for the particular problem at hand.
  • Be able to perform speech recognition and speech synthesis with basic algorithms.
  • Extract a variety of features from music signals.
  • Implement programs in Matlab / OpenSMILE / HTK to perform the aforementioned tasks.

e-Yπηρεσίες

Contact Info

  • Sekeri – Cheiden Str, Pedion Areos, Volos
  • +30 24210 74967
  • +30 24210 74934
  • Email: gece@e-ce.uth.gr

Announcements

  • Academic News

Find us

  • Facebook
  • Twitter
  • Youtube
  • Linkedin
© Copyright 2025 Department of Electrical and Computer Engineering
We use cookies to ensure that we give you the best experience on our website.OKΠληροφορίες