Biomedical Imaging Group
Logo EPFL
    • Splines Tutorials
    • Splines Art Gallery
    • Wavelets Tutorials
    • Image denoising
    • ERC project: FUN-SP
    • Sparse Processes - Book Preview
    • ERC project: GlobalBioIm
    • The colored revolution of bioimaging
    • Deconvolution
    • SMLM
    • One-World Seminars: Representer theorems
    • A Unifying Representer Theorem
Follow us on Twitter.
Join our Github.
Masquer le formulaire de recherche
Menu
BIOMEDICAL IMAGING GROUP (BIG)
Laboratoire d'imagerie biomédicale (LIB)
  1. School of Engineering STI
  2. Institute IEM
  3.  LIB
  4.  Student Projects
  • Laboratory
    • Laboratory
    • Laboratory
    • People
    • Jobs and Trainees
    • News
    • Events
    • Seminars
    • Resources (intranet)
    • Twitter
  • Research
    • Research
    • Researchs
    • Research Topics
    • Talks, Tutorials, and Reviews
  • Publications
    • Publications
    • Publications
    • Database of Publications
    • Talks, Tutorials, and Reviews
    • EPFL Infoscience
  • Code
    • Code
    • Code
    • Demos
    • Download Algorithms
    • Github
  • Teaching
    • Teaching
    • Teaching
    • Courses
    • Student projects
  • Splines
    • Teaching
    • Teaching
    • Splines Tutorials
    • Splines Art Gallery
    • Wavelets Tutorials
    • Image denoising
  • Sparsity
    • Teaching
    • Teaching
    • ERC project: FUN-SP
    • Sparse Processes - Book Preview
  • Imaging
    • Teaching
    • Teaching
    • ERC project: GlobalBioIm
    • The colored revolution of bioimaging
    • Deconvolution
    • SMLM
  • Machine Learning
    • Teaching
    • Teaching
    • One-World Seminars: Representer theorems
    • A Unifying Representer Theorem

Students Projects

Proposals  On-Going  Completed  

Learning-Based Attenuation of the Noise of a Speech Recording

Autumn 2016
Master Diploma
Project: 00319

00319

Linear predictive coding (LPC) is a time-honored reversible decomposition of speech in two components. One component encodes formants, which describe a combination of the configuration of the speaker's vocal tract—and perhaps the acoustics of the room. This component is also related to what is perceptively relevant to a human ear, particularly in terms of the categorization of vowels. The other component, called residue, encodes pitch and the transients of the speech signal, among them consonants.

In this project, the student will first apply LPC to the clean section of the speech recording of a single speaker. He will then establish two dictionaries, one for formants and one for residues. This particular recording happens to also contain a dirty section that we want to cleanup. To do so, the raw formants and residues of the dirty section will be replaced by their nearest entry within the dictionaries learned from the clean section. In doing so, we hope to improve the perceptual quality where needed.

The prerequisites for this project are a good mastering of signal processing and linear algebra. The work will consist of theoretical and algorithmic developments, as well as their application to speech data.

  • Supervisors
  • Denis Fortun, denis.fortun@epfl.ch, 35136, BM 4.138
  • Michael Unser, michael.unser@epfl.ch, 021 693 51 75, BM 4.136
  • Laboratory
  • Research
  • Publications
  • Code
  • Teaching
    • Courses
    • Student projects
Logo EPFL, Ecole polytechnique fédérale de Lausanne
Emergencies: +41 21 693 3000 Services and resources Contact Map Webmaster email

Follow EPFL on social media

Follow us on Facebook. Follow us on Twitter. Follow us on Instagram. Follow us on Youtube. Follow us on LinkedIn.
Accessibility Disclaimer Privacy policy

© 2023 EPFL, all rights reserved