Biomedical Imaging Group
Logo EPFL
    • Splines Tutorials
    • Splines Art Gallery
    • Wavelets Tutorials
    • Image denoising
    • ERC project: FUN-SP
    • Sparse Processes - Book Preview
    • ERC project: GlobalBioIm
    • The colored revolution of bioimaging
    • Deconvolution
    • SMLM
    • One-World Seminars: Representer theorems
    • A Unifying Representer Theorem
Follow us on Twitter.
Join our Github.
Masquer le formulaire de recherche
Menu
BIOMEDICAL IMAGING GROUP (BIG)
Laboratoire d'imagerie biomédicale (LIB)
  1. School of Engineering STI
  2. Institute IEM
  3.  LIB
  4.  Speaker Recognition
  • Laboratory
    • Laboratory
    • Laboratory
    • People
    • Jobs and Trainees
    • News
    • Events
    • Seminars
    • Resources (intranet)
    • Twitter
  • Research
    • Research
    • Researchs
    • Research Topics
    • Talks, Tutorials, and Reviews
  • Publications
    • Publications
    • Publications
    • Database of Publications
    • Talks, Tutorials, and Reviews
    • EPFL Infoscience
  • Code
    • Code
    • Code
    • Demos
    • Download Algorithms
    • Github
  • Teaching
    • Teaching
    • Teaching
    • Courses
    • Student projects
  • Splines
    • Teaching
    • Teaching
    • Splines Tutorials
    • Splines Art Gallery
    • Wavelets Tutorials
    • Image denoising
  • Sparsity
    • Teaching
    • Teaching
    • ERC project: FUN-SP
    • Sparse Processes - Book Preview
  • Imaging
    • Teaching
    • Teaching
    • ERC project: GlobalBioIm
    • The colored revolution of bioimaging
    • Deconvolution
    • SMLM
  • Machine Learning
    • Teaching
    • Teaching
    • One-World Seminars: Representer theorems
    • A Unifying Representer Theorem

Text Independent Speaker Recognition

P. Thévenaz

AGEN Mitteilungen, no. 52, pp. 35-45, November 1990.


The author describes an automatic speaker recognition technique, in a text independent context. By text independence, it is meant that no password has to be used, reducing the risk of loss or mimicry. To this end, four different methods are selected whose performances are individually reviewed. Then, the author shows how to combine them in order to enhance the global result. The first method characterizes a speaker by his mean cepstrum, averaged over time. The second method is based on a technique of accumulation of vector quantization error; the parameters used are also cepstral vectors. Using their time derivatives instead, the third method is produced, which is otherwise identical to the previous one. The fourth and last method exploits the histogram of entries in a universal cepstrum codebook, according to a vector quantization technique. Finally, the author combines the results by the Fisher linear discriminant analysis. It is shown by a series of telephone-bandwidth experiments, that the methods behave well except for the third method which had to be rejected due to its bad performance. However, the combination of the other three methods is even more successful than any single method.

@ARTICLE(http://bigwww.epfl.ch/publications/thevenaz9002.html,
AUTHOR="Th{\'{e}}venaz, P.",
TITLE="Text Independent Speaker Recognition",
JOURNAL="{AGEN} Mitteilungen",
YEAR="1990",
volume="52",
number="",
pages="35--45",
month="November",
note="")
© 1990 AGEN. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from AGEN. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
  • Laboratory
  • Research
  • Publications
    • Database of Publications
    • Talks, Tutorials, and Reviews
    • EPFL Infoscience
  • Code
  • Teaching
Logo EPFL, Ecole polytechnique fédérale de Lausanne
Emergencies: +41 21 693 3000 Services and resources Contact Map Webmaster email

Follow EPFL on social media

Follow us on Facebook. Follow us on Twitter. Follow us on Instagram. Follow us on Youtube. Follow us on LinkedIn.
Accessibility Disclaimer Privacy policy

© 2023 EPFL, all rights reserved