Accéder au contenu principal
AccueilPython

Spoken Language Processing in Python

Learn how to load, transform, and transcribe speech from raw audio files in Python.

Commencer Le Cours Gratuitement
4 heures14 vidéos53 exercices7 286 apprenantsTrophyDéclaration de réalisation

Créez votre compte gratuit

GoogleLinkedInFacebook

ou

En continuant, vous acceptez nos Conditions d'utilisation, notre Politique de confidentialité et le fait que vos données sont stockées aux États-Unis.
Group

Formation de 2 personnes ou plus ?

Essayer DataCamp for Business

Apprécié par les apprenants de milliers d'entreprises


Description du cours

Learn Speech Recognition and Spoken Language Processing in Python

We learn to speak far before we learn to read. Even in the digital age, our main method of communication is speech. Spoken Language Processing in Python will help you load, transform, and transcribe audio files. You’ll start by seeing what raw audio looks like in Python, and move on to exploring popular libraries and working through an example business use case.

Use Python SpeechRecognition and PyDub to Transcribe Audio Files

Python has a number of popular libraries that help you to process spoken language. SpeechRecognition offers you an easy way to integrate with speech-to-text APIs, while PyDub helps you to programmatically alter audio file attributes to get them ready for transcription. Each of these libraries is covered in an in-depth chapter, offering you the opportunity to put theory into practice to cement your knowledge.

Practice Speech Transcription with an In-Course Project

The final chapter in this course offers you the opportunity to put everything you’ve learned together by building a speech processing proof of concept for a fictional technology company. You’ll build a system that transcribes phone call audio to text and then performs sentiment analysis to review customer support phone calls.

By the end of this course, you’ll have both the knowledge and hands-on experience to put your learning into practice within your job or personal projects.
Pour les entreprises

Formation de 2 personnes ou plus ?

Donnez à votre équipe l’accès à la plateforme DataCamp complète, y compris toutes les fonctionnalités.
DataCamp Pour Les EntreprisesPour une solution sur mesure , réservez une démo.

Dans les titres suivants

Traitement du langage naturel en Python

Aller à la piste
  1. 1

    Introduction to Spoken Language Processing with Python

    Gratuit

    Audio files are different from most other types of data. Before you can start working with them, they require some preprocessing. In this chapter, you'll learn the first steps to working with speech files by converting two different audio files into soundwaves and comparing them visually.

    Jouez Au Chapitre Maintenant
    Introduction to audio data in Python
    50 xp
    The right frequency
    50 xp
    Importing an audio file with Python
    100 xp
    Converting sound wave bytes to integers
    50 xp
    The right data type
    50 xp
    Bytes to integers
    100 xp
    Finding the time stamps
    100 xp
    Visualizing sound waves
    50 xp
    Staying consistent
    50 xp
    Processing audio data with Python
    100 xp
  2. 2

    Using the Python SpeechRecognition library

    Speech recognition is still far from perfect. But the SpeechRecognition library provides an easy way to interact with many speech-to-text APIs. In this section, you'll learn how to use the SpeechRecognition library to easily start converting the spoken language in your audio files to text.

    Jouez Au Chapitre Maintenant
  3. 3

    Manipulating Audio Files with PyDub

    Not all audio files come in the same shape, size or format. Luckily, the PyDub library by James Robert provides tools which you can use to programmatically alter and change different audio file attributes such as frame rate, number of channels, file format and more. In this chapter, you'll learn how to use this helpful library to ensure all of your audio files are in the right shape for transcription.

    Jouez Au Chapitre Maintenant
Pour les entreprises

Formation de 2 personnes ou plus ?

Donnez à votre équipe l’accès à la plateforme DataCamp complète, y compris toutes les fonctionnalités.

Dans les titres suivants

Traitement du langage naturel en Python

Aller à la piste

ensembles de données

Pre- and post-purchase audio snippet transcriptions

collaborateurs

Collaborator's avatar
Hillary Green-Lerman
Collaborator's avatar
Adrián Soto
Collaborator's avatar
Maggie Matsui
Daniel Bourke HeadshotDaniel Bourke

Machine Learning Engineer and YouTube creator

Voir Plus

Qu’est-ce que les autres apprenants ont à dire ?

Inscrivez-vous 15 millions d’apprenants et commencer Spoken Language Processing in Python Aujourd’hui!

Créez votre compte gratuit

GoogleLinkedInFacebook

ou

En continuant, vous acceptez nos Conditions d'utilisation, notre Politique de confidentialité et le fait que vos données sont stockées aux États-Unis.