Direkt zum Inhalt

Startseite Python

Spoken Language Processing in Python

Learn how to load, transform, and transcribe speech from raw audio files in Python.

Kurs Kostenlos Starten

4 Stunden14 Videos53 Übungen7.291 LernendeLeistungsnachweis

Kostenloses Konto erstellen

Google LinkedIn Facebook

oder

Durch Klick auf die Schaltfläche akzeptierst du unsere Nutzungsbedingungen, unsere Datenschutzrichtlinie und die Speicherung deiner Daten in den USA.

Trainierst du 2 oder mehr?

Versuchen DataCamp for Business

Beliebt bei Lernenden in Tausenden Unternehmen

Kursbeschreibung

Learn Speech Recognition and Spoken Language Processing in Python

We learn to speak far before we learn to read. Even in the digital age, our main method of communication is speech. Spoken Language Processing in Python will help you load, transform, and transcribe audio files. You’ll start by seeing what raw audio looks like in Python, and move on to exploring popular libraries and working through an example business use case.

Use Python SpeechRecognition and PyDub to Transcribe Audio Files

Python has a number of popular libraries that help you to process spoken language. SpeechRecognition offers you an easy way to integrate with speech-to-text APIs, while PyDub helps you to programmatically alter audio file attributes to get them ready for transcription. Each of these libraries is covered in an in-depth chapter, offering you the opportunity to put theory into practice to cement your knowledge.

Practice Speech Transcription with an In-Course Project

The final chapter in this course offers you the opportunity to put everything you’ve learned together by building a speech processing proof of concept for a fictional technology company. You’ll build a system that transcribes phone call audio to text and then performs sentiment analysis to review customer support phone calls.

By the end of this course, you’ll have both the knowledge and hands-on experience to put your learning into practice within your job or personal projects.

Für Unternehmen

Trainierst du 2 oder mehr?

Verschaffen Sie Ihrem Team Zugriff auf die vollständige DataCamp-Plattform, einschließlich aller Funktionen.

In den folgenden Tracks

Natürliche Sprachverarbeitung in Python

1
Introduction to Spoken Language Processing with Python
Kostenlos
Audio files are different from most other types of data. Before you can start working with them, they require some preprocessing. In this chapter, you'll learn the first steps to working with speech files by converting two different audio files into soundwaves and comparing them visually.
Kapitel Jetzt Abspielen
Introduction to audio data in Python
50 xp
The right frequency
50 xp
Importing an audio file with Python
100 xp
Converting sound wave bytes to integers
50 xp
The right data type
50 xp
Bytes to integers
100 xp
Finding the time stamps
100 xp
Visualizing sound waves
50 xp
Staying consistent
50 xp
Processing audio data with Python
100 xp
2
Using the Python SpeechRecognition library
Speech recognition is still far from perfect. But the SpeechRecognition library provides an easy way to interact with many speech-to-text APIs. In this section, you'll learn how to use the SpeechRecognition library to easily start converting the spoken language in your audio files to text.
Kapitel Jetzt Abspielen
SpeechRecognition Python library
50 xp
Pick the wrong speech_recognition API
50 xp
Using the SpeechRecognition library
100 xp
Using the Recognizer class
100 xp
Reading audio files with SpeechRecognition
50 xp
From AudioFile to AudioData
100 xp
Recording the audio we need
100 xp
Dealing with different kinds of audio
50 xp
Different kinds of audio
100 xp
Multiple Speakers 1
100 xp
Multiple Speakers 2
100 xp
Working with noisy audio
100 xp
3
Manipulating Audio Files with PyDub
Not all audio files come in the same shape, size or format. Luckily, the PyDub library by James Robert provides tools which you can use to programmatically alter and change different audio file attributes such as frame rate, number of channels, file format and more. In this chapter, you'll learn how to use this helpful library to ensure all of your audio files are in the right shape for transcription.
Kapitel Jetzt Abspielen
Introduction to PyDub
50 xp
Import an audio file with PyDub
100 xp
Play an audio file with PyDub
100 xp
Audio parameters with PyDub
100 xp
Adjusting audio parameters
100 xp
Manipulating audio files with PyDub
50 xp
Turning it down... then up
100 xp
Normalizing an audio file with PyDub
100 xp
Chopping and changing audio files
100 xp
Splitting stereo audio to mono with PyDub
100 xp
Converting and saving audio files with PyDub
50 xp
Exporting and reformatting audio files
100 xp
Manipulating multiple audio files with PyDub
100 xp
An audio processing workflow
100 xp
4
Processing text transcribed from spoken language
In this chapter, you'll put everything you've learned together by building a speech processing proof of concept project for a technology company, Acme Studios. You'll start by transcribing customer support call phone call audio snippets to text. Then you'll perform sentiment analysis using NLTK, named entity recognition using spaCy and text classification using scikit-learn on the transcribed text.
Kapitel Jetzt Abspielen
Creating transcription helper functions
50 xp
Converting audio to the right format
100 xp
Finding PyDub stats
100 xp
Transcribing audio with one line
100 xp
Using the helper functions you've built
100 xp
Sentiment analysis on spoken language text
50 xp
Analyzing sentiment of a phone call
100 xp
Sentiment analysis on formatted text
100 xp
Named entity recognition on transcribed text
50 xp
Named entity recognition in spaCy
100 xp
Creating a custom named entity in spaCy
100 xp
Classifying transcribed speech with Sklearn
50 xp
Preparing audio files for text classification
100 xp
Transcribing phone call excerpts
100 xp
Organizing transcribed phone call data
100 xp
Create a spoken language text classifier
100 xp
Congratulations!
50 xp

Für Unternehmen

Trainierst du 2 oder mehr?

Verschaffen Sie Ihrem Team Zugriff auf die vollständige DataCamp-Plattform, einschließlich aller Funktionen.

In den folgenden Tracks

Natürliche Sprachverarbeitung in Python

Datensätze

Pre- and post-purchase audio snippet transcriptions

Mitwirkende

Hillary Green-Lerman

Adrián Soto

Maggie Matsui

Voraussetzungen

Introduction to Natural Language Processing in Python Supervised Learning with scikit-learn

Machine Learning Engineer and YouTube creator

Was sagen andere Lernende?

Melden Sie sich an 15 Millionen Lernende und starten Sie Spoken Language Processing in Python Heute!

Kostenloses Konto erstellen

Google LinkedIn Facebook

oder

Durch Klick auf die Schaltfläche akzeptierst du unsere Nutzungsbedingungen, unsere Datenschutzrichtlinie und die Speicherung deiner Daten in den USA.