Speech to Text Design Project

This is a project made by a student group from McGill University as a design course project. Team member: Robyn Chen, Fei Feng, Yan Ren, Even Wang

This project would not have been possible without the help and kind support of our supervisors and the Radio-Canada Digital R&D Lab team. We would like to express our deepest appreciation to Xavier K. Richard and Thomas Le Jouan for their guidance and supervision as well as for providing support throughout the entire part one of this design project.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

What things you need to install the software and how to install them

MongoDB - The backend database used
NodeJS - The web framework used

Database setup

In app.js, find the code

mongoose.connect('mongodb://localhost:27017/test', { useMongoClient: true });

Replace with the database you are using.

Create two collections:

media
subs

Server setup

Install the dependencies

npm install

Start the backend

node app.js

Google Cloud Platform setup

Create Google Cloud Platform account and activate Cloud Storage and Cloud Speech API.

After Google Cloud Platform account, store the JSON key file in project root directory.

In /controller/storage.js, find the code

const bucketName = 'speech-to-text-sandbox';
const keyFileLocation = path.join(__dirname, '..', 'speech-to-text-sandbox-9b4c51ccdb39.json');

Replace with your JSON key file and cloud storage bucket.

In /controller/stt.js, find the code

const keyFileLocation = path.join(__dirname, '..', 'speech-to-text-sandbox-9b4c51ccdb39.json');

Replace with you JSON key file.

Test the project

Use .flac mono channel file to test the project. Or use the file in /testingAudioFile for simple testing.

https://cloud.google.com/speech/reference/rest/v1/RecognitionConfig#AudioEncoding

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
controllers		controllers
doc		doc
helpers		helpers
models		models
output		output
public		public
test		test
testingAudioFile		testingAudioFile
uploads		uploads
views		views
.env		.env
.gitignore		.gitignore
README.md		README.md
Speech-to-Text-DesignProject_v1_2017.zip		Speech-to-Text-DesignProject_v1_2017.zip
app.js		app.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech to Text Design Project

Getting Started

Prerequisites

Database setup

Server setup

Google Cloud Platform setup

Test the project

About

Releases

Packages

Contributors 2

Languages

yan-ren/speech-to-text-design-project

Folders and files

Latest commit

History

Repository files navigation

Speech to Text Design Project

Getting Started

Prerequisites

Database setup

Server setup

Google Cloud Platform setup

Test the project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages