Skip to content

Shadowsith/qpicospeaker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

60 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Build status

QPicoSpeaker

QPicoSpeaker is a Qt text to speech (TTS) GUI for the pico2wave console application.


Motivation

I had sound problems with the Gespeaker UI for espeak and I also was unhappy with the robotic sounds from espeak.
As better solution for the unstatisfied TTS situation on the Linux desktop I develop this application.

Third party requirements:

  • qt5 + qt5-multimedia
  • pico2wave
  • sox
  • mplayer for Google TTS

Install requirements:

Debian/Ubuntu

sudo apt-get install sox mplayer libttspico-utils

For building you also need qt5 + multimedia packages:
sudo apt-get install qt5-default qtmultimedia5-dev

  • git clone https://github.com/Shadowsith/qpicospeaker
  • cd qpicospeaker
  • sudo ./install.sh

Install paths:

  • Binary: /usr/bin/qpicospeaker
  • Config: /home/<user>/.config/qpicospeaker/config.xml

To uninstall QPicospeaker write ./install.sh --uninstall

If the project is in a more finished state there will be a package in the Ubuntu launchpad too.

Arch Linux

There is a offical maintained package in the AUR: yay -S qpicospeaker-git

Tested Systems

  • Manjaro/Arch Linux (KDE)
  • Ubuntu (MATE) 18.04

Supported languages (TTS):

  • English (US)
  • English (UK)
  • German
  • Spain
  • French
  • Italian

Implemented features

  • Write a text in the textinput and let it say from the TTS engine
  • Change speak, loudness and pitch form the TTS software
  • Load/save textfile form/to textinput field
  • Save output file to the location what you want
  • Logo/Icon
  • Connection to Google Translate TTS engine

Upcomming features

  • Default Settings
  • Translation to other languages
  • All Languages for Google TTS
  • removing mplayer to curl
  • resizing UI
  • Help documentation

Future planned

  • eSpeak speech synthesizer as alternative to pico2wave and Google
  • IBM Watson TTS support

Restictions

  • no male voice for Google TTS (no acces about connection string possible)

Screenshots

License

  • Project is free software and licensed under GPLv3+
  • Logo and GIMP Project are public domain (CC0)