September-Assistant - Open-source Windows Voice Assistant

Last update: Nov 22, 2022

Overview

September - Windows Assistant

September is an open-source Windows personal assistant built-in python. Read How to Setup? for additional information on setting up the application.

🏆 Install

Download Zip File

🔮 Features

who.are.you.mp4

📚 Overview

September uses	For
Google Speech Recognition	Speech recognition
Pyttsx3	Text to Speech
Tkinter	GUI
Wikipedia API	Wiki related search
Wolfram Alpha	Processing queries

💻 Code Walkthrough

main.py contains the functions for initializing Tkinter-based windows and converting speech to text.
- The listening process and tkinter frame refresh process run concurrently using threads.
- Each window is started as a separate thread for them to work parallelly.
- Start and Stop Listening sounds are played using the playsound module.
- The input from the user is taken either as a text from the command entry box or as audio from mic button
- The input is converted to text using SpeechRecognition module and passed to processtext function of processtext.py
processtext.py contains the function in which the command processing happens.
- The input obtained from the user in the main function is passed to the processtext function. It has an if-else ladder to search for specific keywords in the input. If the query does not match with the keywords found in the ladder, it is passed to wolfresponse.py
- Functions like opening apps, searching web happens here.
wikiresponse.py and wolfresponse.py are the places where the respective APIs are accessed to process the output of the unmatched queries from processtext.py
texttospeech.py uses the pyttsx3 module to convert speech to text.
- After processing the query, text to speech function is called.
- Esc Key is added as a keybind to stop text to speech for one iteration by calling texttospeech_stop function.
config_data.json contains the values that are displayed in the September app settings in JSON format. It is used for storing windows application paths, wake word and the API key.
requirements.txt contains the list of all the modules used in this program.

🌐 Dependencies

requirements.txt contains all the python modules required by the program.
All the assets used by the program are present in assets folder. The application won't function as intended without these assets.
Built in Python 3.10.1. Use python 3.0 or greater versions for a better experience.
Requires active internet connection.
Make sure that your antivirus doesn't block the program from uploading or downloading audio stream data. (Disable antivirus :D)
Additionally, the Pyaudio module must be installed by following the below instructions.

Install Pyaudio

Find your Python version using in your terminal

python --version

Find the appropriate .whl (wheel) file at Pythonlibs and download it.
Go to the folder where it is downloaded and install the .whl file using pip,
For example, if you download the wheel file for Python 3.7 64-bit, your pip command would be,

pip install PyAudio-0.2.11-cp37-cp37m-win_amd64.whl

The wheel file for Python 3.10 64-bit is present in dependencies/PyAudio-0.2.11-cp310-cp310-win_amd64.whl
Visit for more information

📝 License

You might also like...

ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for which no expressive speech corpus is available.

ERISHA: Multilingual Multispeaker Expressive Text-to-Speech Library ERISHA is a multilingual multispeaker expressive speech synthesis framework. It ca

43 Nov 27, 2022

An evaluation toolkit for voice conversion models.

Voice-conversion-evaluation An evaluation toolkit for voice conversion models. Sample test pair Generate the metadata for evaluating models. The direc

30 Aug 29, 2022

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

DiffSinger - PyTorch Implementation PyTorch implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension). Status

152 Jan 2, 2023

Automatic voice-synthetised summaries of latest research papers on arXiv

PaperWhisperer PaperWhisperer is a Python application that keeps you up-to-date with research papers. How? It retrieves the latest articles from arXiv

124 Dec 20, 2022

Phonetic PosteriorGram (PPG)-Based Voice Conversion (VC)

ppg-vc Phonetic PosteriorGram (PPG)-Based Voice Conversion (VC) This repo implements different kinds of PPG-based VC models. Pretrained models. More m

227 Dec 28, 2022

Here is the implementation of our paper S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations.

S2VC Here is the implementation of our paper S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations. In thi

81 Dec 15, 2022

Releases(v1.0.1)

v1.0.1(Feb 19, 2022)
Added Windows Installer

If windows defender flags this as a virus or tries to delete the .exe, .dll files (ofc is a false positive detection), try these solutions,

Trust the file on Windows Defender settings.

Disable Windows Defender (buy an antivirus :D, defender gives false positive detections most of the time).

Read this for more information.

Link to the solution provided in Microsoft Forum.

Install from the .zip file instead of the windows setup file.

Source code(tar.gz)
Source code(zip)
Septemberv1.0.1-Setup.exe(38.28 MB)
Septemberv1.0.1-Setup.zip(46.10 MB)
v1.0.0(Feb 17, 2022)
Added .zip file for installation.

The main executable (September.exe) is in the root folder

If windows defender flags the app as a virus or tries to delete the .exe, .dll files (ofc is a false positive detection), try these solutions,

Trust the file on Windows Defender settings.

Disable Windows Defender.

Read this for more information.

Link to the solution provided in Microsoft Forum.

Source code(tar.gz)
Source code(zip)
September.zip(46.10 MB)

September-Assistant - Open-source Windows Voice Assistant

Related tags

Overview

September - Windows Assistant

🏆 Install

🔮 Features

📚 Overview

💻 Code Walkthrough

🌐 Dependencies

Install Pyaudio

📝 License

You might also like...

ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for which no expressive speech corpus is available.

An evaluation toolkit for voice conversion models.

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

Automatic voice-synthetised summaries of latest research papers on arXiv

Phonetic PosteriorGram (PPG)-Based Voice Conversion (VC)

Here is the implementation of our paper S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations.

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Voice of Pajlada with model and weights.

GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled

Releases(v1.0.1)

v1.0.1(Feb 19, 2022)

v1.0.0(Feb 17, 2022)

Owner

The Nithin Balaji

Band-Adaptive Spectral-Spatial Feature Learning Neural Network for Hyperspectral Image Classification

MIMIC Code Repository: Code shared by the research community for the MIMIC-III database

In this repo we reproduce and extend results of Learning in High Dimension Always Amounts to Extrapolation by Balestriero et al. 2021

PSML: A Multi-scale Time-series Dataset for Machine Learning in Decarbonized Energy Grids

A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results

The Official PyTorch Implementation of "LSGM: Score-based Generative Modeling in Latent Space" (NeurIPS 2021)

SparseML is a libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

AbelNN: Deep Learning Python module from scratch

Segmentation models with pretrained backbones. Keras and TensorFlow Keras.

Official PyTorch implementation of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image", ICCV 2019

Official release of MSHT: Multi-stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer axriv: http://arxiv.org/abs/2112.13513

Resco: A simple python package that report the effect of deep residual learning

Attentional Focus Modulates Automatic Finger‑tapping Movements

Notebooks for my "Deep Learning with TensorFlow 2 and Keras" course

Official implementation of the paper 'High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network' in CVPR 2021

This repository is for Competition for ML_data class

Bag of Tricks for Natural Policy Gradient Reinforcement Learning

Python script that takes an Impulse response .wav and a input .wav to demonstrate audio convolution.

code for paper "Does Unsupervised Architecture Representation Learning Help Neural Architecture Search?"

Python implementation of Wu et al (2018)'s registration fusion