About the Project

The Generative AI (GenAI) supported Note Takers App is an innovative tool designed to help users capture and organize notes from spoken conversations. This smart application uses advanced technology to record audio, convert speech into text, and then refine these notes using Generative AI like Google Germini and OpenAI GPT.

Users can choose to use the built-in speech-to-text function to convert their recorded lecture/meeting audio into text directly within their devices. In such case, users don't need to worry about sending the audio files to another place on the Internet. On the other hand in the case the users would like to use server-side speech recongition feature due to the built-in speech-to-text model is not always performing so well like a larger and mature open source model, the users also no need to worry about leaking any audio recording in the transit.

It is because the Note Takers project incorporates Public Key Infrasture (PKI), Secure Sockets Layer (SSL) and salted secured hash algorithm to propose, designe, and implement a privacy-protected and secure transmission mechanics and workflow that ensure:

  • (1) the audio file's chunks transferring from the app to the server cannot be decrypted and the audio content cannot be interpreted by anyone else;
  • (2) the stored chunks on the server waiting for speech-to-text process is protected and cannot be read besides the server itself;
  • (3) the speech-to-text process on the server is a stateless process that transcribes chunks on-the-fly;
  • (4) once a encrypted chunk is transcribed, the chunk is deleted from the server;
  • (5) the stored text transcripts on the server is encrypted and protected so no one else in the world (even the server itself) other than the sender app can read; and,
  • (6) once the server passes a stored encrypted transcript back to the app, the transcript is deleted from the server.
VIP Research Group's Note Takers app is perfect for anyone who needs to keep track of lectures, meetings, or any spoken content without manually writing down every word. The app ensures that all data, from audio files to text transcripts, is stored securely, using encryption and secure connections to protect user privacy.

This project combines various technologies to create a seamless note-taking experience. It includes features like audio recording from different sources, breaking audio into manageable parts, and converting these into text. Once the text is generated, the app employs Generative AI to enhance the content, such as summarizing key points or expanding on topics as needed. The app operates independently, handling tasks like audio processing and data storage without user intervention, making it a reliable tool for students, professionals, and anyone looking to enhance their note-taking capabilities.

About Us

...
Dr. Maiga Chang
Supervisor

Dr. Maiga Chang is a Full Professor in the school of Computing and Information Systems at Athabasca University, Canada

...
Our Goal

The goal of the GenAI Supported Note Takers App is to efficiently convert spoken audio into organized, AI-enhanced text notes, ensuring data security and user convenience.

Our Team

...
Jagrit Acharya
Current Member

Jagrit Acharya
is an graduate student at University of Calgary, Canada (2023-2025)

Videos

Note Takers - App

This video shows users the Generative AI (GenAI) supported Note Takers app that VIP Research Group presents.

Key Features

Speech-to-Text Conversion

Record audio or import audio from lectures or meetings and convert spoken words to text with high accuracy using built-in or server-side recognition.

Advanced Security

End-to-end encryption with RSA, AES, SSL, and salted hash algorithms ensuring your audio and text data remains private and secure.

AI-Enhanced Notes

Generative AI from Google Gemini and OpenAI ChatGPT refines and organizes your notes, summarizing key points automatically.

Assign Recording Tags and Filter

On-the-fly transcription with immediate deletion of processed audio chunks, enhancing privacy and security.

User-Friendly Interface

Intuitive Android application with easy recording, editing, and organization of notes for students and professionals.

Share Everything

Easily share recordings, prompts, transcriptions, and enhanced notes with classmates, colleagues, or team members at the touch of a button.

Publications

Frequenty Asked Questions