Minimizing Communication Barriers with AI: Project Aims and Methodology

Hey everyone!

In this post, I will outline the aim and objectives of my project, delve into the methodology I am using to achieve the same.

Aims and Objectives

The primary aim of my project is to minimize communication barriers between the deaf and non-deaf communities in Saudi Arabia using deep-learning techniques.

Here are the key objectives I am working towards:

  • Understanding Saudi Arabic Sign Language Gestures: Analyzing and comprehending the nuances of SSL to accurately capture and translate gestures.
  • Creating a Gesture-Capturing Tool: Developing a program to capture SSL gestures and build a comprehensive dataset.
  • Building and Training a Deep Learning Model: Using the dataset to train a Multi-Layer Perceptron model, ensuring accurate gesture recognition.
  • Developing a Desktop Application: Creating an intuitive interface for users to interact with the trained model, facilitating real-time gesture translation.
  • ROS node Development: The implemented system will be ported as a ROS node, so it can be used across any robotic system which is ROS compatible.

Brief Methodology

To achieve these objectives, I have outlined a detailed methodology that combines data collection, model training, and application development:

  1. Data Collection and Gesture Analysis: Utilizing online resources to analyze and understand SSL gestures, studying hand movements within the context of Saudi Arabic Sign Language.
  2. Gesture Detection and Dataset Creation: Developing a program using Python and MediaPipe to detect hand landmarks and create a robust dataset.
  3. Model Training: Training a Multi-Layer Perceptron model using the collected dataset to recognize SSL gestures.
  4. Real-Time Gesture Recognition: Developing a program for real-time gesture identification and displaying recognized gestures on-screen.
  5. User Interface Development: Building a desktop application using TKinter for easy interaction with the model.
  6. Model Evaluation: Ensuring the model's accuracy through testing with real-world datasets.

Research Questions

How can deep learning be used to overcome the lack of tools to translate Saudi Arabia sign language (SSL) into Arabic to provide seamless communication between the deaf community and society?

Deliverables

  • A tool to extract data points using MediaPipe library
  • GUI for user interaction
  • Make complete project as a ROS node for deployment in Pepper Robot
  • Thesis Report

Evaluation Strategy

  • Conduct live tests with volunteers fluent in SSL to evaluate real-time performance.
  • Gather feedback on the desktop application's interface and usability, and iterate based on feedback.
  • Test the model under various conditions to ensure robustness.
  • Analyze errors and refine the model to improve accuracy.

Gantt Chart

Link for Gantt Chart as it will be updated continuously:Gantt Chart

Thanks until next post!

Comments

Popular posts from this blog

Understanding Saudi Arabic Sign Language Gestures

Understanding and implementation of Multilayer Perceptrons (MLP)