Data Collection Methodology Using MediaPipe Framework

The data-collection program has been completed, and the next step is to begin collecting data to construct the dataset. For this purpose, 38 different signs will be collected, with a minimum of 800 data records expected for each sign. The data collection procedure will be executed in the following order:

1.The program will be executed via the terminal with the specified sign name.

2.The individual will present the hand sign to the webcam and commence data collection by pressing ‘0’.

The advantage of this data collection program is that the background does not affect the quality of the collected data. This is due to the capability of the MediaPipe framework to recognize the hand regardless of other visual noise in the background. Additionally, the distance between the hand and the camera does not impact data quality, as the program normalizes the joint distances from the wrist joint.

To enhance the robustness of the collected data, the following measures will be implemented:

1. The orientation of the hand will be slightly varied during the data collection process for each hand sign.

2. Multiple individuals will collect data for the same hand sign.

3. During data collection, the hand will be moved back and forth to capture different distance measures.

These measures are necessary for the following reasons:

If only upright hand positions are used for training, the model may struggle to correctly recognize gestures if the hand is tilted. Since some gestures are distinguished by minor differences in the position of one or two fingers, slight changes could confuse the multi-layer perceptron (MLP) model. Furthermore, different individuals have varying hand sizes, resulting in different joint distances. Training the model on data from a single individual would limit its ability to recognize signs made by hands of different sizes and shapes, which is undesirable for a sign language translator intended for diverse users.

My project guide Dr.Maha Saadeh suggested measures to address these issues.The data collection process will involve four individuals, with each person collecting at least 200 data records per hand gesture with slight orientation variations. Consequently, 800 records will be collected per hand gesture, ensuring that the MLP model can correctly identify hand gestures despite variations in hand size and shape. Further analysis may include more hands if necessary. The data collection program currently accepts input from one hand. By changing the `max_num_hands` parameter to 2, the program has to be altered in such a way that it can accept data points for signs that use both hands. However, mixing single-handed and double-handed sign data in the same model could cause confusion. Therefore, separate models will be trained for single-handed and double-handed signs.

Comments

Popular posts from this blog

Understanding Saudi Arabic Sign Language Gestures

Understanding and implementation of Multilayer Perceptrons (MLP)