Abstract:
The choice of input image size can have a significant impact on the performance of state-of-the-art algorithms. We can always customize these algorithms by training and fine-tuning them on our own datasets, but this is time-consuming. Nowadays there is a trend toward using foundation models, but in our application of monitoring patients, hand detection, object detection, and hand-object interaction detection all yielded mediocre performance. This study investigated the significance of input size for detecting hand-object interaction in two datasets: the patient dataset (captured in super-view mode) and the EpicKitchen dataset (captured in normal-view mode). The results showed that using different input sizes with the same foundation model can lead to a significant improvement in performance. In the patient dataset, cropping and resizing the original images to input sizes of 300 × 300 pixels (px) and 256 × 256 px led to more successful hand detection. Furthermore, using a video-processing tool such as FFmpeg to resize the frames, rather than passing the original images to the MediaPipe model for internal resizing, resulted in a 33% improvement. In the EpicKitchen dataset with normal-view mode, successful hand detection was obtained by padding and cropping the original images and then resizing the frames to 256 px and 300 px. Overall, the study emphasizes the significance of input size in hand-object interaction detection for the purpose of monitoring patients with upper-limb impairment. The combination analysis within each dataset showed that the most effective hand-object interaction detection is achieved by applying the MediaPipe model to an input image size of 300 × 300 px (for super-view mode) or 256 × 256 px (for normal-view mode) together with the output of the YOLOv7 model at an input image size of 1920 × 1920 px. With this combination, a 100% success rate was achieved on both datasets.
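As a concrete illustration of the preprocessing step described above, the sketch below shows how frames could be cropped and resized to 300 × 300 px with FFmpeg before being passed to MediaPipe Hands, rather than letting MediaPipe resize the original images internally. This is a minimal sketch under stated assumptions, not the study's released code: the input path, output directory, centered square crop, and detection threshold are illustrative choices that are not specified in the abstract.

import glob
import os
import subprocess

import cv2
import mediapipe as mp

# Hypothetical input video and the 300 x 300 px target size reported for the
# super-view patient dataset in the abstract.
VIDEO_IN = "patient_clip.mp4"
TARGET = 300

os.makedirs("frames", exist_ok=True)

# Step 1: crop each (assumed landscape) frame to a centered ih x ih square
# and resize it to TARGET x TARGET with FFmpeg, writing the frames to disk.
subprocess.run([
    "ffmpeg", "-i", VIDEO_IN,
    "-vf", f"crop=ih:ih,scale={TARGET}:{TARGET}",
    "frames/frame_%05d.png",
], check=True)

# Step 2: run MediaPipe Hands on the pre-resized frames instead of passing
# the original images to the model.
hands = mp.solutions.hands.Hands(
    static_image_mode=True,        # independent per-frame detection
    max_num_hands=2,
    min_detection_confidence=0.5,  # assumed threshold
)

frames = sorted(glob.glob("frames/frame_*.png"))
detected = 0
for path in frames:
    bgr = cv2.imread(path)
    rgb = cv2.cvtColor(bgr, cv2.COLOR_BGR2RGB)  # MediaPipe expects RGB input
    if hands.process(rgb).multi_hand_landmarks:
        detected += 1

hands.close()
print(f"Hands detected in {detected} of {len(frames)} resized frames")

For the combination analysis mentioned above, the per-frame hand detections produced this way would be merged with the detections of a YOLOv7 model run at an input size of 1920 × 1920 px; that fusion step is not shown here.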