Who Danced Better? Ranked TikTok Dance Video Dataset and Pairwise Action Quality Assessment Method

Irwandi, Hipiny and Hamimah, Ujir and Aidil Azli, Alias and Musdi, Shanat and Mohamad Khairi, Ishak (2023) Who Danced Better? Ranked TikTok Dance Video Dataset and Pairwise Action Quality Assessment Method. International Journal of Advances in Intelligent Informatics, 9 (1). pp. 1-12. ISSN 2548-3161

[img] PDF
Who Danced - Copy.pdf

Download (475kB)
Official URL: https://ijain.org/index.php/IJAIN/article/view/919

Abstract

Video-based action quality assessment (AQA) is a non-trivial task due to the subtle visual differences between data produced by experts and non-experts. Current methods are extended from the action recognition domain, where most are based on temporal pattern matching. AQA has additional requirements where order and tempo matter for rating the quality of an action. We present a novel dataset of ranked TikTok dance videos and a pairwise AQA method for predicting which video of a same-label pair was sourced from the better dancer. Exhaustive pairings of same-label videos were randomly assigned to 100 human annotators, ultimately producing a ranked list per label category. Our method relies on a successful detection of the subject’s 2D pose inside successive query frames where the order and tempo of actions are encoded inside a produced String sequence. The detected 2D pose returns a top-matching Visual word from a Codebook to represent the current frame. Given a same-label pair, we generate a String value of concatenated Visual words for each video. By computing the edit distance score between each String value and the Gold Standard’s (i.e., the top-ranked video(s) for that label category), we declare the video with the lower score as the winner. The pairwise AQA method is implemented using two schemes, i.e., with and without text compression. Although the average precision for both schemes over 12 label categories is low, at 0.45 with text compression and 0.48 without, precision values for several label categories are comparable to past methods (median: 0.47, max: 0.66).

Item Type: Article
Uncontrolled Keywords: Action Quality Assessment, Dance Video Dataset, Human Activity Analysis, String Matching, Visual Codebook.
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology
Faculties, Institutes, Centres > Faculty of Computer Science and Information Technology
Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology
Depositing User: Mohamad Hipiny
Date Deposited: 17 Mar 2023 01:19
Last Modified: 02 Aug 2023 02:51
URI: http://ir.unimas.my/id/eprint/41536

Actions (For repository members only: login required)

View Item View Item