Irwandi, Hipiny and Hamimah, Ujir and Aidil Azli, Alias and Musdi, Shanat and Mohamad Khairi, Ishak (2023) Who Danced Better? Ranked TikTok Dance Video Dataset and Pairwise Action Quality Assessment Method. International Journal of Advances in Intelligent Informatics, 9 (1). pp. 1-12. ISSN 2548-3161
PDF
Who Danced - Copy.pdf Download (475kB) |
Abstract
Video-based action quality assessment (AQA) is a non-trivial task due to the subtle visual differences between data produced by experts and non-experts. Current methods are extended from the action recognition domain, where most are based on temporal pattern matching. AQA has additional requirements where order and tempo matter for rating the quality of an action. We present a novel dataset of ranked TikTok dance videos and a pairwise AQA method for predicting which video of a same-label pair was sourced from the better dancer. Exhaustive pairings of same-label videos were randomly assigned to 100 human annotators, ultimately producing a ranked list per label category. Our method relies on a successful detection of the subject’s 2D pose inside successive query frames where the order and tempo of actions are encoded inside a produced String sequence. The detected 2D pose returns a top-matching Visual word from a Codebook to represent the current frame. Given a same-label pair, we generate a String value of concatenated Visual words for each video. By computing the edit distance score between each String value and the Gold Standard’s (i.e., the top-ranked video(s) for that label category), we declare the video with the lower score as the winner. The pairwise AQA method is implemented using two schemes, i.e., with and without text compression. Although the average precision for both schemes over 12 label categories is low, at 0.45 with text compression and 0.48 without, precision values for several label categories are comparable to past methods (median: 0.47, max: 0.66).
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Action Quality Assessment, Dance Video Dataset, Human Activity Analysis, String Matching, Visual Codebook. |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology Faculties, Institutes, Centres > Faculty of Computer Science and Information Technology Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology |
Depositing User: | Mohamad Hipiny |
Date Deposited: | 17 Mar 2023 01:19 |
Last Modified: | 02 Aug 2023 02:51 |
URI: | http://ir.unimas.my/id/eprint/41536 |
Actions (For repository members only: login required)
View Item |