Leveraging the multimodal information from video content for video recommendation

Show simple item record

dc.contributor.advisor De Villiers, Johan Pieter
dc.contributor.coadvisor De Freitas, Allan
dc.contributor.postgraduate Almeida, Adolfo Ricardo Lopes De
dc.date.accessioned 2021-07-27T08:39:30Z
dc.date.available 2021-07-27T08:39:30Z
dc.date.created 2021
dc.date.issued 2021
dc.description Dissertation (MEng (Computer Engineering))--University of Pretoria, 2021. en_ZA
dc.description.abstract Since the popularisation of media streaming, a number of video streaming services are continually buying new video content to mine the potential profit. As such, newly added content has to be handled appropriately to be recommended to suitable users. In this dissertation, the new item cold-start problem is addressed by exploring the potential of various deep learning features to provide video recommendations. The deep learning features investigated include features that capture the visual-appearance, as well as audio and motion information from video content. Different fusion methods are also explored to evaluate how well these feature modalities can be combined to fully exploit the complementary information captured by them. Experiments on a real-world video dataset for movie recommendations show that deep learning features outperform hand crafted features. In particular, it is found that recommendations generated with deep learning audio features and action-centric deep learning features are superior to Mel-frequency cepstral coefficients (MFCC) and state-of-the-art improved dense trajectory (iDT) features. It was also found that the combination of various deep learning features with textual metadata and hand-crafted features provide significant improvement in recommendations, as compared to combining only deep learning and hand-crafted features. en_ZA
dc.description.availability Unrestricted en_ZA
dc.description.degree MEng (Computer Engineering) en_ZA
dc.description.department Electrical, Electronic and Computer Engineering en_ZA
dc.description.sponsorship The MultiChoice Research Chair of Machine Learning at the University of Pretoria en_ZA
dc.description.sponsorship UP Postgraduate Masters Research bursary en_ZA
dc.identifier.citation * en_ZA
dc.identifier.other S2021 en_ZA
dc.identifier.uri http://hdl.handle.net/2263/80994
dc.language.iso en en_ZA
dc.publisher University of Pretoria
dc.rights © 2019 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject Video recommendation en_ZA
dc.subject item cold-start en_ZA
dc.subject deep learning features en_ZA
dc.subject multimodal feature fusion en_ZA
dc.subject matrix scaling en_ZA
dc.title Leveraging the multimodal information from video content for video recommendation en_ZA
dc.type Dissertation en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record