Deep temporal architectures for activity recognition

dc.contributor.advisorGrobler, H.
dc.contributor.emailu28216882@tuks.co.za
dc.contributor.postgraduateLuvhengo, Todani
dc.date.accessioned2018-12-05T08:04:54Z
dc.date.available2018-12-05T08:04:54Z
dc.date.created2009/07/18
dc.date.issued2017
dc.descriptionDissertation (MEng)--University of Pretoria, 2017.
dc.description.abstractThe amount of video content generated increases daily, three hundred hours of video content is uploaded to YouTube every 60 seconds1. There exists a need to sort, summarise, describe, categorise and retrieve video data based on the content (i.e. the activities occurring in the video). Activity recognition (i.e. automatically naming activities) is an important area for video analysis. Activity recognition has applications in robotics, video surveillance, multimedia retrieval, behaviour analysis, disaster warning systems and content-based browsing. Automatically categorising activities given a video clip poses two main challenges, namely object detection and motion learning. An activity recognition system must detect and localise the agent as well as learn to categorise the action the agent is performing. This research hypothesises that learning models incorporating spatial and temporal aspects from video data should outperform models that learn only spatial or temporal features on activity recognition learning tasks. The above hypothesis is investigated by developing two deep learning architectures for activity recognition that learn temporally independent and dependent features respectively.minima do not exist. A recurrent network (structurally constrained gated recurrent unit (SCGRU)) that adds contextual feature learning to gated recurrent units (GRUs) is proposed. Adding contextual features stabilises the hidden state of a GRU layer. The approach taken to investigate activity recognition architectures in this research involved examining the architectures on four benchmark datasets and analysing the results to 1) find the best model for activity recognition, 2) examine the model’s ability to learn salient temporal features, and 3) examine the model’s computational complexity. SCGRU based models outperform GRU based models on the majority of the investigated activity recognition models and datasets.
dc.description.availabilityUnrestricted
dc.description.degreeMEng
dc.description.departmentElectrical, Electronic and Computer Engineering
dc.identifier.citationLuvhengo, T 2017, Deep temporal architectures for activity recognition, MEng Dissertation, University of Pretoria, Pretoria, viewed yymmdd <http://hdl.handle.net/2263/67779>
dc.identifier.otherS2018
dc.identifier.urihttp://hdl.handle.net/2263/67779
dc.language.isoen
dc.publisherUniversity of Pretoria
dc.rights© 2018 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUnrestricted
dc.subjectUCTD
dc.titleDeep temporal architectures for activity recognition
dc.typeDissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Luvhengo_Deep_2017.pdf
Size:
3.54 MB
Format:
Adobe Portable Document Format
Description: