Deep temporal architectures for activity recognition

Deep temporal architectures for activity recognition

dc.contributor.advisor	Grobler, H.
dc.contributor.email	u28216882@tuks.co.za
dc.contributor.postgraduate	Luvhengo, Todani
dc.date.accessioned	2018-12-05T08:04:54Z
dc.date.available	2018-12-05T08:04:54Z
dc.date.created	2009/07/18
dc.date.issued	2017
dc.description	Dissertation (MEng)--University of Pretoria, 2017.
dc.description.abstract	The amount of video content generated increases daily, three hundred hours of video content is uploaded to YouTube every 60 seconds1. There exists a need to sort, summarise, describe, categorise and retrieve video data based on the content (i.e. the activities occurring in the video). Activity recognition (i.e. automatically naming activities) is an important area for video analysis. Activity recognition has applications in robotics, video surveillance, multimedia retrieval, behaviour analysis, disaster warning systems and content-based browsing. Automatically categorising activities given a video clip poses two main challenges, namely object detection and motion learning. An activity recognition system must detect and localise the agent as well as learn to categorise the action the agent is performing. This research hypothesises that learning models incorporating spatial and temporal aspects from video data should outperform models that learn only spatial or temporal features on activity recognition learning tasks. The above hypothesis is investigated by developing two deep learning architectures for activity recognition that learn temporally independent and dependent features respectively.minima do not exist. A recurrent network (structurally constrained gated recurrent unit (SCGRU)) that adds contextual feature learning to gated recurrent units (GRUs) is proposed. Adding contextual features stabilises the hidden state of a GRU layer. The approach taken to investigate activity recognition architectures in this research involved examining the architectures on four benchmark datasets and analysing the results to 1) find the best model for activity recognition, 2) examine the model’s ability to learn salient temporal features, and 3) examine the model’s computational complexity. SCGRU based models outperform GRU based models on the majority of the investigated activity recognition models and datasets.
dc.description.availability	Unrestricted
dc.description.degree	MEng
dc.description.department	Electrical, Electronic and Computer Engineering
dc.identifier.citation	Luvhengo, T 2017, Deep temporal architectures for activity recognition, MEng Dissertation, University of Pretoria, Pretoria, viewed yymmdd <http://hdl.handle.net/2263/67779>
dc.identifier.other	S2018
dc.identifier.uri	http://hdl.handle.net/2263/67779
dc.language.iso	en
dc.publisher	University of Pretoria
dc.rights	© 2018 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subject	Unrestricted
dc.subject	UCTD
dc.title	Deep temporal architectures for activity recognition
dc.type	Dissertation

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Luvhengo_Deep_2017.pdf
Size:: 3.54 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

Theses and Dissertations (University of Pretoria)
Theses and Dissertations (Electrical, Electronic and Computer Engineering)

Simple item page