Skip to Content

This is archived information!

Search current Handbook for current unit information.

Document Processing and the Semantic Web - COMP348

This unit explores the issues involved in building natural language processing (NLP) applications that operate on large bodies of real text such as are found on the world wide web. With the web full of unstructured and largely text based data, the applications needed to handle this have their own particular characteristics. In this unit we discuss some core applications for dealing with data on the web, such as spam filtering and search engines. The unit also explores some developments of web, such as emerging semantic web technologies which support the exchange of XML metadata on the web, and Web 2.0 technologies (such as social networking, folksonomies, wikis and blogs). Application areas covered include information retrieval, web search, document summarisation, machine translation and information extraction.

Credit Points: 3
When Offered:

D1 - Day; Offered in Session 1, North Ryde

Staff Contact(s): Computing Staff
Prerequisites:

39cp and COMP249(P) Prerequisite Information

Corequisites:

NCCW(s):
Unit Designation(s):

Science

Information Technology

Unit Type:
Assessed As: Graded
Offered By:

Department of Computing

Faculty of Science

Timetable Information

For unit timetable information please visit the Timetables@Macquarie Website