Australian Digital Thesis: Classification of HTML Documents