Raw Data
<
Topic Modelling - Mallet
OCR quality pretty low
But, topic modelling actually kind of worked
0    0.5    tre whe pus pur wor ant war hhe art ane pue ore oun thy ami oat ter tie wat ame
1    0.5    chinese year china labour malaya work opium malay states tin years number state perak selangor immigration rubber report coolies price
2    0.5    tho tha wore thoe lhe party local whioh wan tang thore kmt min kuo aro ard amt thoy aml bean
3    0.5    time place house day made small road side river miles sea men island people country back found man water town
4    0.5    tae und tne tie long white car black red tre sort top ani thin open trees blue aad men snd
5    0.5    chinese time good day evening club dinner home morning man people house round amoy night party left bit didn‘t sunday
6    0.5    chinese government governor ordinance council state dated sir straits singapore subject settlements despatches despatch law time report present reply made
7    0.5    societies police year society secret members chinese report annual persons penang men cases singapore gang banished crime number reports trouble
8    0.5    banishment july reply june china tan singapore reports april chinese feb jan lim list sept dec colony governor nov life
9    0.5    letter time don‘t i‘m dad mother betty week good things letters i‘ve home give write dear book days love office
Slides: slides.com/jdingle/c4l18n