Cheuk Ting Ho
@cheukting_ho
Cheukting
General approach:
Vectorization to capture the content then compare vectors using Cosine-similarity
It can do that better than TF-IDF and BoW
For that reason, the key is:
Would the vectorization method capture the contextual content?
and the best is:
Choice of pre-train model:
Choice of custom-train model:
By Cheuk Ting Ho
Developer advocate / Data Scientist - support open-source and building the community.