Thomas Wood

I am a London-based freelance data scientist, available for consulting engagements, especially around natural language processing (NLP). I help organisations extract value from unstructured data. If you have a large number of text documents (for example pharmaceutical regulatory documents, legal caseloads, or credit reports) and would like to understand how they could benefit your organisation, and even quantify the benefit before getting started, please let me know.

Recent posts by Thomas Wood

Hire an NLP developer
AI and NLP, Business applications

Hire an NLP developer and untangle the power of natural language in your projects

The world is buzzing with the possibilities of natural language processing (NLP). From chatbots that understand your needs to algorithms that analyse mountains of text data, NLP is revolutionising industries across the board. But harnessing this power requires the right expertise. That’s where finding the perfect NLP developer comes in.

Why do I need to hire an NLP developer?

What is NLP?

Natural language processing

What is natural language processing?

Natural language processing, or NLP, is a field of artificial intelligence that focuses on the interaction between computers and humans using natural language. NLP is a branch of AI but is really a mixture of disciplines such as linguistics, computer science, and engineering. There are a number of approaches to NLP, ranging from rule-based modelling of human language to statistical methods. Common uses of NLP include speech recognition systems, the voice assistants available on smartphones, and chatbots.

Hire an NLP data scientist
AI and NLP, Business applications

Hire an NLP data scientist and boost your business with AI

As artificial intelligence transcends the realm of sci-fi and starts getting intricately woven into our everyday lives, the demand for specialized professionals to oversee its many dimensions has never been higher. If your company is looking to step into the future, now is the perfect time to hire an NLP data scientist!

What is an NLP data scientist?

Natural Language Processing (NLP), a subset of machine learning, focuses on the interaction between humans and computers via natural language.

Unsolved problems in natural language processing

Here’s a walkthrough of some of NLP’s most intriguing unsolved mysteries.

Forensic stylometry

Who wrote which parts of the Federalist Papers? Who is Elena Ferrante? Who was S.W. Erdnase, the pseudonymous author of The Expert at the Card Table?

Translation

Can we make a machine translator as good as a human? Machine translation is an AI-complete problem, requiring an AI to have real-world knowledge to solve properly.

Which NLP corpus?

List of multilingual text corpora for natural language processing

If you want to train your own large language model, try developing a stylometry model, or simply hone your NLP skills, you will need a corpus to work with. Some of the best known corpora include:

The Open American National Corpus (OANC)
The British National Corpus (BNC)
Project Gutenberg
Text corpora on authorship attribution (University of Neuchatel, the CLC group)
The CLEF PAN corpora and tasks
The Federalist Papers - a series of 85 essays written by Alexander Hamilton, John Jay, and James Madison.