Innovative NLP Breakthroughs in Hebrew and Arabic at IAHLT

Lecturers: Avner Algom and Noam Ordan, 21.12.2023

In this presentation, we introduce IAHLT, detailing our organization’s motivations, business model, and focus on linguistic annotation for NLP applications. We’ll present our team and methodology, explaining how we manage multiple projects and handle large datasets. The journey of a document through various linguistic annotation pipelines, ensuring data integrity, will be illustrated. We’ll highlight three key datasets: one focusing on Hebrew morphology and syntax across multiple genres with rich metadata; another on Arabic morphology, covering both written and spoken forms; and a novel corpus of dialectical Arabic, derived from spoken data, designed to mirror the language as it appears in social media.

Speakers:
 Avner Algom, GM of IAHLT.ORG
Avner Algom has more than 30 years of R&D and business development experience in the Hi-tech industry, founder of the AI/Data Science community from industry and academia, focused on innovation, knowledge sharing and networking for implementing AI/Data Science solutions.
• Noam Ordan, PhD, CTO of IAHLT.ORG
Noam Ordan, is a computational linguist with two decades of experience in both academia and industry. His work has focused on text classification and machine translation, leading to numerous publications in prominent conferences and journals.
Language: English

Slides

Part 1

Part 2


——————————————————————————————————————————
I put a lot of thoughts into these blogs, so I could share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,

feel free to contact me via LinkedIn – Omid Vahdaty:

Discover more from Big Data Demystified

Subscribe now to keep reading and get access to the full archive.

Continue reading