PradeepIT Consulting Services Pvt Ltd
Job description:
NLP Engineer / Machine Learning Engineer Document Understanding & Knowledge Graphs
- Overview Were looking for a hands-on NLP/ML engineer to lead the development of an intelligent document understanding pipeline for extracting structured data from complex, unstructured RFQ documents (40100+ pages, in German and English).
- You will be responsible for building scalable systems that combine document parsing, layout analysis, entity extraction, and knowledge graph construction ultimately feeding downstream (e.g. Analytics and LLM applications.)
- Key Responsibilities – – – – – –
- Design and implement document hierarchy and section segmentation pipelines using layout-aware models (e.g., DocLayout-YOLO, LayoutLM, Donut).
- Build multilingual entity recognition and relation extraction systems across both English and German texts.
- Use tools like NLTK, transformers, and spaCy to develop custom tokenization, parsing, and information extraction logic.
- Construct and maintain knowledge graphs representing semantic relationships between extracted elements using graph data structures and graph databases (e.g. Neo4j) Integrate outputs into structured LLM-friendly formats (e.g., JSON, Mark Down) for downstream extraction of building material elements.
- Collaborate with product and domain experts to align on information schema, ontology, and validation methods. What Were Looking For – – – –
- Strong experience in NLP, document understanding, and information extraction from unstructured/multilingual documents.
- Proficiency in Python, with experience using libraries such as transformers, spaCy, and NLTK. Hands-on experience with layout-aware models like DocLayout-YOLO, LayoutLM, Donut, or similar.
- Familiarity with knowledge graphs and graph databases such as Neo4j, RDF
Originally posted on Himalayas
To apply for this job please visit himalayas.app.
Working in United States
The United States of America (USA), also known as the United States (U.S.) or America, is a country primarily located in North America. It is a federal republic consisting of 50 states and a federal capital district, Washington, D.C. The 48 contiguous states border Canada to the north and Mexico to the south, with the semi-exclave of Alaska in the northwest and the archipelago of Hawaii in the Pacific Ocean. The United States also asserts sovereignty over five major island territories and various uninhabited islands in Oceania and the Caribbean. It is a megadiverse country, with the world's th
More jobs at PradeepIT Consulting Services Pvt Ltd
Keep exploring on Get A Job.ai
Not quite the right fit? Your next opportunity is a click away.
- Browse all jobs
- More jobs by category
- Remote jobs you can do from anywhere
- Research typical pay for this role
- Set a job alert so new matches reach you first
- Upload your resume to apply faster
Hiring instead? Post a job and reach candidates searching right now.