
Semantic Search Engineer
- Brescia
- Contract
- Full time
- Use Apache ManifoldCF to crawl SharePoint, file systems, or databases.
- Use Microsoft Graph API for structured Microsoft 365 data.
- Use Apache Marmotta to enrich data with RDF triples and linked data.
- Use Pipeship to manage ingestion, enrichment, and indexing workflows.
- Push enriched data into Apache Solr or Amazon OpenSearch Service for indexing.
- Configure custom analyzers and faceting in the search index to support semantic search.
- Host components on AWS EC2/EKS, store data in S3, and monitor with CloudWatch.
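
As a rough illustration of the indexing and faceting steps listed above, the following minimal sketch builds (but does not send) a Solr JSON update request and a faceted select query. The host `http://localhost:8983`, the collection name `documents`, and the field names `source` and `topic` are assumptions made for the example only; a real deployment would take them from its own schema and configuration.

```python
import json
from urllib.parse import urlencode


def build_update_request(base_url, collection, docs):
    """Return (url, body) for a Solr JSON update. Nothing is sent over the network."""
    # Solr accepts a JSON array of documents on the /update handler;
    # commit=true makes the documents searchable right after the request.
    url = f"{base_url}/solr/{collection}/update?commit=true"
    body = json.dumps(docs)
    return url, body


def build_facet_query(base_url, collection, text, facet_fields, rows=10):
    """Return a /select URL that asks Solr for results plus facet counts."""
    params = [("q", text), ("rows", str(rows)), ("facet", "true"), ("wt", "json")]
    # facet.field may be repeated, once per field to facet on.
    params += [("facet.field", field) for field in facet_fields]
    return f"{base_url}/solr/{collection}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical enriched document, e.g. after a linked-data enrichment step.
    docs = [{"id": "doc-1", "title": "Quarterly report", "source": "sharepoint", "topic": "finance"}]
    url, body = build_update_request("http://localhost:8983", "documents", docs)
    print(url)
    print(body)
    print(build_facet_query("http://localhost:8983", "documents", "quarterly report", ["source", "topic"]))
```

The request is only constructed here so the sketch runs without a Solr instance; sending it would require an HTTP client and a collection whose schema defines the assumed fields.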