Deduplicating itinerary activities using MinHash LSH and semantic clustering
Extract PDFs and other documents into structured itineraries
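
The MinHash LSH step named in the title can be sketched roughly as follows. This is a minimal, standard-library-only illustration, not the project's actual implementation: the function names (`shingles`, `minhash`, `candidate_pairs`, `jaccard`), the character 3-gram shingling, and the parameters `NUM_PERM` and `BANDS` are all assumptions chosen for the example. The semantic clustering stage is not shown.

```python
import hashlib
from collections import defaultdict
from itertools import combinations

# Illustrative parameters (assumptions, not taken from the project).
NUM_PERM = 64             # number of hash functions per signature
BANDS = 16                # LSH bands
ROWS = NUM_PERM // BANDS  # rows per band (4 here)


def shingles(text, k=3):
    """Return the set of character k-grams of a lowercased, whitespace-normalised string."""
    s = " ".join(text.lower().split())
    return {s[i:i + k] for i in range(max(1, len(s) - k + 1))}


def _hash(seed, token):
    """Deterministic 64-bit hash of a token under a given seed."""
    data = f"{seed}:{token}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash(tokens):
    """MinHash signature: the minimum hash value over the tokens, for each seed."""
    return tuple(min(_hash(seed, t) for t in tokens) for seed in range(NUM_PERM))


def candidate_pairs(texts):
    """Index pairs of texts whose signatures agree on at least one full band."""
    buckets = defaultdict(set)
    for idx, text in enumerate(texts):
        sig = minhash(shingles(text))
        for b in range(BANDS):
            band = sig[b * ROWS:(b + 1) * ROWS]
            buckets[(b, band)].add(idx)
    pairs = set()
    for members in buckets.values():
        pairs.update(combinations(sorted(members), 2))
    return pairs


def jaccard(a, b):
    """Exact Jaccard similarity of two non-empty sets, used to confirm candidates."""
    return len(a & b) / len(a | b)


activities = [
    "Visit the Louvre Museum",
    "visit  the louvre museum",
    "Dinner cruise on the Seine",
]
pairs = candidate_pairs(activities)
```

Candidate pairs from the banding step would normally be confirmed with an exact similarity check such as `jaccard` before merging. In practice a library such as `datasketch`, which provides `MinHash` and `MinHashLSH` classes, may be a better fit than hand-rolled hashing.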