We provide customised solutions to fit your requirements perfectly.
What kinds of content can we work with?
- Legacy content in your current search engine
- News articles (digital or paper via OCR)
- Social media (via APIs or feeds)
- Relational databases or systems based on them, such as content management systems (CMS)
- Flat files (in many formats such as Office or PDF)
- Web content (using intelligent open source crawlers and scrapers)
- XML including proprietary and in-house variants
What can we do with this content?
- Extract structured and unstructured data for indexing
- Identify entities such as names and locations
- Perform natural language processing and sentiment analysis
- Categorise documents into a taxonomy
- Monitor it with stored queries, written in a flexible syntax that captures themes, keywords and interests
- Store it in a searchable archive
- Find related content
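
As a rough illustration of the monitoring idea in the list above, the sketch below matches an incoming document against a small set of stored queries. It is a simplified example only, not our production syntax: the query names, the `all`/`none` structure and the `matching_queries` helper are hypothetical, and a real system would use a proper query parser and index rather than set comparisons.

```python
import re

def tokenize(text):
    """Lowercase the text and return its set of word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

# Hypothetical stored queries: a document matches a theme when it
# contains every term in "all" and no term in "none".
STORED_QUERIES = {
    "energy-policy": {"all": {"energy", "policy"}, "none": set()},
    "football": {"all": {"football"}, "none": {"american"}},
}

def matching_queries(text, queries=STORED_QUERIES):
    """Return the sorted names of the stored queries that match the text."""
    tokens = tokenize(text)
    return sorted(
        name
        for name, query in queries.items()
        if query["all"] <= tokens and not (query["none"] & tokens)
    )

if __name__ == "__main__":
    print(matching_queries("Football and energy policy in the news"))
```

Each new document is checked against every stored query as it arrives, which is the reverse of ordinary search, where one query is run against many stored documents.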
How can your users interact with it?
- Search it with typed queries
- Monitor it for their interests or needs
- Match it to similar items
- Filter and browse it via fields, a classification hierarchy or facets
- Construct and modify a taxonomy and move content within it
- Receive documents automatically via email alerts
- Enhance content with user tagging
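
To show what filtering and browsing by facets means in practice, here is a minimal sketch that counts the values of a field across a result set and then narrows the results to one value. The sample documents, field names and helper functions are invented for illustration; a search engine would compute these counts from its index rather than from a Python list.

```python
from collections import Counter

# Invented sample results; the fields are assumptions for this sketch.
DOCS = [
    {"title": "Budget vote", "source": "news", "topic": "politics"},
    {"title": "Cup final", "source": "news", "topic": "sport"},
    {"title": "Match reaction", "source": "social", "topic": "sport"},
]

def facet_counts(docs, field):
    """Count how many documents carry each value of the given field."""
    return Counter(doc[field] for doc in docs if field in doc)

def filter_by(docs, field, value):
    """Keep only the documents whose field equals the chosen value."""
    return [doc for doc in docs if doc.get(field) == value]

if __name__ == "__main__":
    print(facet_counts(DOCS, "topic"))
    print([doc["title"] for doc in filter_by(DOCS, "topic", "sport")])
```

The counts are what a user sees beside each facet value; choosing a value applies the filter and the counts are then recalculated for the narrower set.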
How scalable is all this and on what platform?
- Hundreds of millions of items can be stored
- Hundreds of thousands of new items can be acquired every day
- Tens of thousands of stored monitoring queries can be applied in under a second
- Costs are not based on number of documents, servers or instances
- Windows, Linux or Solaris
- Physical servers, virtual machines or the cloud
- We can work with whatever platform, language or presentation layer you are currently using