We provide customised solutions to fit your requirements perfectly

What kinds of content can we work with?

  • Legacy content in your current search engine
  • News articles (digital or paper via OCR)
  • Social Media (via APIs or feeds)
  • Relational databases or systems based on them, such as content management systems (CMS)
  • Flat files (in many formats such as Office or PDF)
  • Web content (using intelligent open source crawlers and scrapers)
  • XML including proprietary and in-house variants

What can we do with this content?

  • Extract structured and unstructured data for indexing
  • Identify entities such as names and locations
  • Perform natural language processing and sentiment analysis
  • Categorise documents into a taxonomy
  • Monitor it using a flexible syntax to encapsulate themes, keywords and interests
  • Store it in a searchable archive
  • Find related content

How can your users interact with it?

  • Search it with typed queries
  • Monitor it for their interests or needs
  • Match it to similar items
  • Filter and browse it via fields, a classification heirarchy or facets
  • Construct and modify a taxonomy and move content within it
  • Receive documents automatically via email alerts
  • Enhance content with user tagging

How scalable is all this and on what platform?

  • Hundreds of millions of items can be stored
  • Hundreds of thousands of new items can be acquired every day
  • Tens of thousands of stored monitoring queries can be applied in under a second
  • Costs are not based on number of documents, servers or instances
  • Windows, Linux or Solaris
  • Physical servers, virtual machines or the Cloud
  • We can work with whatever platform, language or presentation layer you are currently using