Use Cases

Discover how SciBite’s powerful solutions are supporting scientists and researchers.

Use Cases Overview

Gartner report

Gartner® The Pillars of a Successful Artificial Intelligence Strategy

Access report

Knowledge Hub

Explore expert insights, articles, and thought leadership on scientific data challenges.

Knowledge Hub

Resources

Discover our whitepapers, spec sheets, and webinars for in-depth product knowledge.

Resources

Events

Join us at upcoming events and webinars to learn more about SciBite solutions.

Events

News

Stay informed with the latest SciBite updates, announcements, and industry news.

News

About SciBite

Explore SciBite’s full suite of solutions to unlock the potential of your data.

Discover more about us

Our Partners

We build powerful partnerships with world-leading organizations.

Our Partners

Data cleansing to unlock the potential of bioassay data [Use case]
Tractor working on the tulip field

The business challenge

A global pharmaceutical company recognized the potential of the huge volumes of bioassay data that they had generated but struggled to gain insights from this valuable resource. A lack of standardization across their data repositories, including LIMS and other bioassay databases, had resulted in different ways to describe the same thing, for example, ‘mouse’, ’mice’, ‘Mus musculus’ and ‘m. musculus’, making it hard to collate data for a particular species. This was compounded by the fact that some database fields were sparsely populated fields while others contained useful information buried in long assay descriptions.

The SciBite solution

We enriched our species, gene, and bioassay vocabularies with customer-specific terms and synonyms to ensure all relevant information would be recognized. We then analysed the assay names from the legacy database and extracted the different entities within each one. Each entity was extracted and mapped to a single, standard vocabulary term to normalize the data.

Figure 1: Extraction of Cell Line, Drug, Species and Target entities within the unstructured titles of a selection of assays. The resulting semantic index enables connections to be made between bioassays

Key business benefits

  • Assays are consistently and unambiguously tagged with key metadata
  • Enables the wealth of information in bioassay databases to be unlocked and exploited
  • Assays are consistently and unambiguously tagged with key metadata
  • Enables the wealth of information in bioassay databases to be unlocked and exploited
Share this article
Relevant resources, events and news