Data cleansing to unlock the potential of bioassay data [Use case] - SciBite



The business challenge

A global pharmaceutical company recognized the potential of the huge volumes of bioassay data it had generated but struggled to gain insights from this valuable resource. A lack of standardization across its data repositories, including LIMS and other bioassay databases, had resulted in different ways of describing the same thing, for example, ‘mouse’, ‘mice’, ‘Mus musculus’ and ‘m. musculus’, making it hard to collate data for a particular species. This was compounded by the fact that some database fields were sparsely populated, while others contained useful information buried in long assay descriptions.

The SciBite solution

We enriched our species, gene, and bioassay vocabularies with customer-specific terms and synonyms to ensure all relevant information would be recognized. We then analysed the assay names from the legacy database, identified the different entities within each one, and mapped each entity to a single, standard vocabulary term to normalize the data.

Figure 1: Extraction of Cell Line, Drug, Species and Target entities within the unstructured titles of a selection of assays. The resulting semantic index enables connections to be made between bioassays.
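
The normalization step described above can be illustrated with a minimal Python sketch. This is not SciBite's implementation and does not use any SciBite API; it is a simple dictionary lookup, and the vocabulary entries, the example assay titles and the names `VOCAB`, `build_matcher` and `tag_assay` are all hypothetical, chosen only to show how synonyms can be mapped to a single standard term.

```python
import re

# Hypothetical mini-vocabulary: entity type -> standard term -> known synonyms.
# A real vocabulary would be far larger and curated by domain experts.
VOCAB = {
    "Species": {
        "Mus musculus": ["mouse", "mice", "Mus musculus", "m. musculus"],
        "Homo sapiens": ["human", "Homo sapiens", "h. sapiens"],
    },
    "Target": {
        "EGFR": ["EGFR", "ErbB1", "epidermal growth factor receptor"],
    },
}


def build_matcher(vocab):
    """Return a compiled regex plus a lookup from lowercased synonym
    to its (entity type, standard term) pair."""
    lookup = {}
    for entity_type, terms in vocab.items():
        for standard, synonyms in terms.items():
            for synonym in synonyms:
                lookup[synonym.lower()] = (entity_type, standard)
    # Try longer synonyms first so multi-word names win over shorter ones.
    alternatives = sorted(lookup, key=len, reverse=True)
    pattern = (
        r"(?<!\w)(?:"
        + "|".join(re.escape(s) for s in alternatives)
        + r")(?!\w)"
    )
    return re.compile(pattern, re.IGNORECASE), lookup


def tag_assay(title, matcher, lookup):
    """Extract entities from a free-text assay title and return them
    grouped by entity type, using standard terms only."""
    tags = {}
    for match in matcher.finditer(title):
        entity_type, standard = lookup[match.group(0).lower()]
        tags.setdefault(entity_type, set()).add(standard)
    return {entity_type: sorted(terms) for entity_type, terms in tags.items()}


if __name__ == "__main__":
    matcher, lookup = build_matcher(VOCAB)
    # Hypothetical legacy assay titles describing the same thing three ways.
    for title in (
        "EGFR inhibition in mice",
        "ErbB1 binding assay, M. musculus",
        "epidermal growth factor receptor activity (mouse)",
    ):
        print(title, "->", tag_assay(title, matcher, lookup))
```

In this sketch all three titles resolve to the same tags (Species: Mus musculus, Target: EGFR), which is what allows records to be collated by species or target. A plain synonym lookup like this cannot resolve ambiguous terms from context, so it should be read as an illustration of the mapping idea only.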

Key business benefits

Assays are consistently and unambiguously tagged with key metadata
Enables the wealth of information in bioassay databases to be unlocked and exploited

