1
0
mirror of https://github.com/oceanprotocol/docs.git synced 2024-11-26 19:49:26 +01:00
docs/data-science/data-challenges/participating-in-a-data-challenge.md

1.9 KiB
Raw Blame History

Participating in a data challenge

Data challenges can take a few different formats. Some challenges are built for data exploration and reporting. In these challenges, participants are tasked with analyzing a provided dataset or several and conducting an exploratory analysis to understand the hidden insights in the data. Then, they build a written report that explains their insights so that a business user can make informed decisions from the analysis. Another format is the participants being tasked with building a model to perform a given task. They will typically be provided with a dataset(although not always) that they will use to train their model. In some challenges, the participants must publish their model as a compute asset on Ocean so that the model can be run using Compute-to-Data. Here is the typical flow for a data challenge.\

  1. On Ocean Market, a dataset( or several) is published along with its description and schema. The dataset will either be provided by the data challenge sponsor partner or by OPF ourselves.
  2. Participants should download the dataset(s), or a sample if it the data is private. This will allow them to perform exploratory analysis to understand the dataset
  3. Based on the data challenge, there are several different ways participants may be tasked with entering the competition, One way is to build a report that combines data visualization and written explanations so that business stakeholders can gain actionable insights into the data. Another way is participants may be tasked with building a machine-learning model to predict a specific target value.
  4. Users will typically submit directly with Ocean Technologies. They can post their reports or algorithms onto the Ocean Market. Those that produce strong submissions can monetize their work by using Oceans Compute-to-Data engine so that the model can be run by others in the future for a price.