Skip to main content

Raw Data

Exporting Raw Data

You can use Researcher Dashboard to export data collected from data sources or activity responses. Both data sources and activity responses support export as CSV, though you have more options depending on the type of the data being exported. For example, for GPS you may also choose KML or GPX export, or for contact network data you may also choose to export as GEXF.

To export your data, open the researcher dashboard and navigate to the Data Export page:

Data Export page in Avicenna Researcher Dashboard

In this page, you start with choosing the list of participants for whom you want to export the data, and the type of data you want to export. Depending the type of data, Avicenna may ask you to choose the export format as well. For most data sources, you can download the data as a CSV file. Although for GPS you can also choose KML or GPX, and for Bluetooth Beacons you can choose GEXF as well.

After selecting the export format, you can choose the date range as well. Pressing Export will start the data export process. The export may take up to a few hours to complete, depending on the size of the data. You can always come back this page to check the status of your data export. When the data export file is ready, you will receive an email about it, and you can come back to the Data Export file to download the file.

Note that the Data Export page will also list all Survey Response export requests, even though for exporting the survey responses you need to use the Responses page, as explained in the View Responses document.

Each data export will be available for download for up to 7 days. After that, Avicenna automatically deletes the export file. If you need to download the data again, you must create a new data export.

At the moment Avicenna does not have any limitations on the size of the data export file. But for sensor data this file can be very large, specially if a long date range and many participants are chosen to be exported. So it's not uncommon for the file to surpass 10GB in size. There is no limitation at the moment on the file size, though we do request that if you expect your data export to be very large, break it into multiple requests, so it can generate multiple files.

Data Fields

The data fields you will find in each file depend on the type of data being exported. For survey responses, the list of available fields are explained in the View Responses document. For the sensor-based data sources, the list of data sources are described in their related section in the Data Sources document.

Downloading Record Counts

You can download a CSV file containing the count of all records grouped by data source. This provides a quick overview of the data collection volume across different data sources in your study.

To do that, on the Data Export page, click on Download All Data Sources Record Counts as CSV.

The downloaded CSV file includes the following columns:

  • data_source_id: The unique identifier for the data source. See the Data Source ID Reference for more details.
  • user_id: The participant's unique identifier.
  • participant_type: Either "Main" or "Test".
  • device_id: The unique identifier of the device that collected the data.
  • time_bin: The date for which the count is reported.
  • count: The number of records collected for that data source on that date.

Data Source ID Reference

The following table provides a reference for mapping data source IDs to their corresponding data sources:

IDData Source
1Accelerometer
2Ambient Temperature
3Gyroscope
4Gravity
5Light
6Linear Acceleration
7Magnetic Field
8Orientation
9Pressure
10Proximity
11Relative Humidity
13WiFi
14Bluetooth
15GPS
16Battery
19Ambient Audio
20App Usage (Legacy)
24Screen State
25Pedometer
26Activity Recognition
27Bluetooth Beacon
30Fitbit Heart Rate
33Fitbit Sleep
37Garmin Health
39HealthKit
42Garmin Health Daily
43Garmin Health Heart
44Garmin Health Respiration
45Garmin Health Sleep Daily
46Garmin Health Sleep
47Fitbit Activity Summary
48Garmin Health Pulse Ox
49Garmin Health Stress
50Garmin Health Body Composition
51Garmin Health User Metrics
52Weather
53Fitbit Activity
54Fitbit Sleep Level
55Fitbit Active Zone
58WHOOP Workout
59WHOOP Sleep
60WHOOP Recovery
61Polar Exercise
62Polar Sleep
63Polar Continuous Heart Rate
64Polar SleepWise Circadian Bedtime
65Polar SleepWise Alertness
66Fitbit Weight Log
67SensorKit Heart Rate
68SensorKit Accelerometer
69SensorKit Rotation Rate
70SensorKit Ambient Light
71SensorKit Ambient Pressure
72SensorKit Device Usage Report
73SensorKit Keyboard Metrics
74SensorKit Message Usage Report
75SensorKit On Wrist State
76SensorKit Pedometer
77SensorKit Phone Usage Report
78SensorKit Telephony Speech Metrics
79SensorKit Siri Speech Metrics
80SensorKit Visits
81SensorKit Wrist Temperature
82Hexoskin Shirt
98HealthKit Activity
99HealthKit Vital Signs
100HealthKit Sleep Analysis
103HealthKit State of Mind
105Web Activity Tracking

Direct Database Access

While Avicenna's data export allows you to create complex queries and export any data from your study as CSV, this will not cover all analysis cases. For more advanced use-cases, you may need to connect directly to the database.

We can provide direct database access to your team to query and work with your study data. At the moment, this feature is not automatically provided. If you need to have direct database access, please contact Avicenna Support.

Handling of Timezones

Every piece of information stored in Avicenna is time-stamped as appropriate. All time values are stored internally in UTC.

However, keep in mind that all participants' data exported from your study will be based on the participants' timezones. This is because presenting participants' data in their local timezones enhances researchers' ability to interpret and analyze the data accurately, aligning it with the study protocol and the participants' context.