Datasets

From Network for Advanced NMR
Revision as of 17:09, 22 May 2025 by Mmaciejewski

Data Browser: Datasets

The Dataset Browser allows users to explore and manage datasets they are authorized to access. A user can access a dataset either because it is public or because a Principal Investigator (PI) has authorized lab-based, user-based, or collaborative permissions.

The Dataset Browser includes:

  • A Navigation Pane on the left side for switching between dataset views and browsing the hierarchical organization of My & Lab Data, described below.
  • A Table View for displaying rows of datasets with columns representing metadata that may be sorted and filtered.
  • Customization Tools in the upper-right corner to configure columns, saved views, and filters.
  • An Upload Datasets button to submit datasets that were not harvested by NDTS.

Navigation Pane

The Navigation Pane allows users to quickly access datasets across different categories. Unauthenticated users see only All Public Datasets and Knowledgebase Datasets.

All Datasets
Displays all datasets the user can access, including public and permission-granted datasets.
All Public Datasets
Displays only datasets marked as public.
Knowledgebase Datasets
Displays a curated subset of public datasets that are highly annotated and intended to aid users in experimental planning and analysis.
My & Lab Data
Displays datasets accessible via user- or lab-based permissions (excluding datasets that are visible only due to being public).
This section includes a hierarchical organization mirroring a file system:
  • My Collections – personal collections created by the user.
  • Projects – high-level lab groupings.
  • Studies – project subsets for specific investigations.
  • Collections – fine-grained dataset groupings within a study.
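
The four views above can be read as simple membership rules. The following is a minimal sketch of those rules, not the actual implementation: the `Dataset` class, its field names, and the `datasets_in_view` helper are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dataset:
    # All field names are assumptions made for this sketch.
    name: str
    is_public: bool = False
    in_knowledgebase: bool = False  # curated, highly annotated public subset
    has_permission: bool = False    # user-, lab-, or collaboration-based access


# One predicate per Navigation Pane view, following the descriptions above.
VIEWS = {
    "All Datasets": lambda d: d.is_public or d.has_permission,
    "All Public Datasets": lambda d: d.is_public,
    "Knowledgebase Datasets": lambda d: d.is_public and d.in_knowledgebase,
    "My & Lab Data": lambda d: d.has_permission,
}

# Views that unauthenticated users can see.
ANONYMOUS_VIEWS = ("All Public Datasets", "Knowledgebase Datasets")


def datasets_in_view(view, datasets, authenticated=True):
    """Return the names of the datasets that a view would list."""
    if not authenticated and view not in ANONYMOUS_VIEWS:
        return []
    return [d.name for d in datasets if VIEWS[view](d)]
```

Note that a dataset that is public and also shared through a permission appears under My & Lab Data, while a dataset that is visible only because it is public does not.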

Table of Datasets

The Dataset Table displays all datasets for which the user has at least read access. Each row represents one dataset, with metadata columns that can be customized per user. A pagination control at the bottom of the table lets users move between pages and set the number of rows displayed per page: 25, 50, 100, or 500.
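
As a rough illustration of how the page-size choice affects browsing, the sketch below computes the number of pages and the rows on a given page. Only the four page sizes come from this page; the function names and the 1-based page numbering are assumptions.

```python
import math

# Page sizes offered by the pagination control.
ALLOWED_PAGE_SIZES = (25, 50, 100, 500)


def page_count(total_rows, page_size):
    """Number of pages needed to show total_rows (at least one page)."""
    if page_size not in ALLOWED_PAGE_SIZES:
        raise ValueError(f"page size must be one of {ALLOWED_PAGE_SIZES}")
    return max(1, math.ceil(total_rows / page_size))


def page_rows(rows, page, page_size):
    """Rows shown on a 1-based page number (numbering is an assumption)."""
    if not 1 <= page <= page_count(len(rows), page_size):
        raise IndexError("page out of range")
    start = (page - 1) * page_size
    return rows[start:start + page_size]
```

For example, 1,234 accessible datasets span 25 pages at 50 rows per page and 3 pages at 500 rows per page.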

Display Name / Dataset Name

The Display Name is the first column and is always visible. It defaults to the non-editable Dataset Name, which corresponds to the experiment directory name:

  • VNMRJ: `expN`
  • Bruker: `experiment/N`

Users can edit the Display Name to create a more meaningful label. When downloading, the dataset is saved using the original Dataset Name; the Display Name is saved in a `CSV` file within the downloaded folder.
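
The relationship between the two names can be sketched as follows. This is an illustration only: the layout of the `CSV` file in a download is not documented here, so the column headers and function names below are hypothetical.

```python
import csv
import io


def display_name(dataset_name, custom_label=None):
    """Display Name falls back to the Dataset Name when no label is set."""
    label = (custom_label or "").strip()
    return label or dataset_name


def name_mapping_csv(datasets):
    """Build CSV text that maps each Dataset Name to its Display Name.

    `datasets` is an iterable of (dataset_name, custom_label) pairs.
    The header names are assumptions, not the real download format.
    """
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["dataset_name", "display_name"])
    for name, label in datasets:
        writer.writerow([name, display_name(name, label)])
    return buffer.getvalue()
```

A mapping of this kind lets a downloaded folder, which keeps the original Dataset Name, be matched back to the label shown in the table.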


How to Customize the Dataset Table View

  1. Click the Wrench icon in the top-right corner.
  2. Choose Displayed Columns to select which columns are visible.
  3. Use Save View to save your configuration.
    • Views can be edited or deleted.
  4. Use Saved Views to switch between existing configurations.

How to Download Experiments

  1. Select one or more experiments using the checkbox icon or by right-clicking.
  2. Right-click the selection and choose Download.
  3. Select the download format:
    • Organized for Topspin – maintains Bruker format hierarchy.
    • Organized by Experiment – each experiment in its own folder.

How to Link Datasets to a Sample

There are two ways:

  1. From the Dataset Editor
    • Double-click a dataset you have write access to.
    • Click Find & Link Sample, select a sample, and click Save.
  2. From the Table View
    • Select datasets, right-click, and choose Link Sample.
    • Select a sample and click Save.

Quick Filters

Quick filters apply predefined views to narrow down datasets:

  • Successful Datasets Only – shows datasets marked as successful.
  • Hide Failed Datasets – hides datasets marked as failed.
  • Non-Redundant Datasets – shows datasets marked as preferred.
  • My Data – datasets owned by the logged-in user.
  • Non-public Data – datasets not made public.
  • KB Datasets – datasets published in the Knowledgebase.

To classify or mark datasets:

  • Right-click the dataset and select Classification or Redundancy from the context menu.
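
The quick filters can be thought of as predicates over each dataset's classification, redundancy, ownership, and visibility. The sketch below is an assumption-laden illustration: the `Record` fields, the string values for classification, and the choice to combine several active filters with AND are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Record:
    # Field names and values are assumptions made for this sketch.
    name: str
    owner: str
    classification: Optional[str] = None  # e.g. "successful", "failed", or unset
    preferred: bool = False               # redundancy status: marked as preferred
    is_public: bool = False
    in_knowledgebase: bool = False


def quick_filters(current_user):
    """One predicate per quick filter, following the list above."""
    return {
        "Successful Datasets Only": lambda r: r.classification == "successful",
        "Hide Failed Datasets": lambda r: r.classification != "failed",
        "Non-Redundant Datasets": lambda r: r.preferred,
        "My Data": lambda r: r.owner == current_user,
        "Non-public Data": lambda r: not r.is_public,
        "KB Datasets": lambda r: r.in_knowledgebase,
    }


def apply_filters(records, active, current_user):
    """Keep records that pass every active filter (AND is an assumption)."""
    filters = quick_filters(current_user)
    return [r.name for r in records if all(filters[n](r) for n in active)]
```

Under these rules, Successful Datasets Only drops unclassified datasets, whereas Hide Failed Datasets keeps them, which is why classifying datasets matters for the first filter.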

Context Menu

Right-clicking a dataset opens the context menu. Available actions depend on the user’s permissions; unavailable actions appear grayed out.

Available options may include:

  1. Edit Dataset
    • Update metadata, classification, or redundancy status.
  2. Reassign
    • Assign to another lab user or reject misaligned data.
    • Rejected data (within 3 months) goes to the facility manager.
  3. Download
    • Download dataset(s).
  4. NMRbox Integration
    • Copy dataset to NMRbox home directory.
  5. Supplemental Data
    • Upload related data files.
  6. Redundancy
    • Mark as preferred/redundant.
  7. Link Sample
    • Associate with a sample (shows up in the Sample column).
  8. Classification
    • Label as:
      • Calibration experiment
      • Failed – sample, instrument, or setup related
      • Successful experiment
      • Test experiment
  9. Tags
    • Add searchable tags.
  10. Notes
    • Add or edit descriptive notes.
  11. Unlink from Collection
    • Remove from a dataset collection.
  12. Make Public
    • Permanently make dataset public (cannot be undone).
  13. Publish
    • Permanently publish dataset (cannot be undone or edited).