df_util

Functions

convert_to_form(df, hed_schema, tag_form[, ...])

Convert all tags in underlying dataframe to the specified form (in place).

expand_defs(df, hed_schema, def_dict[, columns])

Expands any def tags found in the dataframe.

get_assembled(tabular_file, sidecar, hed_schema)

Load a tabular file and its associated HED sidecar file.

process_def_expands(hed_strings, hed_schema)

Gather def-expand tags in the strings/compare with known definitions to find any differences

shrink_defs(df, hed_schema[, columns])

Shrink (in place) any def-expand tags found in the specified columns in the dataframe.

convert_to_form(df, hed_schema, tag_form, columns=None)[source]

Convert all tags in underlying dataframe to the specified form (in place).

Parameters:
  • df (pd.Dataframe or pd.Series) – The dataframe or series to modify

  • hed_schema (HedSchema) – The schema to use to convert tags.

  • tag_form (str) – HedTag property to convert tags to.

  • columns (list) – The columns to modify on the dataframe.

expand_defs(df, hed_schema, def_dict, columns=None)[source]

Expands any def tags found in the dataframe.

Converts in place

Parameters:
  • df (pd.Dataframe or pd.Series) – The dataframe or series to modify

  • hed_schema (HedSchema or None) – The schema to use to identify defs

  • def_dict (DefinitionDict) – The definitions to expand

  • columns (list or None) – The columns to modify on the dataframe

get_assembled(tabular_file, sidecar, hed_schema, extra_def_dicts=None, join_columns=True, shrink_defs=False, expand_defs=True)[source]

Load a tabular file and its associated HED sidecar file.

Parameters:
  • tabular_file – str or TabularInput The path to the tabular file, or a TabularInput object representing it.

  • sidecar – str or Sidecar The path to the sidecar file, or a Sidecar object representing it.

  • hed_schema – HedSchema If str, will attempt to load as a version if it doesn’t have a valid extension.

  • extra_def_dicts – list of DefinitionDict, optional Any extra DefinitionDict objects to use when parsing the HED tags.

  • join_columns – bool If true, join all HED columns into one.

  • shrink_defs – bool Shrink any def-expand tags found

  • expand_defs – bool Expand any def tags found

Returns:

hed_strings(list of HedStrings):A list of HedStrings or a list of lists of HedStrings def_dict(DefinitionDict): The definitions from this Sidecar

Return type:

tuple

process_def_expands(hed_strings, hed_schema, known_defs=None, ambiguous_defs=None)[source]

Gather def-expand tags in the strings/compare with known definitions to find any differences

Parameters:
  • hed_strings (list or pd.Series) – A list of HED strings to process.

  • hed_schema (HedSchema) – The schema to use

  • known_defs (DefinitionDict or list or str or None) – A DefinitionDict or anything its constructor takes. These are the known definitions going in, that must match perfectly.

  • ambiguous_defs (dict) – A dictionary containing ambiguous definitions format TBD. Currently def name key: list of lists of HED tags values

Returns:

A tuple containing the DefinitionDict, ambiguous definitions, and errors.

Return type:

tuple

shrink_defs(df, hed_schema, columns=None)[source]

Shrink (in place) any def-expand tags found in the specified columns in the dataframe.

Parameters:
  • df (pd.Dataframe or pd.Series) – The dataframe or series to modify

  • hed_schema (HedSchema or None) – The schema to use to identify defs.

  • columns (list or None) – The columns to modify on the dataframe.