AberdeenProject.utilities package

Submodules

AberdeenProject.utilities.loadConfigFile module

AberdeenProject.utilities.loadConfigFile.loadConfigFile()[source]

This function parses and extracts data from the yaml configuration file

Returns:Variables defined by the user in the configuration file
Return type:Dictionary

AberdeenProject.utilities.loadPickledData module

AberdeenProject.utilities.loadPickledData.loadPickledData(filepath)[source]

This function reads the pickled representation of an object from a file

Parameters:filepath (String) – Path to the pickled file
Returns:The reconstituted dataframe
Return type:Pandas dataframe

AberdeenProject.utilities.pklToCsv module

AberdeenProject.utilities.pklToCsv.pklToCsv(pathToPklFile, pathToCsvFile)[source]

This function converts a pkl file into a csv file

Parameters:
  • pathToPklFile (String) – Path to the pkl file
  • pathToCsvFile (String) – Path to the csv file

AberdeenProject.utilities.statistics module

class AberdeenProject.utilities.statistics.Statistics(filepath)[source]

Bases: object

This class generates descriptive statistics of a Pandas dataframe

getColumnsStatistics()[source]

This function returns a Series containing counts of unique values per column

Returns:Dictionary containing all counts of unique values per column
Return type:Dictionary
getKeptColumns()[source]

This function returns the remaining columns of a Pandas dataframe

Returns:Remaining columns of a Pandas dataframe
Return type:List
getNumKeptColumns()[source]

This function returns the number of remaining columns of a Pandas dataframe

Returns:Number of remaining columns
Return type:Integer
getShape()[source]

This function returns the shape of a given Pandas dataframe

Returns:Shape of a Pandas dataframe
Return type:Tuple
getUniqueValues()[source]

This function returns all unique values per column

Returns:Dictionary containing all unique values per column
Return type:Dictionary

Module contents