Icon

Data Transformation & Analysis Profile

Data Transformation and Analysis specialists wrangle datasets to make them compatible with multiple analytical processes.


Qualified Members: 11


Competencies by Tier

Data Collection


Competency
Description
Applying data security and encryption techniques according to data governance guidelines Recalls why some data is considered sensitive or requires extra security concerns.
Recognizing legal and ethical ramifications of data protection during data collection Defines what is meant by "informed consent".
Integrating secondary data into data collection plan Defines what is meant by "secondary data". Lists examples of common operational datasets (CODs).
Critically assessing secondary data sources Recognizes the features of a reliable and credible data source. Outlines basic methods for assessing the reliability and utility of a data source.

Data Storage & Management


Competency
Description
Cleaning and manipulating data for improved quality Names common data cleaning steps that need to be performed on a dataset.
Structuring data and constructing a data model Recognizes elements of a structured table versus unstructured dataset. Lists examples of common file formats (csv, xls, etc.).
Storing data according to data governance guidelines Correctly identifies organizational/team data storage solutions. Demonstrates basic proficiency with ODK Central and KoBoToolbox.
Backing up data according to data governance guidelines Recognizes why data must be backed up and recommended intervals for backing up data.
Establishing and maintaining a content management system Describes what is meant by "content management system".
Integrating data into data model using API Defines application programming interface (API) and how it can be leveraged in information management tasks.
Preventing and mitigating risks of data breaches according to data governance guidelines Identifies when security measures should be implemented for local vs. web sharing of data. Describes that a data protection impact assessment (DPIA) is. Describes what a data breach is.
Establishing and applying user permissions according to data governance guidelines Defines what is meant by "user permissions" versus "user roles".
Establishing a geospatial database Differentiates between file storage formats and identifies pros/cons of each - topojson, shapefile, spatialite, JSON, GeoJSON, GeoDB, GeoTIFF.
Structuring vector and raster data models Describes how attribute matching takes place. Performs basic data transformation/restructuring tasks.

Data Analysis


Competency
Description
Conducting secondary data analysis Describes how to perform secondary data analysis.
Performing quantitative data analysis Outlines methods and techniques to describe and explain data (descriptive statistics). Identifies examples of structured analysis techniques. Recognizes components of the IFRC Analysis Spectrum.
Performing qualitative data analysis Identifies methods to analyze qualitative data. Sets up qualitative analysis spreadsheet.
Developing a data analysis plan using an analytical framework Explains what an analytical framework and data taxonomy are and provides reasons to use them.
Performing geographic/spatial analysis Defines basic geographic/spatial analysis functions (for example, clip, buffer, and dissolve).

Data for Decision-Making


Competency
Description
Sharing data in an appropriate and secure manner according to data governance guidelines Describes the data sharing principles used in the humanitarian sector. Explains how to comply with the various components of open data policies and data governance guidelines.
Identifying patterns and trends in data and using them to inform response analysis Recognizes ways to identify patterns in data.

Fringe Specialties


Competency
Description
Identifies appropriate applications for machine learning algorithms as a tool to strengthen operational data use Recognizes when modeling is needed and feasible. Explains the representativeness of the data and the type of problem at hand (need of supervised learning models vs unsupervised models). Can outsource machine learning request.

Data Collection


Competency
Description
Applying data security and encryption techniques according to data governance guidelines Identifies data that may be considered personally identifiable information (PII), sensitive personal information (SPI), or otherwise needing extra security considerations.
Recognizing legal and ethical ramifications of data protection during data collection Identifies examples of various power imbalances in humanitarian response situations.
Integrating secondary data into data collection plan Provides examples of common sources of secondary data and methods for finding them (for example, census, news reports). Finds and describes how to use common operational datasets (CODs).
Critically assessing secondary data sources Assesses and makes a conclusion regarding the reliability and credibility of a data source. Describes advantages and pitfalls of expert opinions. Describes the implications of various licensing for external data sources.

Data Storage & Management


Competency
Description
Cleaning and manipulating data for improved quality Conducts basic data cleaning using tools in spreadsheet software.
Structuring data and constructing a data model Designs structured tables in spreadsheet software for the collection of various data. Describes use cases for common file formats (csv, xls, etc). Describes the elements/taxonomy of a data model (i.e. entity types, attributes, relationships, etc.). Sets up a simple logical data model.
Storing data according to data governance guidelines Successfully adheres to guidelines for data entry and storage (for example, for a shared operations Dropbox). Identifies situations where various mobile data collection server options may be appropriate and the features, strengths, and weaknesses of each. Assesses compatibility of different apps with the various server options (for example, ODK Collect versus KoBo Collect). Performs the steps to upload forms to and download forms from the various servers.
Backing up data according to data governance guidelines Successfully adheres to guidelines for backing up data and contingency plans.
Establishing and maintaining a content management system Demonstrates proficiency with using a content management system.
Integrating data into data model using API Follows step-by-step guidance to access data via an API request.
Preventing and mitigating risks of data breaches according to data governance guidelines Complies with institutional/team data security protocols. Uses data collection and data management solutions structured around the principles of data protection. Identifies potential gaps in technology solutions that may impact data protection. Describes the steps involved in conducting a DPIA.
Establishing and applying user permissions according to data governance guidelines Guides and applies appropriate user permissions for database storage methods (for example, cloud storage and mobile data collection servers).
Establishing a geospatial database Uses a variety of tools for converting geographic file formats (e.g. shapefile to GeoJSON). Prepares files (tables, vector datasets, raster datasets) as tables within geospatial database.
Structuring vector and raster data models Joins tabular data to spatial data through attribute matching. Joins multiple dataset attributes to one data layer using spatial or tabular attributes.

Data Analysis


Competency
Description
Conducting secondary data analysis Describes how and when to use secondary data analysis tools.
Performing quantitative data analysis Prepares data for analysis (for example, transformation or restructuring of data). Describes the IFRC Analysis Spectrum and main structured analysis techniques. Applies basic descriptive statistics (for example mean, median, mode, average, etc.) and identifies their limitations (that is, levels 1 and 2 of the IFRC Analysis Spectrum). Clearly identifies and explains underlying assumptions in own analysis.
Performing qualitative data analysis Prepares data for analysis (for example, transformation or restructuring of data). Develops codebooks and applies inductive and/or deductive coding techniques to a qualitative data set. Clearly identifies and explains underlying assumptions in own analysis.
Developing a data analysis plan using an analytical framework Supports development of a data analysis plan using an analytical framework. Illustrates how to produce a map of stakeholders that demonstrates their interactions with the data cycle (for example, data owners, contributors, and consumers).
Performing geographic/spatial analysis Applies basic geographic/spatial analysis functions (for example, clip, buffer, and dissolve). Describes cartographic standards and conventions. Describes coordinate systems and map projections and their impacts on distance, size, area, and shape.

Data for Decision-Making


Competency
Description
Sharing data in an appropriate and secure manner according to data governance guidelines Successfully adheres to data sharing principles used in the humanitarian sector. Produces examples of data available via the International Aid Transparency Initiative (IATI). Complies with institutional open data policy and data governance guidelines.
Identifying patterns and trends in data and using them to inform response analysis Applies appropriate strategies to identify patterns in data. Detects data gaps (what is needed to be able to conduct the analysis and make the products).

Fringe Specialties


Competency
Description
Identifies appropriate applications for machine learning algorithms as a tool to strengthen operational data use Demonstrates proficiency in the classic ML models (such as linear regression, k-means, decision trees, naive bayes, support vector machines), being able to discern which is the most adequate model for a specific task, and when a result is good enough or not. Utilizes Business Intelligence Software or other available ML software, to gain insight from the available data at hand.

Data Collection


Competency
Description
Applying data security and encryption techniques according to data governance guidelines Implements basic file encryption using software features and understanding of the limitations of such methods (for example, password protection on Microsoft Office documents).
Recognizing legal and ethical ramifications of data protection during data collection Advocates for matching data collected to data needs, that is, collecting only what you need and for which you have an analysis plan.
Integrating secondary data into data collection plan Identifies and triangulates different sources for a given data interest (for example, damage estimates for a given area from government, UN, NGOs, etc.).
Critically assessing secondary data sources Identifies main sources of errors and ways to mitigate them.

Data Storage & Management


Competency
Description
Cleaning and manipulating data for improved quality Cleans data using software developed specifically for that purpose according to data standards. Creates guidelines and basic controls to improve data collection quality (for example, written guidance or dropdowns in a spreadsheet template).
Structuring data and constructing a data model Differentiates between relational and non-relational databases. Uses advanced file formats for data collection and transformation (for example, macro-enabled workbooks). Sets up a simple physical data model.
Storing data according to data governance guidelines Designs the organized storage of files in such a way that other people can use and find them (for example, a logical folder hierarchy for cloud storage). Sets up and uses ODK Central. Implements sharing, user permissions, and authentication requirements for the various mobile data collection servers.
Backing up data according to data governance guidelines Designs and implements data backup and contingency plans.
Establishing and maintaining a content management system Describes the core functionalities necessary in a content management system.
Integrating data into data model using API Establishes a live connection between a web API and a database or data visualization tool.
Preventing and mitigating risks of data breaches according to data governance guidelines Implements mitigation measures to avoid a data breach or minimize its impact. Detects when data breaches occurs and notifies the proper channels. Participates in a DPIA or in the investigation of a data breach.
Establishing and applying user permissions according to data governance guidelines Guides and applies appropriate user permissions for database storage methods (for example, cloud storage and mobile data collection servers).
Establishing a geospatial database Indexes spatial datasets in a manner that supports performant queries.
Structuring vector and raster data models Introduces raster, vector, satellite/aerial image data into data model, as appropriate.

Data Analysis


Competency
Description
Conducting secondary data analysis Conducts secondary data analysis using appropriate statistical processes.
Performing quantitative data analysis Identifies and uses appropriate, statistically valid approaches to summarize and interpret available data. (that is, level 3 of the IFRC Analysis Spectrum). Performs structured analysis. Critically assesses an analysis for underlying assumptions that may not be clearly stated. Assesses and clearly communicates the limitations of the conclusions of an analysis. Identifies ways to mitigate the effects of sources of errors (for example, ways of thinking, and structured analytical techniques). Applies appropriate techniques to assess uncertainty and confidence in analysis results.
Performing qualitative data analysis Applies basic qualitative analysis methods and identifies their limitations (for example, content or thematic analysis). Performs basic functions with a qualitative analysis software. Critically assesses an analysis for underlying assumptions that may not be clearly stated. Assesses and clearly communicates the limitations of the conclusions of an analysis.
Developing a data analysis plan using an analytical framework Adapts an analytical framework for a specific operational context. Identifies when geospatial attributes are required for analysis or reporting.
Performing geographic/spatial analysis Performs geographic/spatial analysis (for example, nearest neighbor, overlay). Conducts raster data analysis (for example, supervised classification, and raster calculations). Writes queries within spatial context.

Data for Decision-Making


Competency
Description
Sharing data in an appropriate and secure manner according to data governance guidelines Describes the methods, formats, and protocols for improved interoperability, sharing, and accessibility both internally and with other humanitarian partners (HXL, metadata, ISO standards, common operational data models, place codes, IATA Standards). Shares and accesses data through several platforms (Humanitarian Data Exchange (HDX), Humanitarian Response Info, Open Street Map (OSM), Relief Web). Develops data sharing and open data guidance/job aids.
Identifying patterns and trends in data and using them to inform response analysis Recognizes patterns, trends, outliers and anomalies in datasets.

Fringe Specialties


Competency
Description
Identifies appropriate applications for machine learning algorithms as a tool to strengthen operational data use Displays familiarity with different data structures such as stacks, queues, arrays, linked lists, graphs, trees, and multi-dimensional arrays. Utilizes advanced data science methods from standard libaries and available code repositories and/or software. Uses cloud computing when necessary. Applies skills in natural language processing, image, audio and video processing. Recognizes the different components in a neural network: dataset, architecture, loss function, optimization method, and performance metrics.

Data Collection


Competency
Description
Applying data security and encryption techniques according to data governance guidelines Applies advanced encryption techniques for online and offline data collection, storage, and transfer.
Recognizing legal and ethical ramifications of data protection during data collection Designs program/project requirements to follow various legal frameworks (for example, General Data Protection Regulation (GDPR)). Communicates the implications of drone and satellite data imagery collection on privacy, including compliance with the rules and regulations.
Integrating secondary data into data collection plan Integrates secondary data, targeted at specific audiences and decision points, into data collection plan. Selects appropriate technology and strategies to collate and store secondary data.
Critically assessing secondary data sources Comprehensively explains advantages and limitations of secondary data review

Data Storage & Management


Competency
Description
Cleaning and manipulating data for improved quality Designs quality assurance/ quality control (QA/ QC) procedures. Uses advanced skills with scripting languages for data manipulation.
Structuring data and constructing a data model Applies advanced skills with different database options. Creates advanced file formats for data collection and transformation (for example, macro-enabled workbooks). Utilizes contextually-appropriate data modeling techniques (i.e. hierarchical, object-oriented, network, entity-relationship, relational).
Storing data according to data governance guidelines Demonstrates flexibility to adapt a data management system and associated processes (for example, due to changed demands and context of a red disaster). Installs ODK Central on a cloud server. Installs KoBoToolbox on a cloud server. Sets up https for a cloud-based server.
Backing up data according to data governance guidelines Creates a data lifecycle plan that covers all stages from planning and collection through to deletion.
Establishing and maintaining a content management system Sets up and configures a content management system (for example, Wordpress, Drupal, and Sharepoint). Administers content management systems.
Integrating data into data model using API Integrates data systems with external APIs such as those built into data sharing platforms or mobile data collection servers.
Preventing and mitigating risks of data breaches according to data governance guidelines Manages implementation of a DPIA. Evaluates possible responses when data breaches occur (that is, minimizing threats, avoiding further compromise of systems, and protocols for communication with those affected). Conducts regular data vulnerability checks. Trains others on data security protocols.
Establishing and applying user permissions according to data governance guidelines Develops decision trees, threshold guidance, or other tools to guide data protection integration with business processes. Regularly reviews user permissions to ensure accuracy (i.e. removing permissions for former employees).
Establishing a geospatial database Converts datasets to different serialization formats as needed (binary, text, XML, JSON). Identifies appropriate use cases for spatial database management systems and implements accordingly.
Structuring vector and raster data models Trains others to construct an effective and queryable geographic data model.

Data Analysis


Competency
Description
Conducting secondary data analysis Structures databases relevant for the Secondary Data Review (SDR) during a natural or protracted crisis (sudden change of situation).
Performing quantitative data analysis Applies advanced statistical methods/inferential statistics and modeling to identify correlations, variances, path analyses, and to anticipate and prescribe data (that is, forecast and predict the evolution of a situation as described in levels 4 and 5 of the IFRC Analysis Spectrum). Authors and explains an appropriate, clear, and concise analysis report that will benefit and be accepted by the intended audiences and various stakeholders. Identifies and selects appropriate structured analytical techniques to mitigate the impact of biases (for example, processing, group, and selection biases).
Performing qualitative data analysis Demonstrates familiarity with a variety of qualitative analysis methods and when they are appropriate to use (for example, cross-case analysis, grounded theory, etc.). Applies advanced skills with a qualitative analysis software and is able to teach others to conduct analysis using that software.
Developing a data analysis plan using an analytical framework Develops an analysis or tabulation plan targeted at specific audiences and decision points. Leads development of a contextualized analytical framework.
Performing geographic/spatial analysis Conducts advanced geographic/spatial analysis (for example, voronoi diagram, nearest neighbor, calculating the center point (centroid) of polygons, boolean operations between multiple overlapping shapes (intersect, difference, union)). Applies spatial statistics (for example, spatial autocorrelation). Uses various scripting languages to process and analyze geospatial data. Describes the modifiable aerial unit problem (MAUP) and how to account for it in analysis.

Data for Decision-Making


Competency
Description
Sharing data in an appropriate and secure manner according to data governance guidelines Owns and monitors compliance with data sharing agreements. Trains others on data sharing best practices and open data. Adapts and contextualizes an open data policy for a specific group or institution.
Identifying patterns and trends in data and using them to inform response analysis Supports response option analysis (for example, suggesting programs to respond to identified needs). Identifies response gaps and provides recommendations for further action.

Fringe Specialties


Competency
Description
Identifies appropriate applications for machine learning algorithms as a tool to strengthen operational data use Propose new machine learning models for tackling specific challenges (e.g. use of active learning for data scarcity). Describes computability and complexity. Applies knowledge of computer architecture such as memory and clusters.