WIP
Most Common
Mapper / Cross walk (likely the most common activity)
- Map field name A → B
Filter
- Discard field, row, etc
- Validation
- Normalization
ID Replacement
- Replace an ID with another. Usually from some structural difference in schemas.
Aggregator
- Add known values from source B not present in source A
- Join data into aggregate roots
Less likely to be needed
De-duplication
- Identify and remove duplicates
Pivot
- e.g. pivot table. Often used to de-normalize
Supporting Tech?
- Key Question is we know that Nifi /can/ do these things, the question is it the best tool for this job? Key concern with Nifi has been managing it.