Data Processing
cleaning
- remove missing data
- remove duplicate data
- remove irrelevant and inconsistent data
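
A minimal plain-Python sketch of the three cleaning steps above. The sample records, the field names, and the `clean` helper are all hypothetical, and this version normalises inconsistent values (upper-casing a country code) instead of deleting them, which is only one possible reading of "remove inconsistent data".

```python
# Hypothetical sample records; field names are illustrative only.
records = [
    {"id": 1, "name": "Ada", "country": "UK", "debug": "x"},
    {"id": 2, "name": None, "country": "US"},    # missing value
    {"id": 1, "name": "Ada", "country": "UK"},   # duplicate of the first row
    {"id": 3, "name": "Lin", "country": "uk"},   # inconsistent casing
]


def clean(rows, required=("id", "name", "country")):
    """Drop incomplete rows, irrelevant fields, and duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        # Remove rows with missing data in any required field.
        if any(row.get(field) in (None, "") for field in required):
            continue
        # Keep only the relevant fields (assumption: everything else is irrelevant).
        kept = {field: row[field] for field in required}
        # Make inconsistent values comparable before checking for duplicates.
        kept["country"] = str(kept["country"]).strip().upper()
        key = tuple(kept[field] for field in required)
        # Remove duplicates, keeping the first occurrence.
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(kept)
    return cleaned


cleaned = clean(records)
```

Normalising before the duplicate check matters: otherwise "UK" and "uk" rows would be treated as different records.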
enriching
- join data from third-party sources to enrich existing records
- aggregate data to find patterns
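
The two enriching steps above could look roughly like this in plain Python. The `orders` data, the `third_party` lookup, and the helper names `enrich` and `total_by` are assumptions made up for illustration; a real third-party source would usually be an API or a separate dataset.

```python
from collections import defaultdict

# Hypothetical first-party data.
orders = [
    {"order_id": 1, "customer_id": "c1", "amount": 20.0},
    {"order_id": 2, "customer_id": "c2", "amount": 35.0},
    {"order_id": 3, "customer_id": "c1", "amount": 15.0},
]

# Hypothetical third-party data, keyed by customer_id.
third_party = {"c1": {"region": "EU"}, "c2": {"region": "US"}}


def enrich(rows, lookup):
    """Attach third-party attributes to each row; unknown customers get a default."""
    default = {"region": "unknown"}
    return [{**row, **lookup.get(row["customer_id"], default)} for row in rows]


def total_by(rows, key, value):
    """Aggregate: sum the `value` field per distinct `key`."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += row[value]
    return dict(totals)


enriched = enrich(orders, third_party)
totals_per_region = total_by(enriched, "region", "amount")
```

The aggregation only becomes possible after enrichment, since `region` does not exist in the original orders.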
services provided
- Google Dataproc (Spark & Hadoop support)
  - report system
  - batch processing
- Google Dataflow (Apache Beam)
  - batch & chain processing
  - real-time processing
- Google Dataprep
  - visualise data
  - identify redundant data
  - quick processing
  - JSON -> table
- Cloud Data Fusion
  - simplify building ETL pipelines
  - low-code
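
To make the JSON-to-table conversion listed under Dataprep concrete, here is a small standard-library sketch of the idea. It is not Dataprep's API or implementation; the `flatten` and `json_to_table` helpers and the dotted column naming are assumptions chosen for illustration.

```python
import csv
import io
import json


def flatten(obj, prefix=""):
    """Turn nested dicts into one flat dict with dotted column names."""
    flat = {}
    for key, value in obj.items():
        name = f"{prefix}{key}"
        if isinstance(value, dict):
            flat.update(flatten(value, prefix=f"{name}."))
        else:
            flat[name] = value
    return flat


def json_to_table(json_text):
    """Convert a JSON array of objects into CSV text (one row per object)."""
    rows = [flatten(item) for item in json.loads(json_text)]
    # Collect columns in first-seen order so the header is stable.
    columns = []
    for row in rows:
        for column in row:
            if column not in columns:
                columns.append(column)
    buffer = io.StringIO()
    # restval fills cells for columns that a given row does not have.
    writer = csv.DictWriter(buffer, fieldnames=columns, restval="", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


sample = '[{"id": 1, "user": {"name": "Ada"}}, {"id": 2, "user": {"name": "Lin", "city": "Oslo"}}]'
table = json_to_table(sample)
```

Rows that lack a column (here, the first record has no `user.city`) get an empty cell, which is the kind of gap the cleaning steps would then handle.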