Skip to content

Data Processing

cleaning

  • removing missing data
  • remove duplicate data
  • remove irrelevant and inconsistent data

enriching

  • connect data from third party to enrich data
  • aggregate data to find patterns

services provided

  • google dataproc (spark & hadoop support)
    • report system
    • batch processing
  • google dataflow (apache beam)
    • batch & chain processing
    • real-time processing
  • google dataprep
    • visualise data
    • identify redundant data
    • processing for quick
    • json -> table
  • cloud datafusion
    • simplify building ETL pipelines
    • low-code

On this page