Big Data on the Google Cloud – Apache Beam, DataFlow, BigQuery

If you work with a huge amount of data, from either the analyst or the developer side, and you are always looking at what’s next regarding Big Data technologies, come join us and explore Apache Beam and the Google BigData platform on Doctusoft’s workshop.

Get introduced to the Apache Beam model

Build and execute batch and streaming pipelines using Google Cloud Dataflow and other Big Data service on the GCP platform such as Cloud Pub/Sub and Google BigQuery

See how easy to run your pipelines on other runners like Apache Spark

Learn about real business use cases and project experiences.

Get the full picture about how these Big Data products differ from other well-known solutions and know which one to choose to suit your business needs or the technological requirements you work with.

Whether you come from a small start-up or a big multinational company, this workshop is useful for anyone who wants to learn first-hand how to deal with Apache Beam and Google’s Big Data services.

Participants should have a technology background, basic programming skills in Java and be open to sharing their thoughts and questions.

Participants will need to bring their own laptops and have a free Google account. Further information about the technical environment will be communicated after registration.

Technical requirements:

A laptop with 4 GB RAM, 10 GB free disk space

Oracle VirtualBox 5+ installed

A free Google account

A GCP free-trial registration – https://cloud.google.com/free-trial.
if somebody already have been used this trial, contact us.

Csaba Kassai

Csaba has been a software architect at Doctusoft ‒ the only Google Cloud Platform partner in Hungary ‒ for 6 years. He has participated in several Big Data projects, solving the problems of different retail, telecommunication, and start-up companies using Google and Hadoop technologies. He has also worked for one of the biggest banks in Hungary on Big-Data-focused projects such as optimizing the query time of the transaction history database with ElasticSearch. Csaba’s main professional interests are Google’s Big Data products and their related programming languages and database technologies.