Sii Poland

SII UKRAINE

SII SWEDEN

  • Trainings
  • Career
Join us Contact us
Back

Sii Poland

SII UKRAINE

SII SWEDEN

Back

Apache Spark – building systems for real-time data processing

Language Polish, English

  • The number of participants 8-15 people
  • Duration 2 days

Why take this course

If you want to learn a distributed computing platform and build applications that process large amounts of data, then this course is for you. You’ll gain knowledge and an understanding of Apache Spark’s architecture and learn how to build scalable applications. You will be able to create data pipelines using Apache Spark and discover modern concepts that solve problems in the world of Big Data, such as Delta Lake.

What you'll learn

  • Create applications using Apache Spark.
  • Build applications that process large volumes of data.
  • The architecture of the Apache Spark platform.
  • The development environment.
  • The principles of building scalable applications.
  • How to build data pipelines using Apache Spark.
  • Modern concepts that address many common problems in the world of Big Data (Delta Lake).

Certification & Exam

Upon completion of the training, you will receive a certificate of participation, confirming your skills in creating data processing applications with Apache Spark. There is no final exam—all you need is active participation in the classes.

Who is this course for

This course is designed for anyone involved in the software development process and members of project teams. The course is also for developers who know the basics of programming and want to learn the fundamentals of the Scala language, which are necessary for writing effective applications using Apache Spark.

Topics covered

  • Apache Spark – architecture
  • Historical overview
    • Solution architecture
    • Running the application
    • Monitoring
    • Troubleshooting / debugging
  • Data processing with Apache Spark
    • RDDs, DataFrames, and DataSets
    • Spark SQL
    • Joins
    • File formats
    • Data aggregation
  • Preparing the development environment to work with Apache Spark (part conducted using Scala)
    • Working with IntelliJ
    • Introduction to SBT
    • Passing parameters / configuration using external libraries
    • Code testing
  • Delta Lake – a format that facilitates data processing
    • Introduction to the concept
    • Writing / reading data using the Delta format
    • The most important functions and differences compared to classic files (Parquet / ORC)
Interested in training?
Contact us to get more information

Contact our expert

Your file

Uploaded file:
  • file_icon Created with Sketch.

Acceptable files: doc, docx, pdf. (max 5MB)
Please submit your file in DOC, DOCX or PDF format
The upload size is limited to 5 MB
File is empty
File was not uploaded

At any time, you may withdraw your consent to the processing of personal data, but such withdrawal shall not affect the legal compliance of any processing of such data, which had occurred before you withdrew your consent. Detailed information on the processing of your personal data is specified in the Privacy Policy.

Anna

Public trainings coordinator

Your message was sent successfully

We will look over your message and get back to you as soon as possible

Sorry, something went wrong and your message was not delivered

Refresh the page and try again. Contact us, if problem occurs again

We’re sorry, but the selected file appears to be damaged and we can't process it.

Please try uploading a different copy or a new version of the file. Contact us, if problem occurs again.

Processing…

ITIL® and PRINCE2® are registered trademarks of AXELOS Limited, used under permission of AXELOS Limited. All rights reserved. AgilePM® is a registered trademark of Agile Business Consortium Limited. All AgilePM® Courses are offered by Sii, an Affiliate of Eraneos Iberia S.L.U., an Accredited Training Organization of The APM Group Ltd. Lean IT® Association is a registered trademark of the Lean IT Association LLC. All rights reserved. Sii is an Affiliate of Accredited Training Organization Eraneos Iberia S.L.U. SIAM™ is a registered trademark of EXIN Holding B.V. All prices presented on the website are net prices. 23% VAT should be added.

Get in touch Find training

Änderungen im Gange

Wir aktualisieren unsere deutsche Website. Wenn Sie die Sprache wechseln, wird Ihnen die vorherige Version angezeigt.

Einige Inhalte sind nicht in deutscher Sprache verfügbar.
Sie werden auf die deutsche Homepage weitergeleitet.

Möchten Sie fortsetzen?