Apache Avro: Data Serialization for Distributed Applications Training Course
Course
In City of London
Description
- Type: Course
- Location: City of London
This course is intended for
Developers
Format of the course
Lectures, hands-on practice, small tests along the way to gauge understanding
Subjects
- Apache
Course programme
Principles of distributed computing
- Apache Spark
- Hadoop
Principles of data serialization
- How a data object is passed over the network
- Serialization of objects
- Serialization approaches
- Thrift
- Protocol Buffers
- Apache Avro
- data structure
- size, speed, format characteristics
- persistent data storage
- integration with dynamic languages
- dynamic typing
- schemas
- untagged data
- change management
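As a rough illustration of the "schemas" and "untagged data" topics above, the following minimal sketch hand-encodes one record using the binary rules from the Avro specification: longs are written as zigzag varints, strings as a length followed by UTF-8 bytes, and record fields are concatenated in schema order with no field names or tags. It deliberately avoids the official Avro libraries and supports only two primitive types. The `User` schema and the helper names (`encode_long`, `encode_string`, `encode_record`) are assumptions made for this example, not part of the course material.

```python
import json

# Hypothetical schema, invented for this sketch.
SCHEMA = json.loads("""
{
  "type": "record",
  "name": "User",
  "fields": [
    {"name": "name", "type": "string"},
    {"name": "age",  "type": "long"}
  ]
}
""")


def encode_long(n: int) -> bytes:
    """Zigzag-encode a signed 64-bit integer, then write it as a base-128 varint."""
    z = (n << 1) ^ (n >> 63)  # zigzag: small magnitudes map to small unsigned values
    out = bytearray()
    while z & ~0x7F:
        out.append((z & 0x7F) | 0x80)  # low 7 bits, continuation bit set
        z >>= 7
    out.append(z)
    return bytes(out)


def encode_string(s: str) -> bytes:
    """A string is its UTF-8 byte length (as a long) followed by the bytes."""
    raw = s.encode("utf-8")
    return encode_long(len(raw)) + raw


# Only the two primitive types used by the sketch schema are handled here.
ENCODERS = {"long": encode_long, "string": encode_string}


def encode_record(schema: dict, datum: dict) -> bytes:
    """Concatenate field values in schema order; no names or tags are written."""
    return b"".join(
        ENCODERS[field["type"]](datum[field["name"]])
        for field in schema["fields"]
    )


encoded = encode_record(SCHEMA, {"name": "Ada", "age": 36})
print(encoded)
```

Because nothing in the payload identifies the fields, a reader needs the writer's schema to decode the bytes. That is why Avro data files and RPC handshakes carry the schema alongside the data.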
Data serialization and distributed computing
- Avro as a subproject of Hadoop
- Java serialization
- Hadoop serialization
- Avro serialization
Using Avro with
- Hive (AvroSerDe)
- Pig (AvroStorage)
Porting Existing RPC Frameworks
