Why Should You Take Big Data?
Think of a business that relies on quick, agile decisions to stay competitive, and most likely big data analytics is involved in making that business tick. 53% Of Companies Are Adopting Big Data.
The average base pay for at least six big data skills itself is well over $120,000 a year.
Big data has seen massive exponential growth leading to numerous career opportunities.
It Stretches Your Mind, Think Better And Create Even Better.
Introduction to Big Data
Why Big Data?
Characteristics of Big Data – 4 Vs
Applications of Big Data
Introduction to Hadoop
HDFS – Hadoop Distributed file system
Components of HDFS
HDFS high availability
Role of zoo keeper
Replica pipeline and network distance algorithm
HDFS Read and Write
Installing Hadoop in Windows/Mac using Cloudera Quickstart VM
Introduction to Map Reduce Framework
Mapper and Reducer APIs
First Map Reduce program – Word Count
Map Reduce examples – Inverted Index and Titanic Data Analysis
Modes of execution
Job execution in MRV1 VS YARN
Serialization and Deserialization
Input and Output Formats
Using Partitioner and Combiner classes
Joins – Map side Join and Reduce side Join
Sequence File Format
Optimizing techniques of MR jobs
Hadoop Streaming and Pipes
Introduction to Hive
RDBMS VS Hive
Hive DDL : Managed Table VS External Table
Issues with delimiters
Partioning – Static and Dynamic
Dealing JSON data – using JSON SerDe
File Formats – Avro, Parquet, ORC
Introduction to Pig
Why Pig ?
Motivation by example
SQL vs Pig
Modes of Running Pig
Introduction to Pig Latin
Pig Latin Data types
Pig Latin Operators
Type casting and validation
Process of Pig Latin Processing
Pig UDF with example
What is a No SQL Database ?
Why Hbase ?
Introduction to Hbase
Hbase high level architecture
Indepth architectural view of Hbase
Java APIs for Hbase operations
Bulk Load using Table Mapper and Table Reducer API
Bulk Load using import TSV tool from a fi
Introduction to Sqoop
Sqoop imports and Exports With Examples
Introduction to Oozie
Oozie Action Tags
Flume – Spooling Directory
Programming concepts of Scala
Introduction to Spark
Why Spark ?
Applications of Spark
Architecture of Spark
Transformations and Actions
Spark SQL – Data Frames , Data Sets and SQL
Realtime streaming with Kafka and Spark Streaming
Learn and absorb new things with creative projects.
Batch Processing of e-commerce data using Hadoop stack
For large and small retailers, it is essential to be able to not only react to, but also accurately predict the trends and nuances in the market. Data analytics is done on the retail dataset in order to improve the profit of the organization.
Real Time Data Ingestion and processing of Social Media data using Hadoop stack
Data Ingestion of social media data like facebook/twitter to analyse the trends of various products.
Life at Digital Lync
The environment at Digital Lync is colorful and creative. It is where ideas are incubated and generated. An apt place to explore yourself.
Inspiring student stories.
Here are stories of real knowledge, real people, and real innovation.
Come and chat with us about your goals over a cup of coffee
1st Floor, Plot No: 6-11, survey No., 40 Khajaguda, Naga Hills Rd, Madhura Nagar Colony, Gachibowli Hyderabad, Telangana 500008
Phone: +91 8688444666
Address: #106 & 107, Manjeera Trinity Corporate. Near Manjeera Mall, Kukatpally, Hyderabad, Telangana 500072
Phone: +91 8688444666
11, Pusat Dagang Seksyen 16 Seksyen 16, 46350 Petaling Jaya Selangor, Malaysia
Phone: +60 80112 44239
#23664, Richland Grove Dr, Ashburn, VA 20148