Victor Oketch Sabare – Medium

Victor Oketch Sabare

Comparing Spark and MapReduce: The Pros and Cons of Two Popular Big Data Processing Frameworks on…

Spark and MapReduce are both popular big data processing frameworks that run on the Hadoop ecosystem. Both have their own unique features…

Jan 9, 2023

Unlocking the Power of Big Data Processing with Resilient Distributed Datasets

A resilient distributed dataset (RDD) is a fundamental data structure in the Apache Spark framework for distributed computing. It is a…

Jan 10, 2023

Boosting R Performance with the Parallel Processing Package snow

Understanding Parallel Computing

Sep 3, 2024

Exploring Outliers, Leverage, and Influence

Unveiling Hidden Insights in Data Analysis

Jul 6, 2023

Building a Data Pipeline for Blockchain Data with Apache Kafka and Apache Flink

The rise of blockchain technology has brought about an explosion in the amount of data being generated and consumed by blockchain networks…

Mar 16, 2023

Introduction to Streamlit for Data Engineering

Data engineering is a critical aspect of any data-driven organization, where data scientists and analysts work with large amounts of data…

Mar 12, 2023

Published in
Towards Data Engineering

Building a Real-time Fraud Detection System with Apache Kafka and Apache Storm — A Step-by-Step…

Introduction

Jan 31, 2023

Building a data pipeline for natural language processing with Apache Kafka and Apache Spark.

Are you tired of slow and clunky data pipelines for your natural language processing (NLP) projects? Well, buckle up because we have the…

Jan 31, 2023

Published in
Towards Data Engineering

Building a Scalable and Real-time Data Pipeline for Social Media Analytics with Apache Kafka and…

Introduction

Jan 16, 2023

Mastering Missing Data: A Comprehensive Guide with Code Examples and Illustrations on How to Handle…

Handling missing data in a data pipeline can be a tricky task, but with the right approach, it can be effectively managed. In this article…

Jan 13, 2023

Victor Oketch Sabare

Data Geek passionate about Data Science, Data Engineering and Artificial Intelligence. Keen on using data to drive business insights and improve efficiency.
