Looks Like This Service Is On Hold

I will scaling your nextgen big data services

United Kingdom

I speak English, Persian, German

5 orders completed

I have more than ten years experience in machine learning, data science and AI. Among many packages my best expertise is with Python, R, SPSS modeler and weka. I have worked with Big Data in Stream Pr...
About this Gig

Welcome to the world of data lakes! I'm Ali Reza, and I'm here to help you design, build, and manage your data lake infrastructure. Whether it's HDFS, AWS S3, or Azure Data Lake Storage, I've got you covered.


I am working as a Big Data System Analyst and I am fully aware of the problems that clients are facing while using HDFS, AWS S3, or Azure Data Lake for their CI/CD pipelines. The ETL of streams is computationally expensive and more challenging in terms of efficiency and runtime in regards of acquiring extraction, transformation and loading in distributed aspects like IoT devices on API. Malfunctions can be occured, with the concurrency of heap sized redundencies on the testing types. I am skilled with the following Technologies/Tools :


1. Apache Spark

2. Apache Flink

3. Apache Kafka

4. PyFlink

5. PySpark

6. Data Warehousing

7. Hive and related languages

8. Hadoop Engineering

9. HDFS and other file systems responsible for Big Data

10. Github containerization by Pushing container images to Docker Hub

11. Handle Various File Formats like JSON, CSV, Text Files, Sequence Files etc. or other unknown Scheme

Expertise:

API integration

Automations

Big data

Classification

ETL

Technology:

Amazon Redshift

Apache Hadoop

Apache Spark