
4 AI and Analytics trends to watch for in 2020-2021
Never did we imagine the fictional robotic characters in novellas to become a reality. However, we wished, didn’t we? The theory of ‘Bots equal to…
Never did we imagine the fictional robotic characters in novellas to become a reality. However, we wished, didn’t we? The theory of ‘Bots equal to…
Apache Hive is a data warehouse software project built on top of Apache Hadoop for the querying of large data systems in the open-source Hadoop…
Big Data refers to all the data that is generated across the globe at an unprecedented rate. This data could be either structured or unstructured.…
Have you heard of Cassandra? Wikipedia describes her quite aptly: "Apache Cassandra is a free and open-source distributed NoSQL database management system designed to handle large amounts…
Big data is a big phenomenon—one that can overwhelm you in an unimaginably large scale. With the progress of technology firms through the Internet, it…
In the plethora of monitoring tools, ranging from open source to paid, it is always a difficult choice to decide which one to go for.…
1. What is Apache Spark RDD? Apache Spark RDD stands for Resilient Distributed Datasets. RDD is a fault tolerant, immutable collection of elements which can…
MSys Advanced Log Analytics MSys Technologies' lab is developing a Log Analytics tool which will collect logs, store the logs and do the analytics on the…
The Problem Statement The Otto Product Classification Challenge was a competition hosted on Kaggle, a website dedicated to solving complex data science problems. The purpose…
In a previous blog post we have seen what Apache Mesos is and how it helps to create dynamic partitioning of our available resources which…
Let’s start with an introduction to what IT across the globe calls “big data.” From a use case perspective, few terms are so overused and…
Today, every IT-related service online or offline is driven by data. In the last few years alone, explosion of social media has given rise to…