Cathyos provides Big data services, big data consulting and Big data resources
on following big data technology stack:
Apache Hadoop is a set of open-source distributed processing framework that manages data processing and
storage for massive amounts of data applications running in clustered.
MapReduce is a programming model and a processing technique for the distributed algorithm based on java
platform. Map R or MapReduce is the heart of Apache Hadoop. MapReduce is a programming paradigm for easily
writing applications which processes and generates big data sets in-parallel on clusters.
HBase is an open-source and column-oriented database management system that runs on top of Hadoop
Distributed File System (HDFS). Hbase is written in Java and is a non-relational and distributed database
modeled after Google's Bigtable.
Oracle NoSQL Database is the simplest key-value database from Oracle Corporation. It is a is a NoSQL-type
distributed key-value store in which every single item in the database is stored as a key and value. It
provides mechanisms for data manipulation, horizontal scalability, and simple administration and
monitoring. Cathyos teams have expertise in Mongo, ReThink, and Cassandra DBs too.
Cloudera is the largest organization and the first one to develop and distribute Apache Hadoop based
software. It has the largest user base with many customers to their belt. The Cloudera Management suite
provides the users real-time node counts, reduced deployment time and many more.
Hortonworks is the only enterprise to offer a 100% open source distribution of Apache Hadoop with no
proprietary software labeled with it.
Apache Spark is an open-source cluster-computing framework. Originally developed at the Berkeley's
AMPLab, University of California. The Spark codebase was in future contributed to the Apache Software
Foundation, which has retained it back.
Amazon Elastic MapReduce (Amazon EMR) is a sub-project under Amazon Web Services
(AWS). It is a web service that enables us to control those massive Big Data datasets. Amazon
EMR has an array full of security features that allow us to reliably and securely manage big data, log
analysis, web indexing, data warehousing, machine learning, financial analysis, scientific simulation, and