Suraj Nathani

Suraj has deep expertise across clouds (AWS, Azure, Google cloud, IBM Softlayer, and IBM Bluemix), BigData tools (Spark, Storm, Kafka, Beam and Impala), DevOps tools (Anisble, Packer, Terraform, CFM, ARM, Chef, Puppet, and Ansible), IoT services (AWS IoT, and Azure IoT), NoSQL databases (HBase, Cassandra, Redis, MongoDB, and Elasticsearch), OSes (Linux, Unix and Windows) and programming languages (Java, C# and Python).
Find me on:

Recent Posts

Lower TCO and Increase Query Performance by Running Hive on Spark in Amazon EMR

Apr 23, 2020 8:30:00 AM / by Suraj Nathani posted in Data Analytics, Migration, Amazon EMR


This blog was first published by same authors on Amazon APN Blogs. 

As mentioned in the first post in our series, Seagate Technology asked Mactores Cognition to evaluate and deliver a data platform to process petabytes of data with consistent performance, lower query processing time, lower total cost of ownership (TCO), and the scalability required to support about 2,000 daily users.

Read More

Optimizing Presto SQL on Amazon EMR to Deliver Faster Query Processing

Apr 13, 2020 8:54:00 AM / by Suraj Nathani posted in Data Analytics, Big Data


Seagate Technology is a United States-based data storage company with worldwide manufacturing facilities that generate huge amounts of manufacturing and testing data.

Read More