Suraj Nathani

Suraj has deep expertise across clouds (AWS, Azure, Google cloud, IBM Softlayer, and IBM Bluemix), BigData tools (Spark, Storm, Kafka, Beam and Impala), DevOps tools (Anisble, Packer, Terraform, CFM, ARM, Chef, Puppet, and Ansible), IoT services (AWS IoT, and Azure IoT), NoSQL databases (HBase, Cassandra, Redis, MongoDB, and Elasticsearch), OSes (Linux, Unix and Windows) and programming languages (Java, C# and Python).

Blogs and Articles

Lower TCO, Boost Query Performance: Run Hive on Spark in Amazon EMR

Blog

This blog was first published by same authors on Amazon APN Blogs. As mentioned in the first post in our series, Seagate Technology asked Mactores C...

Apr 23, 2020 by Suraj Nathani

Optimizing Presto SQL on Amazon EMR to Deliver Faster Query Processing

Blog

Seagate Technology is a United States-based data storage company with worldwide manufacturing facilities that generate huge amounts of manufacturing a...

Apr 13, 2020 by Suraj Nathani

Lower TCO and Increase Query Performance by Running Hive on Spark in A...

External

With petabytes of data accumulated over 20 years, and more being generated each day, it was imperative to have systems in ...

Apr 23, 2020 by Suraj Nathani

Optimizing Presto SQL on Amazon EMR to Deliver Faster Query Processing

External

We explain the three different migration options, the results each one produced, and the architecture of the solution ulti...

Apr 3, 2020 by Suraj Nathani