Clusters Introduction
With the Isthari Data as a Service platform you can easily define cluster for your Big Data Analytics project from an intuitive Web UI. You can choose from a wide selection of open source Big Data tools, including:
- Presto / Trino SQL
- Spark
- Spark SQL
- Spark Livy
- Jupyter
- Pyspark
- Spark R
- And many others
Metadata service
One key point for the success of your data lake is Data Governance. The Isthari Data as a Service platform includes a central Metadata Service for all your Big Data environments. Now your users can easily discover the data, the structure and the location.
This Metadata Service is based on the open source defacto standard: Hive Metastore
Serverless
Another vendors deploy clusters in the cloud 24x7. This is an inefficient, and costly, allocation of resources.
This Isthari Data as a Service unique serverless approach only deploy your clusters when really needed by users. This way you can greatly cut costs, while drastically improving time to market and time to value
Storaged detached from computing
Data is not attached to a single cluster, project or use case. Data and storage is a central piece of the Isthari Data as a Service platform. Different users, from different departments and projects can access the same data at the same time. This approach boost data democratization
Scalability
With the Isthari Data as a Service platform you can easily scale from one single use case to a full wide enterprise Data Lake. Everything from the same solution