Integration with leading Hadoop distributions, object stores, NoSQL stores and analytic databases, as well as log file data and JSON/XML formats.

Visual design environment for blending multiple big data sources (see Figure 3) and processing data at scale.

Pentaho Is the Leading Solution for Big Data Integration and Analytics

Pentaho covers the entire big data life cycle, from data extraction and preparation of diverse data, to scalable processing on Spark and Hadoop, leading to end-to-end analytics solutions.
The Pentaho platform enables companies to realize business value from large volumes of diverse data by dramatically reducing the time and complexity required to design, develop and deploy big data analytics.

Native integration with the Lumada Data Catalog, a component of the Lumada DataOps Suite.

Enterprise-grade administration, scalability, load balancing and security capabilities.

Support for advanced analytic model development in R, Python, Scala and Weka, incorporating libraries such as scikit-learn, Spark MLlib, TensorFlow and Keras into the data flow.

Robust orchestration capabilities to coordinate complex workflows, including scheduling and alerts.

Direct access to complete analytics, including charts, visualizations and reporting, from any step of PDI.
Pentaho’s open, embeddable technology (see Figure 1) supports flexible analytics that both leverage existing data infrastructure and future-proof deployments against tomorrow’s inevitable changes. Pentaho is part of the Lumada DataOps Suite, which provides intelligent data management for digital innovation. Pentaho data integration and analytics technology enables organizations to access, prepare, and analyze all data from any source, in any environment.