The Apache Software Foundation (ASF) offers a wide range of tools, libraries, frameworks, and data stores for building enterprise applications. The purpose of this list is to keep track of what they are and what their current status is, with a strict focus on enterprise apps.
This document is a constant work in progress and will probably never be finished, not because I am lazy, but because all these projects move fast. I monitor comments closely, so do not hesitate to report errors or send comments and updates. If you think important information about a project is missing, feel free to ask for it.
Name | Apache Accumulo |
Website | http://accumulo.apache.org/ |
Twitter | @ApacheAccumulo |
Categories | Big Data, Tools, Data Store |
Description | Apache Accumulo™ is a sorted, distributed key/value store that provides robust, scalable data storage and retrieval.
Apache Accumulo is based on the design of Google’s BigTable and is powered by Apache Hadoop, Apache ZooKeeper, and Apache Thrift. |
Last update on JGP.net | |
VP | |
Production Version | 1.7.2 |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
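To make "sorted, distributed key/value store" a little more concrete, here is a minimal single-process sketch of the sorted part only. It is not Accumulo's API: the class `SortedKeyValueStore` and its `put` and `scan` methods are hypothetical names chosen for illustration, and distribution, persistence, and security are left out entirely.

```python
import bisect


class SortedKeyValueStore:
    """Toy in-memory sorted key/value store (illustrative only, not Accumulo's API)."""

    def __init__(self):
        self._keys = []    # keys, always kept in sorted order
        self._values = {}  # key -> value

    def put(self, key, value):
        # Insert new keys at their sorted position; overwrite existing ones.
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = value

    def scan(self, start, end):
        """Yield (key, value) pairs with start <= key < end, in key order."""
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, end)
        for key in self._keys[lo:hi]:
            yield key, self._values[key]


store = SortedKeyValueStore()
store.put("row2", "b")
store.put("row1", "a")
store.put("row3", "c")
result = list(store.scan("row1", "row3"))
```

Because keys stay sorted, a range scan is two binary searches plus a contiguous read, which is the property that makes range queries cheap in stores of this family.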
Name | Apache ACE |
Website | http://ace.apache.org |
Categories | Framework, OSGi, Tools |
Description | Apache ACE™ is a software distribution framework that allows you to centrally manage and distribute software components, configuration data, and other artifacts to target systems. It is built using OSGi and can be deployed in different topologies. The target systems are usually also OSGi based, but don’t have to be. |
Last update on JGP.net | |
VP | |
Production Version | 2.1.0 |
Last release date | Tuesday, February 9, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Ambari |
Website | http://ambari.apache.org/ |
Categories | Infrastructure |
Description | Apache Ambari™ aims at making Hadoop management simpler.
Ambari enables system administrators to provision, manage, and monitor a Hadoop cluster. Ambari provides an intuitive, easy-to-use Hadoop management web UI backed by its RESTful APIs. |
Last update on JGP.net | |
VP | Yusaku Sako |
Production Version | 2.2.2 |
Last release date | Monday, May 9, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | Hortonworks, Red Hat, IBM, WANdisco |
Committers | Hortonworks, Pivotal, Red Hat, BlueTalon, IBM, WANdisco, eBay, Rutgers University, Microsoft, HP, Teradata, Rackspace, EMC, The Ohio State University, Carnegie Mellon University |
Name | Apache Apex |
Website | http://apex.apache.org/ |
Twitter | @ApacheApex |
Categories | Big Data, Tools, Stream |
Description | Apache Apex™ is an enterprise-grade unified stream and batch processing engine. Apex is a Hadoop add-on that processes big data in motion in a way that is highly scalable, highly performant, fault tolerant, stateful, secure, distributed, and easily operable. |
Last update on JGP.net | |
VP | Thomas Weise |
Production Version | 3.4.0 |
Last release date | Wednesday, May 25, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | DataTorrent |
Committers | DataTorrent, Google |
Name | Apache Arrow |
Website | http://arrow.apache.org/ |
Categories | Big Data, Analytics |
Description | Apache Arrow™ powers columnar and in-memory analytics.
Arrow enables execution engines to take advantage of the latest SIMD (Single Instruction, Multiple Data) operations included in modern processors, for native vectorized optimization of analytical data processing. Columnar layout of data also allows for better use of CPU caches by placing all data relevant to a column operation in as compact a format as possible. |
Last update on JGP.net | |
VP | Jacques Nadeau |
Production Version | |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
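The columnar layout described for Arrow can be sketched in plain Python. This is only an illustration of the layout idea, under the assumption that each column is stored as one contiguous typed buffer. It does not use the Arrow libraries, and pure Python gets no SIMD benefit; the field names `id` and `price` are made up for the example.

```python
from array import array

# Row-oriented layout: each record carries all of its fields.
rows = [
    {"id": 1, "price": 9.5},
    {"id": 2, "price": 3.0},
    {"id": 3, "price": 7.5},
]

# Columnar layout: one contiguous, typed buffer per column.
columns = {
    "id": array("q", (r["id"] for r in rows)),        # 64-bit signed integers
    "price": array("d", (r["price"] for r in rows)),  # 64-bit floats
}

# A column operation reads only the buffer it needs, packed back to back.
total_price = sum(columns["price"])
price_bytes = len(columns["price"]) * columns["price"].itemsize
```

Summing `price` never touches the `id` values, which is the cache-friendliness argument made in the description.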
Name | Apache AsterixDB |
Website | http://asterixdb.apache.org/ |
Twitter | @ASTERIXUCI |
Categories | Big Data, Data Store |
Description | Apache AsterixDB™ is a scalable Big Data Management System (BDMS). AsterixDB offers a flexible data model, distributed storage and transactions, fast data ingestion, a scalable, data-parallel query execution runtime, and a declarative query language.
AsterixDB supports various storage and indexing options, including managed datasets, external datasets (e.g., data on HDFS – Hadoop File System), and secondary indexes. |
Last update on JGP.net | |
VP | Till Westmann |
Production Version | |
Last release date | |
Development Version | 0.8.8-incubating |
Last development drop date | |
Backers (PMC) | UC Irvine, UC Riverside, Oracle |
Committers | Couchbase, UC Irvine, NTNU Trondheim, UC Riverside, Hebrew University, HPI, Microsoft, KACST Saudi Arabia, X15 Software |
Name | Apache Aurora |
Website | http://aurora.apache.org/ |
Twitter | @ApacheAurora |
Categories | Infrastructure, Framework |
Description | Apache Aurora™ is a Mesos framework for long-running services and cron jobs.
Aurora runs applications and services across a shared pool of machines, and is responsible for keeping them running, forever. When machines experience failure, Aurora intelligently reschedules those jobs onto healthy machines. |
Last update on JGP.net | |
VP | Bill Farner |
Production Version | |
Last release date | |
Development Version | 0.14.0 |
Last development drop date | Tuesday, June 14, 2016 |
Backers (PMC) | |
Committers |
Name | Apache Avro |
Website | http://avro.apache.org/ |
Categories | Data, Data Store, RPC |
Description | Apache Avro™ is a data serialization system that provides rich data structures; a compact, fast, binary data format; a container file to store persistent data; Remote Procedure Call (RPC); and simple integration with dynamic languages.
Code generation is not required to read or write data files nor to use or implement RPC protocols. Code generation can be done as an optimization, but it is only worth implementing for statically typed languages. Avro relies on schemas. When Avro data is read, the schema used when writing it is always present. This permits each datum to be written with no per-value overheads, making serialization both fast and small. This also facilitates use with dynamic, scripting languages, since data, together with its schema, is fully self-describing. When Avro data is stored in a file, its schema is stored with it, so that files may be processed later by any program. If the program reading the data expects a different schema, this can be easily resolved, since both schemas are present. When Avro is used in RPC, the client and server exchange schemas in the connection handshake. This can be optimized so that, for most calls, no schemas are actually transmitted. Since client and server each have the other’s full schema, correspondence between same-named fields, missing fields, extra fields, etc. can all be easily resolved. |
Last update on JGP.net | |
VP | |
Production Version | 1.8.1 |
Last release date | Thursday, May 19, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
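The way Avro resolves differences between the writer's and the reader's schema (same-named fields, missing fields, extra fields) can be sketched as follows. This is a simplified illustration in plain Python, not the Avro library: the `resolve` function, the `REQUIRED` marker, and the sample field names are hypothetical, and real Avro schemas are JSON documents with types, which are ignored here.

```python
REQUIRED = object()  # marker for reader fields that declare no default


def resolve(record, reader_fields):
    """Project a record written with the writer's schema onto the reader's fields.

    reader_fields maps each field name to its default, or REQUIRED if it has none.
    """
    resolved = {}
    for name, default in reader_fields.items():
        if name in record:
            resolved[name] = record[name]   # same-named field: take the written value
        elif default is not REQUIRED:
            resolved[name] = default        # missing in the written data: use the default
        else:
            raise ValueError(f"no value or default for field {name!r}")
    return resolved                         # fields only the writer knows are ignored


written = {"name": "Ada", "age": 36, "legacy_id": 7}
reader_fields = {"name": REQUIRED, "age": REQUIRED, "email": "unknown"}
resolved = resolve(written, reader_fields)
```

Here `legacy_id` exists only on the writer side and is dropped, while `email` exists only on the reader side and is filled from its default.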
Name | Apache Axis2 |
Website | http://axis.apache.org/axis2/java/core/ |
Categories | Infrastructure |
Description | Apache Axis2™ is a Web Services, SOAP, and WSDL engine, the successor to the widely used Apache Axis SOAP stack. There are two implementations of the Apache Axis2 Web services engine: Apache Axis2/Java and Apache Axis2/C. |
Last update on JGP.net | |
VP | Deepal Jayasinghe |
Production Version | 1.7.3 |
Last release date | Monday, May 30, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | Georgia Institute of Technology |
Committers | WSO2, IBM, Sonic Software, Indiana University, Lanka Software Foundation, Georgia Institute of Technology, Brazil Outsource, HP, Computer Associates, Sosnoski Software, Purdue University, UC Santa Barbara |
Name | Apache Bahir |
Website | http://bahir.apache.org/ |
Categories | Analytics, Big Data |
Description | Apache Bahir™ provides extensions to distributed analytic platforms such as Apache Spark. |
Last update on JGP.net | |
VP | Luciano Resende |
Production Version | |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | NASA, Talend, IBM, Cloudera, Yahoo!, Hortonworks |
Committers | Databricks, UC Berkeley, NASA, Netflix, Alluxio, QuestTec, Talend, Cloudera, NTT Data, IBM, ClearStory Data, Yahoo!, Clover Health, Hortonworks |
Name | Apache Bigtop |
Website | http://bigtop.apache.org/ |
Categories | Big Data, Infrastructure |
Description | Apache Bigtop™ is a project targeted for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and configuration of the leading open source big data components. Bigtop supports a wide range of components/projects, including, but not limited to, Hadoop, HBase and Spark. |
Last update on JGP.net | |
VP | Olaf Flebbe |
Production Version | 1.1.0 |
Last release date | |
Development Version | 1.2.0-SNAPSHOT |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Calcite |
Website | http://calcite.apache.org/ |
Categories | Data Store |
Description | Apache Calcite™ is a dynamic data management framework.
It contains many of the pieces that comprise a typical database management system, but omits some key functions: storage of data, algorithms to process data, and a repository for storing metadata. Calcite intentionally stays out of the business of storing and processing data. This makes it an excellent candidate for mediating between applications and one (or more) data storage locations and data processing engines. |
Last update on JGP.net | |
VP | Julian Hyde |
Production Version | 1.8.0 |
Last release date | Monday, June 13, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Cassandra |
Website | http://cassandra.apache.org/ |
Categories | Data Store |
Description | Apache Cassandra™ is a NoSQL database focusing on scalability and high availability, without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data. Cassandra’s support for replicating across multiple datacenters is best-in-class, providing lower latency for your users and the peace of mind of knowing that you can survive regional outages.
Cassandra’s data model offers the convenience of column indexes with the performance of log-structured updates, strong support for denormalization and materialized views, and powerful built-in caching. |
Last update on JGP.net | |
VP | Jonathan Ellis |
Production Version | 3.0.7, 2.2.6 |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | DataStax |
Committers | DataStax |
Name | Apache Cayenne |
Website | http://cayenne.apache.org/ |
Categories | Java Library, Tools |
Description | Apache Cayenne™ provides Object-Relational Mapping (ORM) and remoting services. With a wealth of unique and powerful features, Cayenne can address a wide range of persistence needs. Cayenne seamlessly binds one or more database schemas directly to Java objects, managing atomic commit and rollbacks, SQL generation, joins, sequences, and more. With Cayenne’s Remote Object Persistence, those Java objects can even be persisted out to clients via Web Services.
Cayenne is designed to be easy to use, without sacrificing flexibility or design. To that end, Cayenne supports database reverse engineering and generation, as well as a Velocity-based class generation engine. All of these functions can be controlled directly through a GUI tool called CayenneModeler. No cryptic XML or annotation based configuration is required! An entire database schema can be mapped directly to Java objects within minutes, all from the comfort of the GUI-based CayenneModeler. Cayenne supports numerous other features, including caching, a complete object query syntax, relationship pre-fetching, on-demand object and relationship faulting, object inheritance, database auto-detection, and generic persisted objects. Most importantly, Cayenne can scale up or down to virtually any project size. With a mature, 100% open source framework, an energetic user community, and a track record of solid performance in high-volume environments, Cayenne is an exceptional choice for persistence services. |
Last update on JGP.net | |
VP | Andrus Adamchik |
Production Version | 3.1.1 |
Last release date | Monday, May 16, 2016 |
Development Version | 4.0m3 |
Last development drop date | Friday, February 12, 2016 |
Backers (PMC) | ObjectStyle |
Committers | ObjectStyle |
Name | Apache Chukwa |
Website | http://chukwa.apache.org/ |
Categories | Infrastructure, Tools |
Description | Apache Chukwa™ is a data collection system for monitoring large distributed systems. Chukwa is built on top of the Hadoop Distributed File System (HDFS) and Map/Reduce framework and inherits Hadoop’s scalability and robustness. Chukwa also includes a flexible and powerful toolkit for displaying, monitoring and analyzing results to make the best use of the collected data. |
Last update on JGP.net | |
VP | Eric Yang |
Production Version | |
Last release date | |
Development Version | 0.7.0 |
Last development drop date | Monday, December 21, 2015 |
Backers (PMC) | UC Berkeley, Twitter, IBM, Riot Games, GyroLabs, Intel, Xing Cloud, Pragsis, CMU |
Committers |
Name | Apache CloudStack |
Website | http://cloudstack.apache.org/ |
Categories | Infrastructure |
Description | Apache CloudStack™ is designed to deploy and manage large networks of virtual machines, as a highly available, highly scalable Infrastructure as a Service (IaaS) cloud computing platform. CloudStack is used by service providers to offer public cloud services, and by many companies to provide an on-premises (private) cloud offering, or as part of a hybrid cloud solution.
CloudStack is a turnkey solution that includes the entire “stack” of features most organizations want with an IaaS cloud: compute orchestration, Network-as-a-Service, user and account management, a full and open native API, resource accounting, and a first-class User Interface (UI). CloudStack currently supports the most popular hypervisors: VMware, KVM, Citrix XenServer, Xen Cloud Platform (XCP), Oracle VM server, and Microsoft Hyper-V. Users can manage their cloud with an easy to use Web interface, command line tools, and/or a full-featured RESTful API. In addition, CloudStack provides an API that’s compatible with AWS EC2 and S3 for organizations that wish to deploy hybrid clouds. |
Last update on JGP.net | |
VP | Will Stevens |
Production Version | 4.8.0.1 |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | CloudOps |
Committers | CloudOps |
Name | Apache Commons |
Website | http://commons.apache.org/ |
Categories | Java Library |
Description | Apache Commons™ is a project focused on all aspects of reusable Java components. |
Last update on JGP.net | |
VP | |
Production Version | Varies by sub-project |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache CouchDB |
Website | http://couchdb.apache.org/ |
Categories | Data Store |
Description | Apache CouchDB™ is a database that uses JSON for documents, JavaScript for MapReduce indexes, and regular HTTP for its API. |
Last update on JGP.net | |
VP | |
Production Version | 1.6.1 |
Last release date | Wednesday, September 3, 2014 |
Development Version | 2.0.0 |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Crunch |
Website | http://crunch.apache.org/ |
Categories | Big Data, Tools |
Description | The Apache Crunch™ Java library provides a framework for writing, testing, and running MapReduce pipelines. Its goal is to make pipelines that are composed of many user-defined functions simple to write, easy to test, and efficient to run.
Running on top of Hadoop MapReduce and Apache Spark, the Apache Crunch library is a simple Java API for tasks like joining and data aggregation that are tedious to implement on plain MapReduce. The APIs are especially useful when processing data that does not fit naturally into a relational model, such as time series, serialized object formats like protocol buffers or Avro records, and HBase rows and columns. |
Last update on JGP.net | |
VP | |
Production Version | |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache cTAKES |
Website | http://ctakes.apache.org/ |
Categories | Java Library, NLP |
Description | Apache cTAKES™ is a natural language processing system for extraction of information from electronic medical record clinical free-text. cTAKES can discover codable entities, temporal events, properties and relations. It relies on customized pipelines, which can be used with any Apache UIMA compatible component. |
Last update on JGP.net | |
VP | |
Production Version | |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Curator |
Website | http://curator.apache.org/ |
Twitter | @ApacheCurator |
Categories | Big Data, Tools |
Description | Apache Curator™ is an Apache ZooKeeper keeper: it provides implementations of some of the common ZooKeeper “recipes”. The implementations are built on top of the Curator Framework, which is a high-level API that greatly simplifies using ZooKeeper. It adds many features that build on ZooKeeper and handles the complexity of managing connections to the ZooKeeper cluster and retrying operations. |
Last update on JGP.net | |
VP | |
Production Version | |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
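The "retrying operations" part of what Curator handles can be illustrated with a small retry loop. This is a deliberately simplified sketch in Python, not Curator's Java API: `retry_with_backoff`, its parameters, and `flaky_read` are hypothetical names, and the wait simply doubles on each attempt.

```python
import time


def retry_with_backoff(operation, max_retries=3, base_sleep=0.01, sleep=time.sleep):
    """Call operation(); on ConnectionError, retry up to max_retries times,
    doubling the wait before each new attempt."""
    for attempt in range(max_retries + 1):
        try:
            return operation()
        except ConnectionError:
            if attempt == max_retries:
                raise  # out of retries: surface the failure to the caller
            sleep(base_sleep * (2 ** attempt))


attempts = []


def flaky_read():
    # Simulates a connection that fails twice before succeeding.
    attempts.append(1)
    if len(attempts) < 3:
        raise ConnectionError("connection lost")
    return "value"


result = retry_with_backoff(flaky_read)
```

Wrapping every call this way by hand is tedious and error-prone, which is the kind of boilerplate a higher-level client library is meant to remove.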
Name | Apache CXF |
Website | http://cxf.apache.org/ |
Categories | Java Library, Tools |
Description | Apache CXF™ is a services-oriented framework. CXF helps you build and develop services using frontend programming APIs, like JAX-WS and JAX-RS. These services can speak a variety of protocols such as SOAP, XML/HTTP, RESTful HTTP, or CORBA and work over a variety of transports such as HTTP, JMS or JBI. |
Last update on JGP.net | |
VP | |
Production Version | 3.1.6 |
Last release date | Tuesday, March 29, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache DB |
Website | http://db.apache.org/ |
Categories | Data Store, Tools, ORM |
Description | Apache DB™ is charged with the creation and maintenance of commercial-quality database solutions based on software licensed to the Foundation, for distribution at no charge to the public. It includes several subprojects.
DdlUtils is a small, easy-to-use component for working with Database Definition (DDL) files. These are XML files that contain the definition of a database schema, e.g. tables and columns. Apache Derby™ is a relational database implemented entirely in Java and available under the Apache License. Some key advantages include a small footprint (about 2.6 megabytes for the base engine and embedded JDBC driver), a basis in the Java, JDBC, and SQL standards, and an embedded JDBC driver that lets you embed Derby in any Java-based solution. Derby is derived from Informix Cloudscape. Apache JDO™ is a standard way to access persistent data in databases, using plain old Java objects (POJO) to represent persistent data. Apache Torque is an object-relational mapper (ORM) for Java. It lets you access and manipulate data in a relational database using Java objects, without using reflection to access user-provided classes, but by generating the necessary classes (including the Data Objects) from an XML schema describing the database layout. |
Last update on JGP.net | |
VP | |
Production Version | Varies by sub-project |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache DeltaSpike |
Website | http://deltaspike.apache.org |
Categories | Security, Tools |
Description | Apache DeltaSpike™ is a collection of portable CDI (Contexts and Dependency Injection) extensions. These ready-to-use modules enable you to integrate tested API extensions into your Java projects.
DeltaSpike consists of a core module and a number of optional modules for providing additional enterprise functionality to your applications. The modules include features for enhanced security with type-safe control over method invocations, integration with schedulers, injection of CDI objects into validators, a transactional context and scope, and much more. |
Last update on JGP.net | |
VP | |
Production Version | 1.7.1 |
Last release date | Wednesday, July 20, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache DeviceMap |
Website | http://devicemap.apache.org/ |
Categories | Data |
Description | Apache DeviceMap™ is a data repository containing device information, images and other relevant information for all sorts of mobile devices, e.g. smartphones and tablets. While the focus is on the data, a set of APIs exists to use and manage it. |
Last update on JGP.net | |
VP | |
Production Version | 1.0.3 |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Directory |
Website | http://directory.apache.org/ |
Categories | Security, Tools, LDAP |
Description | The Apache Directory™ strives to increase LDAP awareness, comfort, and adoption to bring forth what we call the Modern LDAP Renaissance. Directory provides directory solutions entirely written in Java, including a directory server, which has been certified as LDAP v3 compliant by the Open Group (ApacheDS), and Eclipse-based directory tools (Apache Directory Studio). |
Last update on JGP.net | |
VP | |
Production Version | Varies by sub-project |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Drill |
Website | http://drill.apache.org/ |
Categories | Data, Big Data, Analytics, Tools |
Description | Apache Drill is a schema-free SQL query engine for Hadoop, NoSQL, and cloud storage.
Drill lets you get insights faster, without the usual overhead (data loading, schema creation and maintenance, transformations, etc.). |
Last update on JGP.net | |
VP | |
Production Version | 1.7.0 |
Last release date | Tuesday, June 28, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Empire-db |
Website | http://empire-db.apache.org/ |
Categories | Java Library |
Description | Apache Empire-db™ is a relational database abstraction layer and data persistence component that allows developers to take a much more SQL-centric approach in application development than traditional Object-Relational Mapping (ORM) frameworks, like JPA or Hibernate. By providing a unique, type-safe, object-oriented command API, Empire-db allows building highly efficient SQL statements that take full advantage of all database features while eliminating the need for error-prone string operations and literals. This, together with DBMS-independent record and metadata management, leads to an unprecedented level of ease of use and compile-time safety. |
Last update on JGP.net | Tuesday, August 9, 2016 |
VP | |
Production Version | 2.4.4 |
Last release date | Tuesday, August 18, 2015 |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Falcon |
Website | http://falcon.apache.org/ |
Categories | Infrastructure, Big Data |
Description | Apache Falcon™ is a feed processing and management system aimed at making it easier for end consumers to onboard their feed processing and management on Hadoop clusters. |
Last update on JGP.net | Tuesday, August 9, 2016 |
VP | |
Production Version | |
Last release date | |
Development Version | 0.10 |
Last development drop date | Monday, August 8, 2016 |
Backers (PMC) | InMobi, Hortonworks, Talend |
Committers | InMobi, Hortonworks, Talend |
Name | Apache Felix |
Website | http://felix.apache.org/ |
Categories | Framework, OSGi, Tools |
Description | Apache Felix™ is an implementation of the OSGi framework and service platform, as well as other interesting OSGi-related technologies. The OSGi specifications originally targeted embedded devices and home services gateways, but they are ideally suited for any project interested in the principles of modularity, component-orientation, and/or service-orientation. OSGi technology combines aspects of these aforementioned principles to define a dynamic service deployment framework that is amenable to remote management.
It is composed of AutoConf Resource Processor, Bundle Repository, Configuration Admin, Connect, Coordinator, Dependency Manager 4, Deployment Admin, Event Admin, File Install, Framework Security, Framework, Gogo Command, Gogo Runtime, Gogo Shell, HTTP Service API, HTTP Service Base, HTTP Service Bridge, HTTP Service Bundle, HTTP Service Cometd, HTTP Service Jetty, HTTP Service Proxy, HTTP Service SSL filter, HTTP Service Whiteboard, HTTP Servlet 2.6 + 3.0 API, Inventory Printer, iPOJO (Core), iPOJO Annotations, iPOJO API, iPOJO Architecture Command for Gogo, iPOJO Architecture Command for the Felix Shell, iPOJO BND Plugin, iPOJO Composite, iPOJO Event Admin handler, iPOJO Extender pattern handler, iPOJO JMX handler, iPOJO Manipulator Project (containing all iPOJO manipulator modules), iPOJO Manipulator, iPOJO Runtime Project (containing all iPOJO runtime modules), iPOJO Temporal Dependency handler, iPOJO WebConsole Plugin, iPOJO Whiteboard pattern handler, Jaas Support Boot Classpath Jar, Jaas Support Bundle, junit4osgi – shell command, junit4osgi – swing gui, junit4osgi, Lightweight HTTP Service Complete, Lightweight HTTP Service Core, Log, Main, Metatype, OSGi OBR service API, Preferences, Remote Shell, Resolver, SCR (Declarative Services), SCR Annotations, SCR Bnd Plugin, SCR Compat (Declarative Services), SCR DS Annotations, SCR Ext Anno, SCR Generator, Shell Text UI, Shell, UPnP Base Driver (JDK 1.3), UPnP Base Driver (JDK 1.4+), UPnP Extra, UPnP Tester, User Admin, User Admin File-based store, User Admin MongoDB-based store, Utils, Web Console, Web Console (all-in-one bundle), Web Console DS Plugin, Web Console Event Admin Plugin, Web Console Memory Usage Plugin, Web Console OBR Plugin, Web Console PackageAdmin Plugin, Web Console Script Console Plugin, Web Console ServiceDiagnostics Plugin, Web Console Subsystems Plugin, Web Console UPNP Plugin, and Web Console User Admin Plugin. Each module has its own version and might be more recent than the core module. |
Last update on JGP.net | Tuesday, August 9, 2016 |
VP | |
Production Version | 5.4.0 (Core Framework) |
Last release date | Friday, October 16, 2015 |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Flex |
Website | http://flex.apache.org/ |
Twitter | @ApacheFlex |
Categories | UI |
Description | Apache Flex™ is a highly productive, open source application framework for building and maintaining expressive web applications that deploy consistently on all major browsers, desktops and devices (including smartphones, tablets and TV). It provides a modern, standards-based language and programming model that supports common design patterns suitable for developers from many backgrounds. Flex applications can be deployed to the ubiquitous Adobe® Flash® Player in the browser, Adobe® AIR™ on desktop and mobile or to native Android™, iOS™, QNX®, Windows® or Mac® applications. |
Last update on JGP.net | Tuesday, August 9, 2016 |
VP | Alex Harui |
Production Version | Apache Flex SDK: 4.15 (January 2016); Apache FlexJS SDK: 0.6.0 (April 2016); Blaze DS: 4.7.0 (March 2015); Flex Mavenizer: 4.8.0 (January 2013); FlexUnit: 4.2.0 (April 2014); SDK Installer: 3.2.0 (June 2015); Squiggly: 1.1 (October 2014); Tour De Flex: 1.2 (November 2014) |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Flink |
Website | http://flink.apache.org/ |
Twitter | @ApacheFlink |
Categories | Infrastructure, Big Data |
Description | Apache Flink™ is a platform for distributed stream and batch data processing.
Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams. |
Last update on JGP.net | Tuesday, August 9, 2016 |
VP | |
Production Version | 1.1.0 |
Last release date | Monday, August 8, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Flume |
Website | http://flume.apache.org/ |
Categories | Big Data, ETL, Infrastructure |
Description | Apache Flume™ is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic application. |
Last update on JGP.net | Tuesday, August 9, 2016 |
VP | |
Production Version | 1.6.0 |
Last release date | Wednesday, May 20, 2015 |
Development Version | |
Last development drop date | |
Backers (PMC) | Apple, Cloudera, CyberAgent, Data Fueled, Hortonworks, Nextiva, ScalingData, StreamSets, Vanderbilt University, Zymergen |
Committers | Apple, Cloudera, Databricks, CyberAgent, Data Fueled, Hortonworks, Nextiva, ScalingData, StreamSets, Vanderbilt University, Zymergen |
Name | Apache Forrest |
Website | http://forrest.apache.org/ |
Categories | Infrastructure, Publication |
Description | Apache Forrest™ software is a publishing framework that transforms input from various sources into a unified presentation in one or more output formats. The modular and extensible plug-in architecture of Apache Forrest is based on Apache Cocoon and the relevant industry standards that separate presentation from content. Forrest can generate static documents, or be used as a dynamic server, or be deployed by its automated facility. |
Last update on JGP.net | Tuesday, August 9, 2016 |
VP | |
Production Version | |
Last release date | |
Development Version | 0.9 |
Last development drop date | Saturday, July 2, 2011 |
Backers (PMC) | |
Committers |
Name | Apache Geronimo |
Website | http://geronimo.apache.org/ |
Categories | Application Server |
Description | Apache Geronimo™ is a server runtime that integrates the best open source projects to create Java/OSGi server runtimes that meet the needs of enterprise developers and system administrators. Our most popular distribution is a fully certified Java EE 6 application server runtime. |
Last update on JGP.net | Monday, September 12, 2016 |
VP | |
Production Version | 3.0.1 |
Last release date | Tuesday, May 28, 2013 |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Giraph |
Website | http://giraph.apache.org/ |
Categories | Analytics, Big Data |
Description | Apache Giraph™ is an iterative graph processing system built for high scalability. For example, it is currently used at Facebook to analyze the social graph formed by users and their connections. |
Last update on JGP.net | Monday, September 12, 2016 |
VP | |
Production Version | 1.1.0 |
Last release date | Wednesday, November 19, 2014 |
Development Version | 1.2.0-SNAPSHOT |
Last development drop date | Wednesday, August 31, 2016 |
Backers (PMC) | |
Committers |
Name | Apache Gora |
Website | http://gora.apache.org/ |
@ApacheGora | |
Categories | Data, Data Store, Big Data |
Description | Apache Gora™ provides an in-memory data model and persistence for big data. Gora supports persisting to column stores, key value stores, document stores and RDBMSs, and analyzing the data with extensive Apache Hadoop™ MapReduce support. |
Last update on JGP.net | Monday, September 12, 2016 |
VP | |
Production Version | |
Last release date | |
Development Version | 0.6.1 |
Last development drop date | Monday, September 14, 2015 |
Backers (PMC) | |
Committers |
Name | Apache Hadoop |
Website | http://hadoop.apache.org/ |
Categories | Data Store, Big Data |
Description | Apache Hadoop® offers open-source software for reliable, scalable, distributed computing.
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures. |
Last update on JGP.net | |
VP | Chris (Christopher) Douglas |
Production Version | 2.6.4 |
Last release date | Thursday, February 11, 2016 |
Development Version | |
Last development drop date | |
Backers (PMC) | NTT DATA, Hortonworks, InMobi, Cloudera, Yahoo!, Microsoft, Intel, Facebook, Jive, WANdisco, Twitter, Huawei, Uber |
Committers | NTT DATA, Getopt, Hortonworks, Cloudera, InMobi, Yahoo!, Altiscale, eBay, LinkedIn, Huawei, WANdisco, Microsoft, Intel, Intel, IBM, Twitter, UC Berkeley, Pivotal, INRIA, Uber |
Name | Apache Kafka |
Website | http://kafka.apache.org/ |
@apachekafka | |
Categories | Infrastructure, Tools, Messaging |
Description | Apache Kafka™ is publish/subscribe messaging designed as a distributed commit log.
A single Kafka broker can handle hundreds of megabytes of reads and writes per second from thousands of clients. It is designed to allow a single cluster to serve as the central data backbone for a large organization. It can be elastically expanded without downtime. Data streams are partitioned and spread over a cluster of machines to allow data streams larger than the capability of any single machine and to allow clusters of coordinated consumers. Messages are persisted on disk and replicated within the cluster to prevent data loss. Each broker can handle terabytes of messages without performance impact. |
Last update on JGP.net | |
VP | Jun Rao |
Production Version | |
Last release date | |
Development Version | 0.10.0 |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache Kudu |
Website | http://kudu.apache.org/ |
@ApacheKudu | |
Categories | Big Data, Analytics |
Description | Apache Kudu™ completes Hadoop’s storage layer to enable fast analytics on fast data.
Streamlined Architecture – Kudu provides a combination of fast inserts/updates and efficient columnar scans to enable multiple real-time analytic workloads across a single storage layer. As a new complement to HDFS and Apache HBase, Kudu gives architects the flexibility to address a wider variety of use cases without exotic workarounds. Faster Analytics – Kudu is specifically designed for use cases that require fast analytics on fast (rapidly changing) data. Engineered to take advantage of next-generation hardware and in-memory processing, Kudu lowers query latency significantly for Apache Impala (incubating) and Apache Spark (initially, with other execution engines to come). |
Last update on JGP.net | Wednesday, February 1, 2017 |
VP | Todd Lipcon |
Production Version | 1.2.0 |
Last release date | Wednesday, January 18, 2017 |
Development Version | |
Last development drop date | |
Backers (PMC) | Cloudera |
Committers | Cloudera |
Name | Apache Mesos |
Website | http://mesos.apache.org/ |
Categories | Infrastructure, Resource Management |
Description | Apache Mesos™ lets you program against your datacenter as if it were a single pool of resources. Mesos abstracts CPU, memory, storage, and other compute resources away from machines (physical or virtual), enabling fault-tolerant and elastic distributed systems to be built easily and run effectively.
Mesos is built using the same principles as the Linux kernel, only at a different level of abstraction. The Mesos kernel runs on every machine and provides applications (e.g., Hadoop, Spark, Kafka, Elasticsearch) with APIs for resource management and scheduling across entire datacenter and cloud environments. |
Last update on JGP.net | |
VP | Benjamin Hindman |
Production Version | |
Last release date | |
Development Version | 0.28.2 |
Last development drop date | |
Backers (PMC) | |
Committers | Facebook, Mesosphere, Northeastern University, Twitter, UC Berkeley, YouTube, Databricks, Carnegie Mellon University, NASA JPL, NASA, Groupon, Red Hat |
Name | Apache Metron |
Website | http://metron.apache.org/ |
@ApacheMetron | |
Categories | Infrastructure, Tools, Big Data, Security |
Description | Apache Metron™ provides a scalable advanced security analytics framework built with the Hadoop community, evolving from the Cisco OpenSOC project. Metron is a cyber security application framework that gives organizations the ability to detect cyber anomalies and respond rapidly to them. |
Last update on JGP.net | |
VP | |
Production Version | |
Last release date | |
Development Version | 0.2.0-beta-RC2 |
Last development drop date | |
Backers (PMC) | Hortonworks |
Committers | Hortonworks |
Name | Apache Myriad |
Website | http://myriad.incubator.apache.org/ |
@ApacheMyriad | |
Categories | Big Data, Tools, Resource Management |
Description | Apache Myriad™ bridges the gap between Apache Mesos and YARN (the resource manager in Apache Hadoop). It is still in incubation, but it is a promising piece of technology for dynamic resource allocation.
The project describes itself as follows: “Apache Myriad enables the co-existence of Apache Hadoop and Apache Mesos on the same physical infrastructure. By running Hadoop YARN as a Mesos framework, YARN applications and Mesos frameworks can run side-by-side, dynamically sharing cluster resources.” |
Last update on JGP.net | |
VP | |
Production Version | |
Last release date | |
Development Version | |
Last development drop date | |
Backers (PMC) | |
Committers |
Name | Apache NiFi |
Website | http://nifi.apache.org/ |
Categories | Infrastructure, Big Data |
Description | Apache NiFi™ supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic.
NiFi was built to automate the flow of data between systems. This kind of problem has been around since enterprises had more than one system, where some of the systems created data and some of the systems consumed data. NiFi aims to simplify Enterprise Integration Patterns. NiFi includes a web-based user interface, is highly configurable, and tracks data flows from beginning to end (data provenance). It is designed for extension. Security is built in, with support for SSL, SSH, HTTPS, encrypted content, and pluggable role-based authentication/authorization. |
Last update on JGP.net | |
VP | Joe Witt |
Production Version | |
Last release date | |
Development Version | 0.6.1 |
Last development drop date | |
Backers (PMC) | Hortonworks |
Committers | Hortonworks |
Name | Apache POI |
Website | http://poi.apache.org/ |
Categories | Java Library |
Description | The Java API for handling Microsoft Documents. The Apache POI™ Project’s mission is to create and maintain Java APIs for manipulating various file formats based upon the Office Open XML standards (OOXML) and Microsoft’s OLE 2 Compound Document format (OLE2). In short, you can read and write Microsoft Excel files using Java. In addition, you can read and write Microsoft Word and Microsoft PowerPoint files using Java. Apache POI is your Java Excel solution (for Excel 97-2008). We have a complete API for porting other OOXML and OLE2 formats and welcome others to participate.
OLE2 files include most Microsoft Office files such as XLS (Excel), DOC (Word), and PPT (PowerPoint) as well as MFC serialization API based file formats. The project provides APIs for the OLE2 Filesystem (POIFS) and OLE2 Document Properties (HPSF). Office OpenXML Format is the new standards based XML file format found in Microsoft Office 2007 and later. This includes XLSX, DOCX and PPTX. POI provides a low level API to support the Open Packaging Conventions. |
Last update on JGP.net | |
VP | Dominik Stadler |
Production Version | 3.14 |
Last release date | Saturday, March 5, 2016 |
Development Version | 3.15-beta1 |
Last development drop date | Friday, April 15, 2016 |
Backers (PMC) | |
Committers |
Name | Apache Spark |
Website | http://spark.apache.org/ |
Categories | Data Store, Big Data |
Description | Apache Spark™ is a fast and general engine for large-scale data processing. |
Last update on JGP.net | |
VP | Matei Zaharia |
Production Version | 2.0.0 |
Last release date | Tuesday, July 26, 2016 |
Development Version | 2.0.1 |
Last development drop date | |
Backers (PMC) | IBM, Databricks, University of Michigan, Intel, Facebook, Yahoo!, UC Berkeley |
Committers | Databricks, University of Michigan, Intel, UC Berkeley, Facebook, Yahoo!, Bizo, ClearStory Data, QuestTec B.V., IBM, Quantifind, Alluxio, Hortonworks, Webtrends, Cloudera, Clover Health, NTT Data, Netflix, Alibaba, MIT |
Name | Apache Sqoop |
Website | http://sqoop.apache.org/ |
Categories | Big Data, ETL |
Description | Apache Sqoop™ is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. |
Last update on JGP.net | |
VP | Jarek Jarcec Cecho |
Production Version | 1.4.6 |
Last release date | Friday, April 24, 2015 |
Development Version | 1.99.6 |
Last development drop date | |
Backers (PMC) | Cloudera, Apple, StreamSets, NASA, NASA JPL, Dell Software, Zymergen, Ecetera, Pivotal |
Committers | Cloudera, Apple, StreamSets, JPL, Dell Software, Zymergen, Ecetera, Pivotal, Persistent Systems Limited, Google, WANdisco, Intel |
Name | Apache ZooKeeper |
Website | http://zookeeper.apache.org/ |
Categories | Infrastructure |
Description | Apache ZooKeeper™ is an open-source server, which enables highly reliable distributed coordination.
ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. All of these kinds of services are used in some form or another by distributed applications. Each time they are implemented, a lot of work goes into fixing the bugs and race conditions that are inevitable. Because of the difficulty of implementing these kinds of services, applications initially usually skimp on them, which makes them brittle in the presence of change and difficult to manage. Even when done correctly, different implementations of these services can lead to management complexity when the applications are deployed. |
Last update on JGP.net | |
VP | Flavio Junqueira |
Production Version | 3.4.8 |
Last release date | Saturday, February 20, 2016 |
Development Version | 3.5.1-alpha |
Last development drop date | Monday, August 31, 2015 |
Backers (PMC) | MapR Technologies, RentTheRunway, Cloudera, Confluent, Hortonworks, Facebook, Midokura, Nicira |
Committers | MapR Technologies, RentTheRunway, Cloudera, Confluent, Hortonworks, Facebook, Midokura, Nicira, Intel, Google, Twitter |