About the services
Airbyte is the leading open-source ELT tool, created in July 2020. As of September 2021, they have built over 120 data connectors, and have 5,000 companies using them to sync data. Their ambition is to commoditize data integration by addressing the long tail of connectors through their growing contributor community. Airbyte released a Cloud offer in October 2021 with an infrastructure-type pricing model.
Singer is an open-source JSON-based ETL framework. It was introduced in 2017 by Stitch (which was acquired by Talend in 2018) as a way to offer extendibility to the connectors they had pre-built. Talend has unfortunately stopped investing in Singer’s community and providing maintenance for the Singer’s taps and targets, which are increasingly outdated.
Pre-built connectors are the primary way to differentiate ETL / ELT solutions, as they enable data teams to focus only on the insights to build.
Within 14 months from inception, Airbyte already offers connectors for more than 120 data sources, and all major data warehouses, lakes and databases as data destinations.
All Airbyte connectors are open-sourced and can be edited to address any custom needs the customers have. Airbyte users can leverage these connectors through the open-source edition or the Cloud offer.
Airbyte’s Connector Development Kit also enables their users to build custom connectors in a standardized way within 2 hours (instead of 2 days), and the Airbyte team and community can help in their maintenance.
About 30% of the connectors have been contributed by the growing community. Airbyte will provide a SLA for the certified connectors, but Airbyte’s ambition is also to provide a SLA for other connectors through the community and its participative model on the long tail of connectors, and to reach 1,000+ connectors in the next few years.
Airbyte will offer reverse-ETL connectors in 2022.
Talend (acquirer of StitchData) seems to have stopped investing in maintaining Singer’s community and connectors. As most connectors see schema changes several times a year, more and more Singer’s taps and targets are not actively maintained and are becoming outdated.
On Singer, each connector is its own open-source project. So you never know the quality of a tap or target until you have actually used it. There is no guarantee whatsoever about what you’ll get.
Finally, Singer’s connectors are standalone binaries: you still need to build everything around to make them work (e.g., UI, configuration validation, state management, normalization, schema migration, monitoring, etc.).
Airbyte is an ELT tool, and does not transform data prior to loading. Airbyte offers two options to get your data out of the box: a serialized JSON object and the normalized version of the record as tables.
Airbyte also offers custom transformations via SQL and through deep integration with dbt, allowing their users and customers to trigger their own dbt packages at the destination level right after the EL.
Singer doesn’t provide any transformation features.
Every company has custom data architectures and, therefore, unique data integration needs. A lot of tools don’t enable teams to address those, which results in a lot of investment in building and maintaining additional in-house scripts.
Airbyte’s architecture modularity implies that you can leverage any part of Airbyte. For instance, you can use Airflow’s orchestrator to trigger Airbyte’s ELT jobs.
You can also edit any pre-built connectors to your own specific needs, or even leverage the Connector Development Kit to build your own custom connectors in a matter of hours (instead of days) and have its maintenance shared with the community and the Airbyte team.
Airbyte’s promise is to address all your ELT needs and the long tail of integrations.
Being open source means you can leverage Singer’s taps and targets the way you want. But the lack of standardization across them makes it a difficult task to leverage those connectors to address your custom needs.
Support & docs
Data integration tools can be complex, so customers need to have great support channels. This includes online documentation as well as tutorials, email and chat support. More complicated tools may also offer training services.
Airbyte provides chat support directly on their web app, with an average time to respond of 1 hour.
Their documentation is comprehensive and full of tutorials.
Airbyte also has a Slack and Discourse community where help is available from the Airbyte team, other users or contributors.
Airbyte does not provide any training services.
Singer has a dying Slack community, and doesn’t provide any support. It has open-sourced documentation.
Airbyte provides a one-month free trial or $400 worth of credits, whichever expires first. Airbyte’s pricing is credit-based, and you consume credits based on compute time and, to a lesser extent, egress costs. Airbyte positions itself as a self-service infrastructure company.
This pricing structure adapts well to all use cases, including database replication.
Airbyte doesn’t charge for failed syncs or normalization.
Airbyte offers adapted pricing to customers with large volumes.
Singer’s premium service is Stitch. Please refer to the Airbyte vs. Stitch article for more details.