close
The Wayback Machine - https://web.archive.org/web/20210214114912/https://github.com/topics/dataops
Skip to content
#

dataops

Here are 40 public repositories matching this topic...

flyte
kumare3
kumare3 commented Feb 8, 2021

Motivation: Why do you think this is important?
Flytekit should support Vaex as a pandas alternative for FlyteSchema object.
https://github.com/vaexio/vaex

Vaex has great performance on a single machine, which is usually needed for most datasets. Spark & dask are overkill with lots of complexity for datasets of sizes in few gigabytes. Addition of Vaex and support for automatic serializati

Streaming reference architecture for ETL with Kafka and Kafka-Connect. You can find more on http://lenses.io on how we provide a unified solution to manage your connectors, most advanced SQL engine for Kafka and Kafka Streams, cluster monitoring and alerting, and more.

  • Updated Feb 13, 2021
  • Scala

Improve this page

Add a description, image, and links to the dataops topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dataops topic, visit your repo's landing page and select "manage topics."

Learn more