dataops

Motivation: Why do you think this is important?
Flytekit should support Vaex as a pandas alternative for FlyteSchema objects.
https://github.com/vaexio/vaex

Vaex performs very well on a single machine, which is usually all that most datasets need. Spark and Dask are overkill and add a lot of complexity for datasets of a few gigabytes. Addition of Vaex and support for automatic serializati

It would be great to make the REST timeout configurable. I have run into timeouts while making requests to a schema registry with a large number of schemas.

It currently seems to be hardcoded here:
https://github.com/cloudhut/kowl/blob/5b135edb5f237f5600742514c001150fa93ac5fe/frontend/src/state/backendApi.ts#L19
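
One way to picture the request is a timeout that is read from configuration and falls back to a default when nothing valid is set. The sketch below is in Python purely for illustration (the linked file is TypeScript), and the environment variable name REST_TIMEOUT_SECONDS, the function name, and the default value are all assumptions, not anything taken from the kowl code base.

```python
import os
from typing import Mapping

# Hypothetical default; NOT the value hardcoded in the linked file.
DEFAULT_TIMEOUT_SECONDS = 25.0


def resolve_timeout(env: Mapping[str, str] = os.environ,
                    key: str = "REST_TIMEOUT_SECONDS") -> float:
    """Return the timeout from `env`, or the default if unset or invalid."""
    raw = env.get(key)
    if raw is None or raw.strip() == "":
        return DEFAULT_TIMEOUT_SECONDS
    try:
        value = float(raw)
    except ValueError:
        # Unparseable input: fall back instead of failing at startup.
        return DEFAULT_TIMEOUT_SECONDS
    # Reject zero, negative, and NaN values.
    return value if value > 0 else DEFAULT_TIMEOUT_SECONDS
```

A caller with a large schema registry could then set the variable to a higher value without a rebuild, while everyone else keeps the default.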

See:

position
positionUTF8
positionCaseInsensitive
positionCaseInsensitiveUTF8

etc...
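
A rough sketch of what folding these variants into one function could look like: two boolean parameters select the underlying function name. The helper names position_function and position_expr and the keyword names case_insensitive and utf8 are hypothetical and are not taken from the vulkn API; only the four function names listed above come from the issue.

```python
def position_function(case_insensitive: bool = False, utf8: bool = False) -> str:
    """Map the two flags onto one of the four position* function names."""
    name = "position"
    if case_insensitive:
        name += "CaseInsensitive"
    if utf8:
        name += "UTF8"
    return name


def position_expr(haystack: str, needle: str, *,
                  case_insensitive: bool = False, utf8: bool = False) -> str:
    """Build the call expression as text (hypothetical helper, no escaping)."""
    func = position_function(case_insensitive=case_insensitive, utf8=utf8)
    return f"{func}({haystack}, {needle})"


print(position_expr("col", "'abc'", case_insensitive=True, utf8=True))
# positionCaseInsensitiveUTF8(col, 'abc')
```

With this shape, a new variant becomes one more keyword argument instead of one more method per combination.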

In a thread in the datahackersbr Slack, the following was proposed:

Split the project into folders: framework, endpoint, and a config.py file.
framework will contain the API logic and an endpoint factory that loads all endpoints set in config.py.
The user will only add files to endpoint and edit the config.py file.
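
The proposed layout can be sketched as a small factory that reads a mapping from config and imports each handler by name. Everything here is an assumption made for illustration: the ENDPOINTS mapping, the "module:function" string format, and the load_endpoints name do not come from the project. Standard-library targets stand in for files in the endpoint folder so the sketch runs on its own.

```python
import importlib
from typing import Callable, Dict

# Hypothetical stand-in for config.py: route -> "module:function".
# In the proposed layout the targets would live in the endpoint/ folder;
# stdlib functions are used here only to keep the example self-contained.
ENDPOINTS: Dict[str, str] = {
    "/dumps": "json:dumps",
    "/sqrt": "math:sqrt",
}


def load_endpoints(config: Dict[str, str]) -> Dict[str, Callable]:
    """Import every handler named in `config` and return a routing table."""
    routes: Dict[str, Callable] = {}
    for route, target in config.items():
        module_name, _, func_name = target.partition(":")
        module = importlib.import_module(module_name)
        routes[route] = getattr(module, func_name)
    return routes


routes = load_endpoints(ENDPOINTS)
print(routes["/sqrt"](9.0))  # 3.0
```

Under this design the framework code never changes when an endpoint is added; only the mapping in config and the new handler file do.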


dataops

Here are 40 public repositories matching this topic...

lensesio / fast-data-dev

flyteorg / flyte

[Feature][Flytekit Plugin] Vaex Dataframe plugin

[Feature]Provide a AWS LaunchStack button for Flyte

[Feature][Flytekit] Task and Workflow docstrings should be converted to descriptions

cloudhut / kowl

Make rest timeout configurable

lensesio / stream-reactor

whylabs / whylogs-python

taivop / awesome-data-annotation

lensesio / lenses-docker

blueapron / kafka-connect-protobuf-converter

sernst / cauldron

lensesio / kafka-helm-charts

VulknData / vulkn

String().position* variants should be parameters to a single function

Tests

lensesio / lenses-go

tkleykamp / DataOps

DCMSstats / eesectors

opensource-advocates / datamanagement

blueapron / kafka-connect-jdbc

hokstack / hok-helm

axsaucedo / scalable-data-science

leomaurodesenv / data-science-api-framework

User experience facility

Data-Culpa / openclients

tulibraries / cob_datapipeline

korniichuk / workflow

lensesio / lenses-cloud-templates

WeR-stats / workshop-setup_cloud_machine_data_science

kids-first / kf-lambda-quality-reports

datacoon / awesome-dataops

tulibraries / manifold_airflow_dags

nellaivijay / Awesome-AIML-Data-Ops

tulibraries / funcake_dags

stephlocke / dataops
