aguynamedryan bookmarks: data - page: 1urn:uuid:{136C771E-3E6F-B26B-0A1D-A7719C18D59E}2024-03-28T22:28:37Zdatafuselabs/datafuse: An elastic and scalable Cloud Warehouse, offers Blazing Fast Query and combines Elasticity, Simplicity, Low cost of the Cloud, built to make the Data Cloud easy7477012021-08-16T01:32:14ZZ810aguynamedryanapache/arrow-datafusion: Apache Arrow DataFusion and Ballista query engines7477002021-08-16T01:31:11ZZ810aguynamedryanq - Text as Data6843752021-06-14T15:33:23ZZ810aguynamedryandata-cleaning/validate: Professional data validation for the R environment6841092021-05-23T03:44:53ZZ810aguynamedryanropensci/skimr: A frictionless, pipeable approach to dealing with summary statistics6841082021-05-23T03:44:32ZZ810aguynamedryanchoonghyunryu/dlookr: Tools for Data Diagnosis, Exploration, Transformation6841072021-05-23T03:43:59ZZ810aguynamedryandata-cleaning/dcmodify: Modify data records using separately defined modification rules6841062021-05-23T03:43:08ZZ810aguynamedryandata-cleaning/deductive: Methods for deductive data correction and imputation6841052021-05-23T03:42:42ZZ810aguynamedryandata-cleaning/errorlocate: Find and replace erroneous fields in data using validation rules6841042021-05-23T03:41:43ZZ810aguynamedryanIntroducing Amazon S3 Object Lambda – Use Your Code to Process Data as It Is Being Retrieved from S3 | AWS News Blog6834162021-04-16T17:07:32ZZ810aguynamedryanUsing S3 Object Lambdas to Generate and Transform on the fly | by Eoin Shanaghy | Mar, 2021 | Medium6785822021-04-01T02:42:07ZZ810aguynamedryanA Data Pipeline Is a Materialized View | Hacker News5740692021-03-01T03:40:03ZZ810aguynamedryanEstuary Flow (Preview) — Estuary Flow (Preview) documentation5740682021-03-01T03:39:33ZZ810aguynamedryanBuilding Rich Terminal Dashboards | Hacker News5740622021-03-01T03:30:59ZZ810aguynamedryanShow HN: I wrote a book about using data science to solve “everyday” problems | Hacker News5740612021-03-01T03:28:57ZZ810aguynamedryanHierarchical Structures in PostgreSQL4884452021-01-20T22:19:40ZZ810aguynamedryanDatasette: An open source multi-tool for exploring and publishing data4852482020-12-23T19:21:20ZZ810aguynamedryanUsing PostgreSQL and SQL to Randomly Sample Data4365962020-10-28T16:22:58ZZ810aguynamedryanWikidata:SPARQL query service/A gentle introduction to the Wikidata Query Service - Wikidata4263002020-10-26T16:57:54ZZ810aguynamedryanSimple Anomaly Detection Using Plain SQL | Haki Benita3881172020-09-25T17:45:53ZZ810aguynamedryanDuckDB - An embeddable SQL OLAP database management system3663872020-08-21T15:29:03ZZ810aguynamedryanmlin/GenomicSQLite: Genomics Extension for SQLite3663862020-08-21T15:28:40ZZ810aguynamedryanRunning Awk in parallel to process 256M records3217572020-06-05T17:22:49ZZ810aguynamedryanTXR Language3095842020-04-19T18:46:37ZZ810aguynamedryanWhat's new in Kiba ETL v3 (visually explained)2907222020-03-12T23:01:49ZZ810aguynamedryanthewhitetulip/awk-anti-textbook: learn awk by example2852742020-03-09T18:10:46ZZ810aguynamedryanPreparing your Postgres data for scale-out - DEV Community 2832372020-02-26T16:23:22ZZ810aguynamedryanIn Loving Memory of Strictly-Typed Schemas - ssense-tech - Medium2831752020-02-21T19:38:53ZZ810aguynamedryanCommand Line Tricks For Data Scientists2777992019-10-04T21:42:44Z2021-06-14T08:56:28Z810aguynamedryan