TacoSkill LAB
TacoSkill LAB
HomeSkillHubCreatePlaygroundSkillKit
© 2026 TacoSkill LAB
AboutPrivacyTerms
  1. Home
  2. /
  3. SkillHub
  4. /
  5. vaex
Improve

vaex

8.7

by davila7

196Favorites
273Upvotes
0Downvotes

Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that don't fit in memory.

big data

8.7

Rating

0

Installs

Data & Analytics

Category

Quick Review

Excellent skill for big data processing with Vaex. The description clearly articulates when to use this skill (large tabular datasets exceeding RAM), and SKILL.md provides comprehensive guidance with a well-organized structure. The six capability areas are logically presented with clear pointers to reference files, enabling a CLI agent to easily determine which references to load for specific tasks. The Quick Start Pattern and Common Patterns sections provide actionable code examples. Task knowledge is thorough, covering DataFrames, processing, performance, visualization, ML, and I/O with best practices. Structure is exemplary - concise overview in SKILL.md with detailed content delegated to references. Novelty is strong: processing billion-row datasets efficiently requires specialized knowledge that would consume many tokens if a CLI agent attempted this without the skill. Minor deduction on novelty only because some basic DataFrame operations overlap with pandas, though the out-of-core and performance optimization aspects are clearly differentiated and valuable.

LLM Signals

Description coverage9
Task knowledge10
Structure10
Novelty8

GitHub Signals

18,073
1,635
132
71
Last commit 0 days ago

Publisher

davila7

davila7

Skill Author

Related Skills

spark-engineerpandas-proxlsx

Loading SKILL.md…

Try onlineView on GitHub

Publisher

davila7 avatar
davila7

Skill Author

Related Skills

spark-engineer

Jeffallan

6.4

pandas-pro

Jeffallan

6.4

xlsx

mrgoonie

7.2

infographic-syntax-creator

antvis

6.8
Try online