dask

8.7

280

Parallel/distributed computing. Scale pandas/NumPy beyond memory, parallel DataFrames/Arrays, multi-file processing, task graphs, for larger-than-RAM datasets and parallel workflows.

parallel computing

8.7

Rating

Installs

Data & Analytics

Quick Review

Exceptional skill documentation for Dask parallel computing. The SKILL.md provides comprehensive coverage of when and how to use each Dask component (DataFrames, Arrays, Bags, Futures, Schedulers) with clear decision guides, practical examples, and critical performance rules. The structure is exemplary—concise overview with well-organized sections while deferring deep details to referenced files. Task knowledge is thorough, including workflow patterns, common pitfalls, debugging strategies, and integration considerations. The skill is highly novel as parallel/distributed computing with proper scheduler selection, chunking strategies, and memory management would consume substantial tokens and require deep expertise if done ad-hoc by a CLI agent. Minor room for improvement in the description field itself (could be slightly more explicit about the five components), but overall this is a well-crafted, production-ready skill that meaningfully reduces complexity and token costs for large-scale data processing tasks.

LLM Signals

Description coverage9

Task knowledge10

Structure10

Novelty8

GitHub Signals

18,073

1,635

132

Last commit 0 days ago

Publisher

davila7

Skill Author

Related Skills

spark-engineer pandas-pro xlsx

Loading SKILL.md…

Try onlineView on GitHub

Publisher

davila7

Skill Author

Related Skills

spark-engineer

Jeffallan

6.4

pandas-pro

Jeffallan

6.4

xlsx

mrgoonie

7.2

infographic-syntax-creator

antvis

6.8

Try online

Improve