splitting-datasets

Rating: 5.8

Installs: 148

Split datasets into training, validation, and testing sets for ML model development. Use when the user requests "split dataset", "train-test split", or "data partitioning", or uses closely related phrases.

data-splitting


Machine Learning

Quick Review

This skill clearly describes its dataset-splitting functionality, with good coverage of use cases and examples. The description adequately explains when and how to invoke the skill for train/validation/test splits. Task knowledge appears sufficient: the referenced resources (split_data.py, config files, and examples) would contain the implementation details. The structure is reasonable, with a clear overview, though SKILL.md includes some generic boilerplate sections that add clutter. Novelty is limited, however: dataset splitting is a straightforward task that CLI agents can accomplish in relatively few tokens with standard libraries such as scikit-learn, so the cost-reduction benefit is modest for this common ML operation.
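To give a sense of how little code the task needs, here is a minimal sketch of a three-way split. It is not the skill's split_data.py, whose contents are not shown here; the function name `split_dataset`, its parameters, and the 70/15/15 default ratio are assumptions chosen for illustration. The sketch uses only the Python standard library so it runs anywhere; scikit-learn's `train_test_split` covers the same ground and adds options such as stratification.

```python
import random


def split_dataset(rows, val_frac=0.15, test_frac=0.15, seed=42):
    """Shuffle rows and partition them into train, validation, and test lists.

    Hypothetical helper for illustration; names and defaults are assumptions.
    """
    if val_frac < 0 or test_frac < 0 or val_frac + test_frac >= 1:
        raise ValueError("val_frac and test_frac must be >= 0 and sum to < 1")

    # Copy before shuffling so the caller's data is left untouched.
    shuffled = list(rows)
    # A seeded generator makes the split reproducible across runs.
    random.Random(seed).shuffle(shuffled)

    n_test = round(len(shuffled) * test_frac)
    n_val = round(len(shuffled) * val_frac)

    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]  # everything left over
    return train, val, test


train, val, test = split_dataset(range(100))
print(len(train), len(val), len(test))  # 70 15 15
```

Fixing the seed keeps the three sets stable between runs, which matters when validation and test results need to be compared across experiments.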