- Import data
- Task: Load your transactions file.
- Operator: Import Dataset / Read CSV (Read Excel)
- Handle missing values
- Task: Fill or remove rows with missing Items.
- Operator: Replace Missing Values or Filter Examples
- Remove duplicates
- Task: Drop exact duplicate transactions.
- Operator: Remove Duplicates
- Fix mistaken values & trim spaces
- Task: Correct typos and remove extra spaces from item names.
- Operator: Map / Replace (or Generate Attributes); Trim (string)
- Ensure correct data types
- Task: Make sure Items is a string/nominal column.
- Operator: Change Data Type (if needed)
- Drop unneeded columns
- Task: Remove Transaction ID or other irrelevant columns.
- Operator: Select Attribute / Select Attributes
- Split Items into separate item columns
- Task: Turn comma-separated Items into item columns.
- Operator: Split (attribute = Items, separator = comma, mode = unordered_split)
- Convert split columns to binary form
- Task: Make each item column show presence/absence (0/1 or true/false).
- Operator: Nominal to Binomial (or Numerical to Binominal)
- (Optional) Clean column names
- Task: Remove prefixes like Items_ from column names.
- Operator: Rename by Replacing
- Find frequent itemsets (FP‑Growth)
- Task: Run FP‑Growth with min support.
- Operator: FP‑Growth
- Key params: min requirement = support; min_support = (use relative like 0.22 or 0.5, or absolute like 2 or 5); positive value = true
- Output frequent sets
- Task: Connect frequent_sets output to view results.
- Operator: (use the FP‑Growth operator’s frequent sets → Result)
- Create association rules
- Task: Generate rules from frequent itemsets using confidence threshold.
- Operator: Create Association Rules
- Key params: criterion = confidence; min_confidence = (e.g., 60 or 0)
- Verify & export results
- Task: Check rule support/confidence; export results if needed.
- Operator: Results panel → Export (CSV)
Quick mnemonics:
- Clean → Split → Binary → FP‑Growth → Rules
- Or: Import → Clean → Split → Binomial → FP‑Growth → Rules
Quick mapping table — Task → Altair operator → RapidMiner operator
- Read/import data: Import Dataset → Read CSV/Read Excel
- Replace missing / Fill missing: Replace Missing Values → Replace Missing Values / Filter Examples
- Filter out missing rows: Filter Examples → Filter Examples
- Remove duplicates: Remove Duplicate → Remove Duplicates
- Correct mistaken values: Map / Generate Attributes → Replace / Generate Attributes / Map
- Trim spaces: Trim (string) → Generate Attributes (trim function) / Replace
- Change data type: Change Data Type → Nominal/Numeric converters
- Drop column: Select Attribute → Select Attributes / Remove Attributes
- Convert categorical → binary matrix: Split → Split / Generate Attributes / Text Processing
- Convert nominal → binomial: Nominal to Binomial → Numerical to Binominal
- Rename column parts: Rename by Replacing → Rename / Rename by Replacing
- FP‑Growth: FP‑Growth → FP‑Growth
- Create rules: Create Association Rules → Create Association Rules
