RapidMiner

  1. Import data
  • Task: Load your transactions file.
  • Operator: Import Dataset / Read CSV (Read Excel)
  1. Handle missing values
  • Task: Fill or remove rows with missing Items.
  • Operator: Replace Missing Values or Filter Examples
  1. Remove duplicates
  • Task: Drop exact duplicate transactions.
  • Operator: Remove Duplicates
  1. Fix mistaken values & trim spaces
  • Task: Correct typos and remove extra spaces from item names.
  • Operator: Map / Replace (or Generate Attributes); Trim (string)
  1. Ensure correct data types
  • Task: Make sure Items is a string/nominal column.
  • Operator: Change Data Type (if needed)
  1. Drop unneeded columns
  • Task: Remove Transaction ID or other irrelevant columns.
  • Operator: Select Attribute / Select Attributes
  1. Split Items into separate item columns
  • Task: Turn comma-separated Items into item columns.
  • Operator: Split (attribute = Items, separator = comma, mode = unordered_split)
  1. Convert split columns to binary form
  • Task: Make each item column show presence/absence (0/1 or true/false).
  • Operator: Nominal to Binomial (or Numerical to Binominal)
  1. (Optional) Clean column names
  • Task: Remove prefixes like Items_ from column names.
  • Operator: Rename by Replacing
  1. Find frequent itemsets (FP‑Growth)
  • Task: Run FP‑Growth with min support.
  • Operator: FP‑Growth
  • Key params: min requirement = support; min_support = (use relative like 0.22 or 0.5, or absolute like 2 or 5); positive value = true
  1. Output frequent sets
  • Task: Connect frequent_sets output to view results.
  • Operator: (use the FP‑Growth operator’s frequent sets → Result)
  1. Create association rules
  • Task: Generate rules from frequent itemsets using confidence threshold.
  • Operator: Create Association Rules
  • Key params: criterion = confidence; min_confidence = (e.g., 60 or 0)
  1. Verify & export results
  • Task: Check rule support/confidence; export results if needed.
  • Operator: Results panel → Export (CSV)

Quick mnemonics:

  • Clean → Split → Binary → FP‑Growth → Rules
  • Or: Import → Clean → Split → Binomial → FP‑Growth → Rules

Quick mapping table — Task → Altair operator → RapidMiner operator

  • Read/import data: Import Dataset → Read CSV/Read Excel
  • Replace missing / Fill missing: Replace Missing Values → Replace Missing Values / Filter Examples
  • Filter out missing rows: Filter Examples → Filter Examples
  • Remove duplicates: Remove Duplicate → Remove Duplicates
  • Correct mistaken values: Map / Generate Attributes → Replace / Generate Attributes / Map
  • Trim spaces: Trim (string) → Generate Attributes (trim function) / Replace
  • Change data type: Change Data Type → Nominal/Numeric converters
  • Drop column: Select Attribute → Select Attributes / Remove Attributes
  • Convert categorical → binary matrix: Split → Split / Generate Attributes / Text Processing
  • Convert nominal → binomial: Nominal to Binomial → Numerical to Binominal
  • Rename column parts: Rename by Replacing → Rename / Rename by Replacing
  • FP‑Growth: FP‑Growth → FP‑Growth
  • Create rules: Create Association Rules → Create Association Rules

Recent Articles

I BUILT THIS WITH WORDPRESS

Related Stories

Leave a Reply

Stay on op - Ge the daily news in your inbox