String Transforms
The String transforms allow users to manipulate textual data, standardize formats, and enrich string columns for analysis or downstream processing.
Add Prefix
Adds a custom text prefix to each value in the selected column(s).
Best Situations to Use
Standardize identifiers or codes with common prefixes.
Add contextual labels to string columns for feature engineering.
Steps
Select a string column.
Open Transforms > String > Add Prefix.
Enable Output Column Prefix (optional) to create a new column.
Provide the Prefix Value.
Select one or multiple columns to apply.
Click Submit.
Add Suffix
Adds a custom text suffix to each value in the selected column(s).
Best Situations to Use
Append units, labels, or contextual text to string columns.
Standardize string formatting for reporting or analysis.
Steps
Select a string column.
Open Transforms > String > Add Suffix.
Enable Create New Column (optional).
Provide the Suffix Value.
Select one or multiple columns to apply.
Click Submit.
Change to Lowercase
Converts all values in a column to lowercase.
Best Situations to Use
Standardize text for case-insensitive comparisons.
Prepare data for text analytics, search, or deduplication.
Steps
Select a string column.
Open Transforms > String > Change to Lowercase.
Enable Create New Column (optional).
Click Submit.
Change to Title Case
Converts values so that the first letter of each word is capitalized.
Best Situations to Use
Improve readability and formatting of string data.
Standardize proper nouns or textual descriptions.
Steps
Select a string column.
Open Transforms > String > Change to Title Case.
Enable Create New Column (optional) and provide a name.
Click Submit.
Change to Upper Case
Converts all values in a column to uppercase.
Best Situations to Use
Standardize text for visual consistency or case-sensitive operations.
Prepare data for integration with other systems requiring uppercase text.
Steps
Select a string column.
Open Transforms > String > Change to Upper Case.
Enable Create New Column (optional) and provide a name.
Click Submit.
Note: If Create New Column is not enabled, the original column is modified.
Ends With
Checks if values in a column end with a specified string or pattern, returning a Boolean result.
Best Situations to Use
Flag rows with specific suffixes or patterns in string columns.
Prepare features for categorical analysis or text filtering.
Steps
Select a string column.
Open Transforms > String > Ends With.
Select Mode:
REGEX
orString
.Provide New Column Name (if creating a new column).
Enter the value or pattern.
Enable Ignore Case (optional).
Click Submit.
Example: Pattern .*pire
in REGEX mode → True
for values ending with pire
.
Extract Substring at Position
Extracts a substring from a string column based on a starting position and length.
Best Situations to Use
Retrieve specific portions of text for analysis or feature engineering.
Standardize or parse structured strings.
Steps
Select a string column.
Open Transforms > String > Extract Substring at Position.
Enable Create New Column (optional).
Provide New Column Name.
Specify Position (0-indexed start).
Specify Length of substring.
Click Submit.
Extract Substring before Delimiter
Extracts a substring from a string column before a specified delimiter occurrence.
Best Situations to Use
Parse structured strings like CSV, IDs, or codes.
Extract leading segments before separators.
Steps
Select a string column with delimiters.
Open Transforms > String > Extract Substring before Delimiter.
Enable Create New Column (optional).
Provide New Column Name.
Specify Delimiter.
Specify the Count of delimiter occurrences.
Click Submit.
Get Domain
Extracts the domain portion of a URL from a string column.
Best Situations to Use
Prepare URL data for analytics or aggregation by domain.
Feature engineering for web log or traffic analysis.
Steps
Select a URL column.
Open Transforms > String > Get Domain.
Click Submit.
Example: https://www.google.com
→ google
.
Get Host
Extracts the host (including subdomains) from a URL column, excluding path and query parameters.
Best Situations to Use
Analyze web traffic or sources by host.
Separate subdomains for detailed URL-based analytics.
Steps
Select a URL column.
Open Transforms > String > Get Host.
Click Submit.
Example: https://www.google.com
→ www.google.com
Get Subdomain Name
Extracts the subdomain portion from a URL column.
Best Situations to Use
Analyze subdomain-level traffic.
Prepare features for web analytics or segmentation.
Steps
Select a URL column.
Open Transforms > String > Get Subdomain Name.
Click Submit.
Example: https://www.google.com
→ www
.
Insert Character
Inserts a custom character at specified positions in a string column. Can update the original column or create a new one.
Best Situations to Use
Format codes, IDs, or textual values with prefixes, separators, or markers.
Standardize data for integration or display purposes.
Steps
Select a string column.
Open Transforms > String > Insert Character.
Enable Create New Column (optional).
Provide New Column Name.
Specify Position(s) to insert the character (comma-separated for multiple positions).
Enter the Character to insert.
Click Submit.
Example: Insert $
at position 0
→ $value
.
Negate Boolean Value
Flips the Boolean values in a column: True
→ False
, False
→ True
.
Best Situations to Use
Correct or invert Boolean flags in datasets.
Prepare features for conditional logic or ML models.
Steps
Select a Boolean column.
Open Transforms > String > Negate Boolean Value.
Click Submit.
Remove Accents
Removes accented characters from string columns, normalizing text.
Best Situations to Use
Standardize multilingual text for comparison or matching.
Prepare text for search, NLP, or downstream processing.
Steps
Select a string column.
Open Transforms > String > Remove Accents.
Enable Create New Column (optional).
Click Submit.
Remove Consecutive Characters
Removes repeated characters or whitespace from a column.
Best Situations to Use
Normalize text with duplicate characters or spaces.
Prepare data for clean textual analysis.
Steps
Select a column.
Open Transforms > String > Remove Consecutive Characters.
Enable Create New Column (optional).
Configure:
Separator:
Whitespace
orOther
Custom Repeated Character (if
Other
is selected)
Click Submit.
Remove Part of Text
Removes a specific substring from a column based on a condition.
Best Situations to Use
Delete unwanted patterns or prefixes/suffixes.
Standardize text before analysis.
Steps
Select a column.
Open Transforms > String > Remove Part of Text.
Enable Create New Column (optional).
Specify:
Operator: contains, equals, starts with, ends with, regex
Value: Text or pattern to remove
Click Submit.
Remove Trailing and Leading Characters
Removes leading and trailing characters, including spaces or custom padding characters.
Best Situations to Use
Clean text with extraneous padding.
Standardize identifiers or codes for processing.
Steps
Select a column.
Open Transforms > String > Remove Trailing and Leading Characters.
Enable Create New Column (optional).
Configure:
Padding Character: Space or Other
Custom Character (if Other selected)
Ignore Case (optional)
Click Submit.
Search & Replace
Searches for a value or pattern in a column and replaces it with another value.
Best Situations to Use
Correct inconsistent text entries.
Standardize strings before analysis or aggregation.
Steps
Select a column.
Open Transforms > String > Search & Replace.
Enable Create New Column (optional).
Configure:
Operator: contains, equals, starts with, ends with, regex
Value: Text to search
Replace With: Text to replace
Overwrite Entire Cell (optional)
Click Submit.
Split String
Splits a column into multiple columns based on a delimiter or specific position.
Best Situations to Use
Separate structured text like codes, CSVs, or compound identifiers.
Extract multiple components from a single column.
Steps
Select a column.
Open Transforms > String > Split String.
Configure:
Use With: Delimiter or Position
Delimiter: Character to split on (if selected)
Position: Character position to split (if selected)
Click Submit.
Starts With
Checks if values in a column start with a specified string or pattern, returning a Boolean result.
Best Situations to Use
Flag rows with specific prefixes or patterns.
Prepare features for categorical analysis or filtering.
Steps
Select a string column.
Open Transforms > String > Starts With.
Enable Create New Column (optional).
Configure:
Mode: REGEX or String
Value: Text or pattern to check
Ignore Case (optional)
Click Submit.
Last updated