Data Wrangling

Limiting Data When Columns Change

by Julie Sebby
September 18, 2020 (updated September 21, 2020)
Data Wrangling, Spotfire
1 Comment

This week I worked with one of our engineers on a Spotfire project where he needed to filter out nulls and zeros from his visualizations. This should be easy, but users set the y-axis columns using a property control. Basic data limiting by expression was impossible because the columns kept changing. Limiting data when columns change is challenging but not impossible. There are two ways to do it. Read on to learn more.
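To make the problem concrete, here is a minimal sketch of a limiting expression that follows whichever column is currently selected. It is not necessarily either of the two approaches from the post. The helper name `limit_expression` and the column name "Oil Rate" are hypothetical, and in Spotfire the name would come from the document property behind the property control rather than from a Python function.

```python
def limit_expression(column_name):
    """Build a boolean expression that keeps rows where the column is neither null nor zero.

    Assumption: column_name is a plain column name without square brackets;
    this sketch does not escape special characters.
    """
    return "[{0}] Is Not Null And [{0}] <> 0".format(column_name)


# Hypothetical column name, standing in for the user's current y-axis selection.
print(limit_expression("Oil Rate"))
```

Because the expression is rebuilt from the selected name, the same null-and-zero rule applies no matter which column the user picks.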

How to Improve Spotfire Load Times

by Julie Sebby
July 21, 2020
Admin, Automation Services, Data Wrangling, Errors & Troubleshooting, Spotfire
2 Comments

Long load times are a problem all Spotfire users struggle with eventually. The first step to reducing load time is understanding which tables take the longest to load. Spotfire provides this information in the Help menu, but it’s not in a form that is easy to work with. This week, I needed to help one of our project teams optimize their Spotfire project, so I built an Excel template to parse this data. Now I know what to go after in order to improve Spotfire load times.

Spotfire Errors Querying Two MS Access Tables

by Julie Sebby
January 8, 2020 (updated January 9, 2020)
Data Wrangling, Errors & Troubleshooting, Spotfire

A coworker reached out to me this week with an interesting problem. He was trying to import a data set into Spotfire from MS Access…

How To Make Type Curve Inputs More Efficient

by Julie Sebby
October 14, 2019 (updated October 15, 2019)
Data Wrangling, IronPython, Spotfire
10 Comments

Type curve generators are standard in reservoir engineering workflows, but they are also a bit problematic. They require lots of inputs, which are time-consuming to build and fill out, and you can’t easily recreate them in other projects. Therefore, my next two posts will show you how to make type curve inputs more efficient. The first post will demonstrate how to populate type curve inputs with IronPython and toggle between different sets of inputs. The second post will go further by letting users load type curve settings into a table and then tell an R function which row to pull from the table and place in the type curve inputs. Read on for these great solutions!
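As a rough illustration of toggling between sets of inputs, the sketch below copies a named set of values into a properties mapping. It is plain Python, not the code from the posts: the preset names, input names, and values are all hypothetical, and the `doc_props` dictionary only stands in for the document properties that an IronPython script would set inside Spotfire.

```python
# Hypothetical named sets of type curve inputs; names and values are illustrative only.
PRESETS = {
    "Base Case": {"InitialRate": 500.0, "DeclineRate": 0.65, "BFactor": 1.1},
    "High Case": {"InitialRate": 800.0, "DeclineRate": 0.70, "BFactor": 1.3},
}


def apply_preset(properties, preset_name, presets=PRESETS):
    """Copy every input from the chosen preset into the properties mapping."""
    for name, value in presets[preset_name].items():
        properties[name] = value
    return properties


# A plain dict stands in for the document properties (an assumption of this sketch).
doc_props = {}
apply_preset(doc_props, "Base Case")
apply_preset(doc_props, "High Case")  # toggling overwrites the earlier values
```

Keeping each set of inputs in one place means switching scenarios is a single call instead of retyping every input.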

How to Build Spotfire Data Connections for Multiple Users

by Julie Sebby
September 20, 2019 (updated September 30, 2019)
Admin, Data Wrangling, Spotfire
3 Comments

This week’s post touches on a subject I’ve never written about before — data connections. More specifically, I’m going to explain how to build Spotfire data connections with a service account. Building data connections with a service account will allow you to create enterprise data connections that aren’t dependent on a single user’s credentials. Multiple users will be able to use data connections stored in the Spotfire library. If that catches your interest, read on.

What are key columns for linked data?

by Julie Sebby
September 12, 2019 (updated May 28, 2020)
Data Wrangling, Errors & Troubleshooting, Spotfire

Have you noticed key columns for linked data in Data Table Properties and wanted to know more?
Have you tried to use replace value transformations but found the options grayed out?
Have you ever lost the marked data behind tags when opening or closing a file?
Would you like to be able to edit values in a data table?

If any of these apply, read on to find out what key columns are and how to use them.

5 Tips for Troubleshooting Data Wrangling Problems

by Julie Sebby
May 8, 2019 (updated August 27, 2019)
Alteryx, Data Wrangling, Errors & Troubleshooting, Spotfire
1 Comment

I’ve never found a great way to explain what I do. Analogies help, right? The first analogy that pops into my head is a Rubik’s Cube. My day-to-day work feels like trying to solve a Rubik’s Cube over and over, and the cube is a big pile of dirty, messy data. Thus, much of my day-to-day work is troubleshooting data wrangling problems.

Some days, it’s incredibly satisfying, just as solving a Rubik’s Cube is. Other days, I want to rip the stickers off and throw it out the window. When I drafted this post, I was in “rip the stickers off” mode. I was frustrated with how long it took to solve data wrangling problems. When I get in that mindset, I want to share what I learn so other people don’t suffer as much. Thus, this post discusses my top 5 ways to approach troubleshooting data problems. These methods apply no matter what application you are working in, and whether the problem is a data wrangling problem or a logic problem. Read on for details.

How to Add Calculated Columns With TERR

by Julie Sebby
April 29, 2019 (updated August 27, 2019)
Data Wrangling, R/TERR, Spotfire
5 Comments

I know I said I would go back to IronPython posts. But I had to do a bit of TERR work this week that I…