Data Study & Tallying Workflow – Sundar Mishra
I believe that data study and data tallying are hard because you cannot solve data issues just by overworking. You have to fix them with a clear, calm mind and a smarter approach. Once you find a data pattern and learn some Excel functions, you can easily discover ways to do calculations with your data.
Yes, one funny thing is that in the first few days, you may even see formulas, numbers, or patterns in your dreams or in daily life — but these are normal.
Some of my ideas and workflows I have documented here.
Step 0: Data Import & Backup
AlwaysDescription:
Before saveworking aon rawdata, copyalways beforepreserve starting.
Document sourcewhere details:the filedata name,came reportfrom, name,note extractionits date,version (V1, V2, etc.), and keep backups. This avoids confusion when multiple files are generated or when older data is required later.
Step 1: Data Cleaning & Validation
CleanDescription:
Raw anddata standardizeusually contains extra spaces, inconsistent text, wrong data types, or formatting issues. Cleaning ensures the dataset is standardized and trustworthy, without alteringchanging its actual meaning. After cleaning, totals must match the original so accuracy is not lost.
-
TRIM()
→ Remove unwanted spaces. -
CLEAN()
→ Remove invisible/non-printable characters. -
UPPER()
,/LOWER()
,/PROPER()
→ Standardize text case. -
TEXT(
→)value, format_text)FormatConvertvaluesnumbers/dates(e.g.,intodates,consistentnumbers).format. -
VALUE()
→ Convert stored-as-text numberstointo numericformat.type. -
DATEVALUE()
→ Convert text datestointo realdates.date values. -
ROUND()
,ROUNDUP()
,ROUNDDOWN()
→EnsureControlconsistentdecimalsdecimal places.consistently. -
ISNUMBER()
,ISTEXT()
,ISERROR()
→ Validate data type.
🔑 Checks:Check:
- Use
TotalsSUM()
before&and after cleaning → totals mustmatchremain(
theSUM()check).same.Define correctdata types: Date, Number, Text.
Step 2: Understanding Data & Setting Goal
- cleaning,
whatVerifyunderstanddatathesourcedataset: where it came from, what it contains, andreliability.you - want
Identifyto achieve. Look fordata patternpatterns (repeatedfields,codes,keys,categories,categories)unique IDs). Once - the
Definegoalofistallyclear,(e.g.,planreconciling totals, validating balances). If needed, ask senior/AI toolshow tosolveachieve it. If unsure, learn the method first (from seniors or AI),notthento dosolve itfor you.yourself.
Description:
After
🔧 Helpful Formulas for Exploration:Formulas:
-
COUNT()
,→ Count numeric records. -
COUNTA()
,→ Count all non-blank cells. -
COUNTBLANK()
→ Countrecords.empty cells. -
UNIQUE()
→IdentifyExtract unique values (Excel 365+). -
SORT()
/SORTBY()
→DetectArrangeorderingdata&topatterns.spot trends. -
LEN()
→SpotCheck for inconsistent text lengths (e.g., IDcodes)digits). -
SEARCH()
/FIND()
→ Locate a substring inside text. -
LEFT()
,RIGHT()
,MID()
→ Extract parts of codes. -
FREQUENCY()
orMODE()
→ Detect repetition patterns.
Step 3: Tallying & Multi-way Verification
PerformDescription:
Now tallyperform tallying: compare, sum, and checkreconcile. inNever morerely thanon only one way.method of verification — check totals, subtotals, and record-level matches. If different checks give the same result, you can be confident the data is correct.
-
SUM()
→ Total values. -
SUBTOTAL()
→DynamicTotalstotalsthat(ignoresignore filtered/hiddenrows).rows. -
SUMIFS()
→ Conditionalsumming.totals. -
COUNTIFS()
→ Conditionalcounting.counts. -
VLOOKUP()
,/HLOOKUP()
→CompareLookup valuesacrossin other reports. -
INDEX() + MATCH()
→FlexibleMorelookups.flexible lookup alternative. -
XLOOKUP()
(ifExcelavailable)365+) → Modern lookupalternative.function. -
IF()
/IFS()
→ConditionalApplychecks.conditional logic. -
IFERROR()
→HandleReturnmismatchessafegracefully.output if formula errors. -
EXACT()
→ Comparetwotextfieldsvaluesexactly.exactly (case-sensitive). -
TEXTJOIN()
→ConcatenateCombine multiple fields forpatternmatching.
🔑 Cross-verificationVerification techniques:Techniques:
-
Totals vs. subtotals (
SUMIFS()
vs.vsSUM()
check)). -
Record-
levelto-record check (VLOOKUP()
/XLOOKUP()
). -
Duplicate
detectioncheck (COUNTIFS()
> 1). -
MismatchHighlighthighlightingmismatches(→ conditional formatting withformulas).formula rules.
Step 4: Finalization & Reporting
- verified,
Confirmthedatadatasetismusterror-freebe finalized andconsistent.reported. The report should be clean, summarized, and easy for others to read. Use pivots, charts, and formatting to present results clearly.
Description:
Once
🔧 Useful Reporting& Tools:
-
Pivot Table → Summarize large data quickly (
withtotals,SUM,COUNT,counts, %of Total)). -
ChartsPivotforChartvisualization.→ Visual representation of summaries. -
TEXT()
with→customPresent numbers in clean formatsfor(e.g.,clean"1,234.00").report -
Conditional Formatting → Highlight exceptions.
-
ROUND()
/INT()
→ For neat presentation of figures.
📌 Final Principle: Data is finalized only if multiple verification methods confirmagree.
Optional: Smart Work Habits
Optional: I’d recommend that if you are working with data, you should learn some very important Excel functions. This includes formulas, Pivot Tables, printing options, and keyboard shortcuts. Yes, learning these may be a little challenging at first, but once you master them, you will never get stuck on data problems.
Nobody is perfect in their work; they simply try to improve by learning and solving new issues. If you work with the same result.type of data repeatedly, you shouldn’t waste time recalculating it manually every time. Instead, you can create a template: paste your cleaned data in one place, and let the template perform comparisons or calculations automatically. This approach will help you work faster and more efficiently in data analysis.
✅ End Result: Cleaned,Clean, reliable, and thoroughly verified data ready for reporting, auditing, or deeperfurther analysis.