Scan columns for keyword and extract all values in that column if the keyword is found in that column

306 views

Skip to first unread message

A S

unread,

Sep 25, 2019, 4:09:50 AM9/25/19

to python-excel

Hi there,

Thank you for spending your time here and helping out, I really appreciate it.

Currently, I have multiple excel files to loop through.

I am only looping through columns C:D. If the column has the keyword "Abbreviation", I want to extract all the values in that column.

This is because my keyword could exist in either column C or D

My columns will look something like this:

Col C Col D

Abbreviation	Description
urre_no	sdS
BELLO	helllo

Col C Col D

Description	Abbreviation
hehehe	abc
PEARRR	ab33_2_
FJF	a_rr
kkkkk	IIRR_VF
llkl	jk_ff

After importing my excel files, here is how I loop through the columns that I want to scan through:

wb1 = load_workbook(join(dict_folder, file), data_only = True)

ws = wb1.active

for rowofcellobj in ws["C":"D"]:

for cellobj in rowofcellobj:

if cellobj.value == "Abbreviation":

# extract all words in that column but Idk how to execute this step or if my above steps are correct

if cellobj.value is not None:

data = re.findall(r"\b\w+_.*?\w+|[A-Z]*$\b", str(cellobj.value))

#filtering out blank rows here:

if data != [ ]:

if data != [' ']:

#extracting words from square brackets in list:

fields = data[0]

print(fields)

I am stuck at the area which I had commented in red..

Chris Wakibi

unread,

Sep 25, 2019, 5:07:43 AM9/25/19

to python...@googlegroups.com

Hi,

You are supposed to loop through the column that has the "Keyword" you are referring to.

Find the maximum row, then loop through from 1 to the maximum and access the column values as below;

sheet['C1'].value where C is the Column and 1 is the row number

--
You received this message because you are subscribed to the Google Groups "python-excel" group.
To unsubscribe from this group and stop receiving emails from it, send an email to python-excel...@googlegroups.com.
To view this discussion on the web, visit https://groups.google.com/d/msgid/python-excel/a4f48c67-d382-40b8-9925-da314592a5e3%40googlegroups.com.

Kind Regards

Chris

Reply all

Reply to author

Forward

0 new messages