python re extract smallest substring between two strings [duplicate]

Sorry in advance for a potentially duplicate question. I'm somewhat new to re and can't find an answer.

I have a string something like this:

<foo><bar><biz><Baz><buz>Extract Me!</span><foo><bar>

I want to extract Extract Me!, which is between a > and the only </span> that appears in the string. I tried >(.*)</span>, but that extracts ><bar><biz><Baz><buz>Extract Me!</span>.

Edit:

1:

This question was closed as a duplicate of another question, but making the regular expression >(.*)</span> "non-greedy" by turning it into >(.*?)</span> yields the same result. I had already tried this before posting.
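The lazy quantifier does not help here because re.search still starts the match at the first > in the string; laziness only affects where the match ends. A minimal sketch of one possible workaround, assuming the wanted text never contains < or >, is to use a negated character class so the capture cannot cross a tag boundary (the variable names below are illustrative only):

```python
import re

text = "<foo><bar><biz><Baz><buz>Extract Me!</span><foo><bar>"

# Lazy quantifier: the match still begins at the first '>' in the string,
# so the capture group also picks up the tags that follow it.
lazy = re.search(r">(.*?)</span>", text)

# Negated character class: the captured text may not contain '<' or '>',
# so the match can only begin at the '>' directly before the wanted text.
tight = re.search(r">([^<>]*)</span>", text)

print(lazy.group(1))   # <bar><biz><Baz><buz>Extract Me!
print(tight.group(1))  # Extract Me!
```

If the text between the tags can itself contain angle brackets, this pattern would need adjusting.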

2:

After some discussion, it was recommended that I just use BeautifulSoup, which makes sense. I've solved the issue with the pattern r'(?:>)(\b.*)(<\/span>)' passed to re.search, but I'll provide a bit more code so this can be explored further.

So:

Pulling back the curtain a bit, this is the pseudocode of what I'm working with:

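from bs4 import BeautifulSoup

src = selenium_driver.page_source  # selenium_driver is an already-open Selenium WebDriver
soup = BeautifulSoup(src, 'html.parser')  # name the parser explicitly
list_of_things = soup.find_all(True, {'class': ['list of classes']})
for thing in list_of_things:
    print(type(thing))
    print(thing)
    extract_extractMe()  # <- do stuff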

The result of print(type(thing)) and print(thing) would be something like this:

#type
<class 'bs4.element.Tag'>

#thing
<li class="property-item"><div class="property-text"><span data-spm-anchor-id="a2g0o.detail.1000016.i2.11ab42d1npvRCb">14CM</span></div></li>

I'm trying to extract 14CM from each "thing".
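Since each "thing" is already a bs4.element.Tag, the text can probably be read straight from the parsed tree without a regular expression. The following is only a sketch under the assumption that every item contains a single span holding the value; the html string is copied from the printed output above, and the names html, span and value are made up for illustration:

```python
from bs4 import BeautifulSoup

# Sample markup copied from the printed "thing" above.
html = ('<li class="property-item"><div class="property-text">'
        '<span data-spm-anchor-id="a2g0o.detail.1000016.i2.11ab42d1npvRCb">14CM</span>'
        '</div></li>')

soup = BeautifulSoup(html, "html.parser")
thing = soup.find("li", class_="property-item")

# Find the first <span> inside the item and read its text.
# Guard against items that have no <span> at all.
span = thing.find("span")
value = span.get_text(strip=True) if span is not None else None

print(value)  # 14CM
```

Inside the original loop, the same two lines (thing.find("span") followed by get_text) could stand in for the extract_extractMe() placeholder.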



source https://stackoverflow.com/questions/74892989/python-re-extract-smallest-substring-between-two-strings
