GROUPS: Import and Export, Wolfram Language, Natural Language Processing

Richard Gordon, Retired

Posted 3 years ago

I have written a Mathematica program that reads in titles with EndNote tags, produced using a custom EndNote Style and Word. However, I seem to have to convert the Word file to plain text for Mathematica input. I have about 10,000 titles to scan, and I want to isolate those that have two consecutive words in italics, since those most likely represent Genus species. Is there any way to do this?

POSTED BY: Richard Gordon

5 Replies

Posted 3 years ago

Oh, sorry, I just now realized that you want the titles, not just the binomials. We'll need to figure out how to split the cell contents into separate titles (maybe split on line returns?) and then select the ones that contain a style box satisfying our condition. I'm out of time at the moment, but I'll try to remember to revisit this later.

POSTED BY: Eric Rimbey
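
The split-then-select idea in the reply above can be sketched as follows. This is a minimal illustration in Python rather than Mathematica, and it is not code from the thread. It assumes the styled text has already been extracted into a hypothetical list of (text, is_italic) runs, that a line return separates titles, and that "two consecutive words in italics" means an italic span of exactly two words. The function names and the sample titles are made up for illustration.

```python
# Hypothetical input format: a flat list of (text, is_italic) runs,
# with "\n" marking the end of each title.

def split_titles(runs):
    """Split a flat list of styled runs into one run list per title."""
    titles, current = [], []
    for text, italic in runs:
        for i, part in enumerate(text.split("\n")):
            if i > 0 and current:      # a line return closes the current title
                titles.append(current)
                current = []
            if part:
                current.append((part, italic))
    if current:
        titles.append(current)
    return titles


def italic_spans(title_runs):
    """Merge adjacent italic runs and return the text of each italic span.

    Whitespace-only plain runs between italic runs are treated as part of
    the span, since a word processor may leave the space unstyled.
    """
    spans, buffer = [], []
    for text, italic in title_runs:
        if italic or (buffer and not text.strip()):
            buffer.append(text)
        elif buffer:
            spans.append("".join(buffer))
            buffer = []
    if buffer:
        spans.append("".join(buffer))
    return spans


def has_binomial(title_runs):
    """Assumed condition: some italic span consists of exactly two words."""
    return any(len(span.split()) == 2 for span in italic_spans(title_runs))


def select_titles(runs):
    """Return the plain text of every title that contains an italic word pair."""
    return [
        "".join(text for text, _ in title)
        for title in split_titles(runs)
        if has_binomial(title)
    ]


if __name__ == "__main__":
    # Made-up sample titles; only the first and third contain an italic pair.
    example_runs = [
        ("Growth of ", False), ("Escherichia coli", True), (" in broth\n", False),
        ("A survey of cell shapes\n", False),
        ("Motility in ", False), ("Haloquadratum", True), (" ", False), ("walsbyi", True),
    ]
    for title in select_titles(example_runs):
        print(title)
```

Requiring exactly two words keeps fully italicized titles out of the result. If subspecies or strain names also appear in italics, the condition in `has_binomial` would need loosening.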

Posted 3 years ago

POSTED BY: Eric Rimbey

Posted 3 years ago

Y2022m08d12, Alonsa, Manitoba, Canada

Dear Eric,

I checked, and Mathematica is right about this species. Give me a few days to digest what you've written. By the way, I'm working on:

Gordon, R., Deb, M. and Gordon, N.K. (2023) Origin of Life via Archaea: Shaped Droplets to Archaea First, With a Compendium of Archaea Micrographs [OOLA, Volume in the series Astrobiology Perspectives on Life of the Universe, Eds. Richard Gordon & Joseph Seckbach, in preparation]. Wiley-Scrivener, Beverly, Massachusetts, USA.

Your followup suggests more than programming interest! Thanks.

Yours,
-Richard (Dick) Gordon
DickGordonCan@protonmail.com
RichardGordonCan@xplornet.com
Talk: https://meet.jit.si/DickGordonMeeting (arrange time first by e-mail or holler if I'm on)
http://orcid.org/0000-0003-4970-9953
Canada: 1-(204) 767-2164
http://tinyurl.com/RichardGordonBooks
fertilizer: https://www.youtube.com/watch?v=LMG4kuEN_kM

POSTED BY: Richard Gordon

Posted 3 years ago

POSTED BY: Eric Rimbey

Posted 3 years ago

Dear Eric,

Kind of you to offer! I'm a rank Mathematica amateur. It took me weeks to write a program that created a list of titles, with the words cyclically permuted. I selected some titles with and without italicized Genus species, showing the desired output. I own Mathematica 12.1. Thanks.

Yours,
-Richard (Dick) Gordon

Attachments: Finding Genus sp...docx

POSTED BY: Richard Gordon
