Questions tagged [named-entity-recognition]

20 questions
4
votes
1 answer

How to understand 'losses' in Spacy's custom NER training engine?

From the tid-bits, I understand of neural networks (NN), the Loss function is the difference between predicted output and expected output of the NN. I am following this tutorial, the losses are included at line #81 in the nlp.update() function. I am…
2
votes
1 answer

Does BERT freeze the entire model body when it does fine-tuning?

Recently, I came across the BERT model. I did some research and tried some implementations. I wanted to tackle a NER task, so I chose the BertForSequenceClassifications provided by HuggingFace. for epoch in range(1, args.epochs + 1): total_loss…
2
votes
1 answer

Which AI techniques are there that combine multiple models to make sense of data at different stages?

I have been working to design a system that uses multiple machine learning models to make sense of data that is dynamically webscraped. Each AI would handle a specific task, for example: An AI model would identify text in an image, then attempt to…
2
votes
0 answers

Extracting "hidden" costs from financial statements using NLP

I'm designing a NLP model to extract various kinds of "hidden" expenses from 10-K and 10-Q financial statements. I've come up with about 7 different expense categories (restructuring costs, merger and acquisitions, etc.) and for each one I have a…
2
votes
1 answer

Is it possible to create a named entity recognition system without using POS tagging in the corpus?

Is it possible to create a named entity recognition system without using POS tagging in the corpus?
2
votes
2 answers

What are the main ideas behind NER?

Named entity recognition (NER), also known as entity chunking/extraction, is a popular technique used in information extraction to identify and segment the named entities and classify or categorize them under various predefined classes. Briefly,…
Pluviophile
  • 1,293
  • 7
  • 20
  • 40
1
vote
0 answers

Can I use Generative AI to reliably extract data (vs. Extractive AI)?

It seems that nowadays lots of people building tools to extract data from different sources (e.g. PDF reports) with the help of models like GPT-4, LLama or Falcon. I was wondering, if this really works, because from what I know, you could train…
T-Dog
  • 11
  • 1
1
vote
1 answer

How to extract product codes within unstructured data

I am working on a project where I have a dataset consisting of unstructured data from multiple ERP systems. Each dataset (extracted from an ERP) has different columns, and unfortunately, there is no standard format for the data. Among the columns,…
delucca
  • 11
  • 4
1
vote
0 answers

How can I use this Reformer to extract entities from a new sentence?

I have been looking at the NER example with Trax in this notebook. However, the notebook only gives an example for training the model. I can't find any examples of how to use this model to extract entities from a new string of text. I've tried the…
1
vote
0 answers

Determining if an entity in free text is 'present' or 'absent'; what is this called in NLP?

I'm processing a semi-structured scientific document and trying to extract some specific concepts. I've actually made quite good progress without machine-learning so far, but I got to a block of true free text and I'm wondering whether a very narrow…
1
vote
0 answers

Are there any good resources (preferably books) about techniques used for entity extraction?

Given some natural language sentences like I would like to talk to Mr. Smith I would like to extract entities, like the person "Smith". I know that frameworks, which are capable of doing so (f. e. RASA or spaCy), exist, but I would like to dive…
0
votes
0 answers

MTL model for find entity names and make corrections

I want to train an MTL model with NER and a seq2seq architecture to identify entity names and correct typos within them. The NER model is word-based, while the seq2seq model operates at the character level. The NER model performs well, but the…
0
votes
1 answer

How to label missing/default values for a named entity recognition dataset

I am building the training dataset for a named entity recognition model, with 2 tags: Name and Category and I am using a pre-trained spaCy model. Given a document, the model needs to extract the name and category of several items. However, the name…
0
votes
1 answer

Is there a way to find a list of associated words to a named entity?

Is there a way to find a list of associated words to a named entity? For instance, let the named entity be FIFA. Now FIFA is a Football Organization and hence related to the term football and all the terms related to football like jersey,…
0
votes
1 answer

Confused about NER evaluation

I'm reading from the book Speech and Language Processing by Dan Jurafsky and James H. Martin and I've stuck for a while trying to understand what the authors mean by the following The fact that named entity tagging has a segmentation component…
1
2