
recon

public
106 stars
2 forks
7 issues

Commits

List of commits on branch main.
Unverified
ecdfa43b40ef74c08fbc7e81e10eae3d74e99855

add dev reqs to git

committed a year ago
Unverified
02618e1dfdd514a3ebf63faa4b358cb7e6f6a514

just compile reqs so they get installed

committed a year ago
Unverified
390d68b1a58e25ee4f76bc8433a4356cf5a5188b

fix gha

committed a year ago
Unverified
4e024f05da6bd4be5b704291af3b7286c5d1fd58

improve gitignore

committed a year ago
Unverified
6323e00ec290ad4cbc5da326711921345437461f

try fix docs

committed a year ago
Verified
476f564eaed3c23938160941b70f1114b288f898

Merge pull request #22 from kabirkhan/hatchling/uv

kabirkhan committed a year ago

README

The README file for this repository.

Recon

Recon NER: debug and correct annotated Named Entity Recognition (NER) data for inconsistencies, and get insights on improving the quality of your data.


Documentation: https://kabirkhan.github.io/recon

Source Code: https://github.com/kabirkhan/recon


Recon is a library to help you fix your annotated NER data and identify examples that are hardest for your model to predict so you can strategically prioritize the examples you annotate.

The key features are:

  • Data Validation and Cleanup: Easily validate the format of your NER data. Filter overlapping entity annotations and fix missing properties.
  • Statistics: Get statistics on your data, from how many annotations you have for each label to more complicated metrics like quality scores for the balance of your dataset.
  • Model Insights: Analyze how well your model does on your dataset. Identify the top errors your model is making so you can prioritize data collection and correction strategically.
  • Dataset Management: Recon provides Dataset and Corpus containers to manage the train/dev/test split of your data and apply the same functions across all splits, as well as across a concatenation of all examples. Operate in place to consistently transform your data, with reliable tracking and the ability to version and roll back changes.
  • Serializable Dataset: Serialize and deserialize your data between JSON and the Recon type system.
  • Type Hints: Comprehensive typing system based on Python 3.7+ type hints.
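
To make the validation, cleanup, and statistics features above more concrete, here is a minimal sketch in plain Python. It does not use Recon's API. The example layout (a dict with "text" and a list of "spans" carrying "start", "end", and "label") and the helper names is_valid_span, filter_overlaps, clean_example, and label_counts are assumptions made for illustration only.

```python
from collections import Counter


def is_valid_span(text, span):
    """Check that a span has the expected keys and lies inside the text."""
    if not all(key in span for key in ("start", "end", "label")):
        return False
    return 0 <= span["start"] < span["end"] <= len(text)


def filter_overlaps(spans):
    """Greedily keep the longest spans and drop any that overlap a kept one."""
    kept = []
    # Sort by negative length so longer spans are considered first.
    for span in sorted(spans, key=lambda s: (s["start"] - s["end"], s["start"])):
        if all(span["end"] <= k["start"] or span["start"] >= k["end"] for k in kept):
            kept.append(span)
    return sorted(kept, key=lambda s: s["start"])


def clean_example(example):
    """Return a copy of the example with invalid and overlapping spans removed."""
    valid = [s for s in example["spans"] if is_valid_span(example["text"], s)]
    return {"text": example["text"], "spans": filter_overlaps(valid)}


def label_counts(examples):
    """Count how many annotations exist for each label."""
    return Counter(s["label"] for ex in examples for s in ex["spans"])


# Hypothetical annotated example (assumed format, not taken from Recon).
example = {
    "text": "Ada Lovelace visited London",
    "spans": [
        {"start": 0, "end": 12, "label": "PERSON"},
        {"start": 4, "end": 12, "label": "PERSON"},  # overlaps the span above
        {"start": 21, "end": 27, "label": "GPE"},
        {"start": 25, "end": 40, "label": "GPE"},  # runs past the end of the text
    ],
}

cleaned = clean_example(example)
counts = label_counts([cleaned])
```

The greedy longest-first rule is only one possible way to resolve overlaps; Recon's own behavior may differ, so check the documentation linked above for the library's actual functions.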

Requirements

Python 3.7+

Installation

$ pip install reconner
---> 100%
Successfully installed reconner
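
One way to confirm the install from Python is to ask the standard library for the installed distribution's version. This is a small sketch, assuming Python 3.8 or newer (where importlib.metadata is available); the helper name installed_version is hypothetical, and the distribution name reconner is taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# "reconner" is the distribution name used in the pip command above.
reconner_version = installed_version("reconner")
if reconner_version is None:
    print("reconner is not installed in this environment")
else:
    print("reconner version:", reconner_version)
```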

License

This project is licensed under the terms of the MIT license.