StrepHit: Wikidata Statements Validation via References

“StrepHit (pronounced ‘strep hit’, means ‘Statement? repherence it!’)[1] is a Natural Language Processing pipeline that harvests structured data from raw text and produces Wikidata statements with reference URLs. Its datasets will feed the primary sources tool.[2]

In this way, we believe StrepHit will dramatically improve the data quality of Wikidata through a reference suggestion mechanism for statement validation, and will help Wikidata to become the gold-standard hub of the Open Data landscape….”
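
To make "Wikidata statements with reference URLs" concrete, here is a minimal sketch of how one such statement could be serialized. It assumes the tab-separated QuickStatements syntax, in which `S854` is the source form of property P854 (reference URL). The class and function names are hypothetical, the example.org URL is a placeholder, and the quoted text does not say which output format StrepHit itself emits.

```python
from dataclasses import dataclass

# QuickStatements writes reference properties with an "S" prefix;
# S854 corresponds to P854 (reference URL).
REFERENCE_URL_SOURCE = "S854"


@dataclass(frozen=True)
class ReferencedStatement:
    """Hypothetical container for one statement plus its supporting URL."""
    subject: str        # item ID, e.g. "Q42"
    prop: str           # property ID, e.g. "P19"
    value: str          # item ID of the value, e.g. "Q350"
    reference_url: str  # web page that supports the statement


def to_quickstatement(statement: ReferencedStatement) -> str:
    """Serialize a statement as one tab-separated QuickStatements line."""
    fields = [
        statement.subject,
        statement.prop,
        statement.value,
        REFERENCE_URL_SOURCE,
        f'"{statement.reference_url}"',  # string values are double-quoted
    ]
    return "\t".join(fields)


# Illustrative only: Q42 (Douglas Adams), P19 (place of birth), Q350 (Cambridge).
example = ReferencedStatement("Q42", "P19", "Q350", "https://example.org/bio")
line = to_quickstatement(example)
```

Each line pairs a claim with the page it was harvested from, which is what lets a reviewer validate the statement against its source before accepting it.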