English NLP foundation

NSD-English-NLP-Foundation

English text processing, structure analysis, discourse analysis, grammar correction, and grounding verification in one integrated microservice stack.

End-to-end English free-response analysis: seven in-house components cooperate behind a single orchestrator. The tokenizer is a 4-engine ensemble, and grammatical error correction (GEC) enforces a strict JSON schema on LLM output.
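To illustrate what strict JSON schema enforcement on GEC output can look like, here is a minimal sketch using only the Python standard library. The schema itself (an `edits` list whose items carry `start`, `end`, `replacement`, and `error_type`) and the function names `parse_gec_response` and `apply_edits` are assumptions made for this example; the actual schema used by NSD-English-GEC-HyperEdit-v1 is not published here.

```python
import json

# Hypothetical edit schema (an assumption): field name -> required Python type.
EDIT_FIELDS = {"start": int, "end": int, "replacement": str, "error_type": str}


def parse_gec_response(raw: str, source: str) -> list:
    """Parse an LLM reply and reject anything that violates the schema."""
    payload = json.loads(raw)  # raises json.JSONDecodeError (a ValueError) on non-JSON
    if not isinstance(payload, dict) or set(payload) != {"edits"}:
        raise ValueError("top level must be an object with exactly one key: 'edits'")
    if not isinstance(payload["edits"], list):
        raise ValueError("'edits' must be a list")
    for edit in payload["edits"]:
        if not isinstance(edit, dict) or set(edit) != set(EDIT_FIELDS):
            raise ValueError(f"each edit needs exactly the fields {sorted(EDIT_FIELDS)}")
        for name, expected in EDIT_FIELDS.items():
            # Exact type check: bool is a subclass of int and must not pass as one.
            if type(edit[name]) is not expected:
                raise ValueError(f"field {name!r} must be of type {expected.__name__}")
        if not 0 <= edit["start"] <= edit["end"] <= len(source):
            raise ValueError("edit span lies outside the source text")
    return payload["edits"]


def apply_edits(source: str, edits: list) -> str:
    """Apply validated edits right to left so earlier offsets stay valid.

    Assumes the edits do not overlap; this sketch does not check for that.
    """
    for edit in sorted(edits, key=lambda e: e["start"], reverse=True):
        source = source[: edit["start"]] + edit["replacement"] + source[edit["end"]:]
    return source
```

Rejecting a malformed reply outright, rather than repairing it, keeps downstream components from ever seeing a partially valid edit list. Whether the production service retries or falls back in that case is not stated in the source.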

Mini-dashboard · indicators only

Verified signals. Trends, comparisons, and status only; some absolute figures are withheld from publication.

7

Components in production

tokenize · structure · discourse · connectors · dictionary · GEC · grounding

4-engine

Tokenizer ensemble

spaCy · Stanza · OpenNLP · Lucene paths
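One common way to combine several tokenizers is to vote on token boundaries, sketched below. How the four engines are actually merged is not described here, so the majority-vote rule is an assumption, and the four regex tokenizers in `ENGINES` are stand-ins that keep the example self-contained. They are not the spaCy, Stanza, OpenNLP, or Lucene APIs.

```python
import re
from collections import Counter


def _regex_engine(pattern):
    """Build a stand-in engine that returns (start, end) character spans."""
    compiled = re.compile(pattern)
    return lambda text: [m.span() for m in compiled.finditer(text)]


# Four hypothetical engines with different splitting behaviour.
ENGINES = [
    _regex_engine(r"\S+"),                      # split on whitespace only
    _regex_engine(r"\w+|[^\w\s]"),              # split off every punctuation mark
    _regex_engine(r"\w+(?:'\w+)?|[^\w\s]"),     # keep contractions together
    _regex_engine(r"\w+(?:[-']\w+)*|[^\w\s]"),  # keep hyphens and contractions
]


def ensemble_tokenize(text, engines=ENGINES, min_votes=None):
    """Keep a token boundary only if at least min_votes engines propose it."""
    if min_votes is None:
        min_votes = len(engines) // 2 + 1  # strict majority (3 of 4)
    votes = Counter()
    for engine in engines:
        boundaries = set()
        for start, end in engine(text):
            boundaries.update((start, end))
        votes.update(boundaries)  # each engine votes once per position
    kept = sorted(pos for pos, count in votes.items() if count >= min_votes)
    # Cut the text at the surviving boundaries and drop whitespace-only pieces.
    segments = (text[a:b] for a, b in zip(kept, kept[1:]))
    return [seg for seg in segments if seg.strip()]
```

With a strict majority, a 2-to-2 split between engines leaves the boundary out, so disputed material stays merged: `ensemble_tokenize("well-known fact")` keeps `well-known` as one token, while passing `min_votes=2` splits it at the hyphen.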

1

Unified orchestrator

single entry per analysis

Components in production

Independent microservices · single orchestrator front

Included NSD products

  • NSD-EnglishTokenizer-Ensemble-v3
  • NSD-EnglishStructureAnalyzer
  • NSD-EnglishDiscourse-Pipeline-v1
  • NSD-EnglishConnector-Analyzer-v1
  • NSD-English-IntDic
  • NSD-English-GEC-HyperEdit-v1
  • NSD-GroundingChecker-v0

7 in-house components · single unified orchestrator.