Sanae Fujita, Ph.D.

Senior Distinguished Researcher
Linguistic Intelligence Research Group
Innovative Communication Lab
NTT Communication Science Laboratories


Contact Information:
tessei.kobayashi.ga (at) hco.ntt.co.jp
2-4 Hikaridai, Seika, Souraku,
Kyoto 619-0237, JAPAN




Research Interests

  • Text Readability Estimation
  • Education Support
  • Word Sense Disambiguation
  • Valency Dictionary


Publication

  • Journal Paper
  • Conference Paper
  • Research Related Activities

Journal Paper

  1. Sanae Fujita, Takashi Hattori, Tessei Kobayashi, Yuko Okumura, Issho Aoyama. "Picture-book search system 'Pitarie' - Finding appropriate books for each child -",
    Journal of Natural Language Processing, Vol. 24, No. 1, pp. 49--73. 2017.
    Online ISSN : 2185-8314, Print ISSN : 1340-7619 (in Japanese)

  2. Yuka Otake, Yuko Okumura, Akihiko Gobara, Kyoko Naka, Fumiya Yonemitsu, Kyoshiro Sasaki, Naomi Watanabe, Takashi Hattori, Sanae Fujita. "Supporting parent-child picture book reading with a search system at a public library,"
    Journal of The Science of Reading, 59(3), pp. 134--148, 2017.
    Online ISSN : 2424-144X, Print ISSN : 0387-284X (in Japanese)

  3. Sanae Fujita, Tessei Kobayashi, Yasuhiro Minami, Hiroaki Sugiyama. "Target Age Estimation of Texts for Children",
    Journal of Japanese Cognitive Science, (to appear). (in Japanese)

  4. Sanae Fujita, Hirotoshi Taira, Tessei Kobayashi, Takaaki Tanaka. "Japanese Morphological Analysis for Picture Books",
    Journal of Natural Language Processing, Vol. 21, No.3, pp. 515--540, 2014. (in Japanese)

  5. Sanae Fujita, Hirotoshi Taira, Masaaki Nagata. "Enriching Dictionaries with Images from the Internet",
    Journal of Natural Language Processing, Vol. 20, No.2, pp. 223--250, 2013. (in Japanese) (Best paper award)

  6. Sanae Fujita, Akinori Fujino. "Word Sense Disambiguation by Combining Labeled Data Expansion and Semi-Supervised Learning Method",
    Transactions on Asian Language Information Processing, Association for Computing Machinery (ACM),
    Volume 12 Issue 2, June 2013. Article No. 7. DOI: 10.1145/2461316.2461319

  7. Sanae Fujita, Kevin Duh, Akinori Fujino, Hirotoshi Taira, Hiroyuki Shindo.
    "Effectiveness of Automatic Expansion of Training data for Japanese Word Sense Disambiguation",
    Journal of Natural Language Processing, Vol. 18, No. 3, pp. 273--292, 2011. (in Japanese)

  8. Sanae Fujita, Francis Bond, Stephan Oepen, Takaaki Tanaka.
    "Exploiting Semantic Information for HPSG Parse Selection",
    Research on Language and Computation, Springer Netherlands.
    Vol. 8, No. 1, pp. 1--22, DOI: 10.1007/s11168-010-9069-7, 2009.

  9. Timothy Baldwin, Su Nam Kim, Francis Bond, Sanae Fujita, David Martinez and Takaaki Tanaka.
    "A Reexamination of MRD-based Word Sense Disambiguation",
    Transactions on Asian Language Information Processing, Association for Computing Machinery (ACM), Vol. 9, No. 4, pp. 1--21, 2010. ISSN: 1530-0226

  10. Sanae Fujita, Francis Bond
    A Method of Creating New Valency Entries
    Machine Translation Journal, Springer Netherlands
    https://www.e-proof.sps.co.in/springer-ny/ja.asp?rfp=eyippfanae , 2008.6.28
    ISSN: 0922-6567 (Print), 1573-0573 (Online)

  11. Francis Bond, Sanae Fujita and Takaaki Tanaka
    The Hinoki Syntactic and Semantic Treebank of Japanese,
    Asian Language Technology: Resources and Processing A Special Double-Issue of Language Resources and Evaluation, Vol. 40, No.3-4, pp. 253-261, 2006

  12. Sanae Fujita, Francis Bond
    "An investigation into the nature of verbal alternations and their use in the creation of bilingual valency entries."
    Journal of Natural Language Processing, Vol. 12, No. 3, pp. 67--89, 2005. (in Japanese)

  13. Francis Bond, Sanae Fujita, Chikara Hashimoto, Kaname Kasahara, Shigeko Nariyama, Eric Nichols, Akira Otani, Takaaki Tanaka, Shigeaki Amano
    "The Hinoki Treebank" A Treebank for Text Understanding
    Natural Language Processing (IJCNLP-04), Lecture Notes in Computer Science,
    Springer Verlag, Vol. 3248, pp. 158--167, 2005.

Conference Paper

  1. Chifumi Nishioka, Sanae Fujita, Takashi Hattori, Tessei Kobayashi, Futoshi Naya, Hiroaki Ogata. "A Picture-Book Recommender System for Extensive Reading on an E-Book System", In The 10th International Learning Analytics & Knowledge Conference (LAK-20),
    2020, 3/23--27.

  2. Naomi Watanabe, Takashi Hattori, Sanae Fujita, Yuko Okumura, Tessei Kobayashi. "Children's storybooks present a wider variety of emotion words than parents' emotion talk: Investigating 5,000 storybooks," In Society for Research in Child Development. Biennial Meeting (SRCD-2019) ,
    2019, 3/21--23.

  3. Sanae Fujita, Tessei Kobayashi, Yuko Okumura and Takashi Hattori.
    "Investigation of the relationship between words in picture books and child vocabulary acquisition: Recommending picture books with suitable readability,"
    In 27th European Early Childhood Education Research Association Annual Conference (EECERA-2017), Bologna, Italy

  4. Megumi Yasuo, Mitsunori Matsushita, Takashi Hattori, Sanae Fujita. "Estimating Story Development Similarity for Picture Books," In Conference on Technologies and Applications of Artificial Intelligence (TAAI), December 1-3, 2017, Taiwan.

  5. Yuko Okumura, Yuka Ohtake, Akihiko Gobara, Kyoshiro Sasaki, Fumiya Yonemitsu, Kyoko Naka, Naomi Watanabe, Sanae Fujita, Takashi Hattori, Yuki Yamada and Tessei Kobayashi.
    "The effect of an intervention assisting mothers with picture book search,"
    In 27th European Early Childhood Education Research Association Annual Conference (EECERA-2017),

  6. Takashi Hattori, Tessei Kobayashi, Sanae Fujita, Yuko Okumura, Kazuo Aoyama, "Pitarie: a system to find picture books that match children's ages and interests",
    In 27th European Early Childhood Education Research Association Annual Conference (EECERA-2017),

  7. Atsuko Saito, Yuka Fujimoto, Tessei Kobayashi, Yuko Okumura, Takashi Hattori, Sanae Fujita and Naomi Watanabe.
    "Selecting appropriate picture books for children: Implementation of 'Pitarie', a search system for picture books at a pre-school teacher preparation program,"
    In 27th European Early Childhood Education Research Association Annual Conference (EECERA-2017),

  8. Sanae Fujita, Tessei Kobayashi, Yuko Okumura and Takashi Hattori.
    "How do words in picture books affect child vocabulary acquisition? - An analysis of large-scaled corpus in Japanese picture books -,"
    In Workshop on Infant Language Development (WILD-2017), Bilbao, Spain

  9. Yuko Okumura, Tessei Kobayashi, Sanae Fujita and Takashi Hattori.
    "Why is shared book reading effective for children's Theory of Mind development? : Frequency analysis of cognitive mental state terms in Japanese picture books."
    In International Conference of Language Acquisition (ICLA-2016), Spain.

  10. Sanae Fujita and Akinori Fujino.
    Word Sense Disambiguation by Combining Labeled Data Expansion and Semi-Supervised Learning Method,
    In The 5th International Joint Conference on Natural Language Processing (IJCNLP-2011), 2011

  11. Sanae Fujita and Masaaki Nagata.
    Enriching Dictionaries with Images from the Internet - Targeting Wikipedia and a Japanese Semantic Lexicon: Lexeed -,
    In The 23rd International Conference on Computational Linguistics (Coling-2010), pp. 331--339, 2010.

  12. Sanae Fujita, Kevin Duh, Akinori Fujino, Hirotoshi Taira and Hiroyuki Shindo.
    MSS: Investigating the Effectiveness of Domain Combinations and Topic Features for Word Sense Disambiguation.
    In the 5th International Workshop on Semantic Evaluation (SemEval-2010), pp. 383--386, 2010, Uppsala, Sweden.

  13. Hirotoshi Taira, Sanae Fujita, Masaaki Nagata.
    Predicate Argument Structure Analysis using Transformation based Learning,
    In the 48th Annual Meeting of the Association for Computational Linguistics (ACL-2010), pp. 2010, Uppsala, Sweden.

  14. Hirotoshi Taira, Sanae Fujita, and Masaaki Nagata. A Japanese Predicate Argument Structure Analysis using Decision Lists. In Empirical Methods in Natural Language Processing Conference on Computational Natural Language Learning (EMNLP-2008), pp. 522--531, 2008.

  15. Timothy Baldwin, Su Nam Kim, Francis Bond, Sanae Fujita, David Martinez and Takaaki Tanaka.
    MRD-based Word Sense Disambiguation: Further Extending Lesk
    The Third International Joint Conference on Natural Language Processing (IJCNLP-2008), Hyderabad, India, pp. 775--780

  16. Sanae Fujita, Francis Bond, Stephan Oepen and Takaaki Tanaka.
    Exploiting Semantic Information for HPSG Parse Selection. ACL 2007 Workshop on Deep Linguistic Processing, June, Association for Computational Linguistics, Prague, Czech Republic, pp.25--32,

  17. Takaaki Tanaka, Francis Bond, Timothy Baldwin, Sanae Fujita and Chikara Hashimoto.
    Word Sense Disambiguation Incorporating Lexical and Structural Semantic Information. Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp.477--485,

  18. Sanae Fujita, Takaaki Tanaka, Francis Bond, and Hiromi Nakaiwa.
    An implemented description of Japanese: The Lexeed dictionary and the Hinoki treebank.
    In 44th Annual Meeting of the Association for Computational Linguistics and 21st International Conference on Computational Linguistics (COLING/ACL-2006) Interactive Presentation,
    Sydney, Australia, 2006, 7/17--21, pp.65--68,

  19. Eric Nichols, Francis Bond, Takaaki Tanaka, Sanae Fujita and Dan Flickinger
    Multilingual Ontology Acquisition from Multiple MRDs.
    In 2nd Workshop on Ontology Learning and Population (OLP2),
    Sydney, Australia, 2006, 7/22, pp. 10--17,

  20. Takaaki Tanaka, Francis Bond and Sanae Fujita
    The Hinoki Sensebank -- A Large-Scale Word Sense Tagged Corpus of Japanese --.
    A Merged Workshop with 7th International Workshop on Linguistically Interpreted Corpora (LINC-2006) and Frontiers in Corpus Annotation III,
    Sydney, Australia, 2006, 7/22

  21. Takaaki Tanaka and Francis Bond and Stephan Oepen and Sanae Fujita
    High Precision Treebanking -- Blazing Useful Trees Using POS Information
    the Association for Computational Linguistics 43rd Annual Meeting (ACL-2005),
    Michigan, USA, 2005, 6/25--6/30, pp. 62--69,

  22. Sanae Fujita, Francis Bond
    An Automatic Method of Creating New Valency Entries using Plain Bilingual Dictionaries
    the 10th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI-2004),
    Baltimore, MD, USA, 2004, 10/4--10/6

  23. Francis Bond, Eric Nichols, Sanae Fujita, Takaaki Tanaka
    Acquiring an Ontology for a Fundamental Vocabulary
    the 20th International Conference on Computational Linguistics (COLING-2004),
    Geneva, Switzerland, 2004, 8/23--8/29

  24. Sanae Fujita, Francis Bond
    A Method of Creating New Bilingual Valency Entries using Alternation
    the 20th International Conference on Computational Linguistics (COLING-2004),
    Workshop, Multilingual Linguistic Resources (MLR-2004),
    Geneva, Switzerland, 2004, 8/23--8/29

  25. Francis Bond, Sanae Fujita, Chikara Hashimoto, Kaname Kasahara, Shigeko Nariyama, Eric Nichols, Akira Otani, Takaaki Tanaka, Shigeaki Amano
    The Hinoki Treebank: Working Toward Text Understanding
    the 20th International Conference on Computational Linguistics (COLING-2004),
    the 5th International Workshop on Linguistically Interpreted Corpora (LINC-04),
    Geneva, Switzerland, 2004, 8/23--8/29

  26. Francis Bond, Sanae Fujita, Chikara Hashimoto, Kaname Kasahara, Shigeko Nariyama, Eric Nichols, Akira Otani, Takaaki Tanaka, Shigeaki Amano
    "The Hinoki Treebank" A Treebank for Text Understanding
    The 1st International Joint Conference on Natural Language Processing (IJCNLP-04)
    Sanya City, Hainan Island, China, 2004, 03/22--03/24

  27. Francis Bond, Sanae Fujita
    Evaluation of a Method of Creating New Valency Entries.
    Machine Translation Summit IX (MT Summit-2003), New Orleans, Louisiana, USA, 2003, 9/23--9/27, pp.16--23

  28. Sanae Fujita, Francis Bond
    Extending the Coverage of a Valency Dictionary.
    the 19th International Conference on Computational Linguistics (COLING-2002), Taipei, Taiwan, 2002, 8/24--9/1
    Workshop, Machine Translation in Asia

  29. Sanae Fujita, Francis Bond
    A Method of Adding New Entries to a Valency Dictionary by Exploiting Existing Lexical Resources.
    The 9th International Conference on Theoretical and Methodological Issues in Machine Translation, Keihanna, Japan, 2002, 3/13--3/17

Research Related Activities

  1. 2020: NLP-2020 Program Committee
  2. 2016: IJCAI Program Committee
  3. 2008: Local Committee: Deep Linguistic Processing with HPSG (DELPH-IN) Summit
  4. 2008: Local Committee: The 15th International Conference on Head-Driven Phrase Structure Grammar (HPSG 2008)
  5. 2007: Program Committee: PACLING 2007: Conference of the Pacific Association for Computational Linguistics
  6. 2006: Referee: Language Resources and Evaluation
  7. 2004: Program Committee: MLR-2004: PostCOLING Workshop on Multilingual Linguistic Resources
  8. 2003: Referee: MT Summit IX, Journal of the Information Processing Society of Japan (ad hoc)
  9. 2002: Local Committee: the conference on Theoretical and Methodological Issues in MT (TMI-2002), 2001.12.12--2002.4.1