Category Archives: .NET

Get text from PDF file with iTextSharp

string ReadPdfFile(string filename)
{
    var pdfText = new StringBuilder();
    var reader = new PdfReader(filename);
    var pages = reader.NumberOfPages;
    for (int page = 1; page <= pages; page++)
    {
        var tes = new SimpleTextExtractionStrategy();
        var pgText = PdfTextExtractor.GetTextFromPage(reader, page, tes);
        pdfText.Append(pgText);
        …

Lucene.Net Resources

Lucene.Net is a source-code, class-per-class, API-per-API, and algorithmic port of the Java Lucene search engine to C# and the .NET platform, using the Microsoft .NET Framework.

Lucene in Action book, First and Second Edition
Lucene Intro and QueryParser Rules by …
