.NET Tutorials, Forums, Interview Questions And Answers

SharePoint Document Parser

Posted Date: August 26, 2010    Points: 0    Category: SharePoint


I have a requirement to parse an email address from a document when it is added to a document library. Unfortunately, the documents are not formatted in any standard fashion, although they are all Word documents, which is an advantage.

I have read that SharePoint has a Word document parser out of the box, so I am wondering whether it can be configured to parse email addresses as metadata. Otherwise, I can write a rudimentary parser myself using regular expressions and plug it into SharePoint, but I have no idea where to start.

Any help would be greatly appreciated.

Thanks in advance,
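
For the regular-expression fallback mentioned in the question, a minimal sketch might look like the following. It is written in Python purely for illustration and is not SharePoint's built-in parser. It assumes the files are .docx packages (a zip archive containing word/document.xml); older binary .doc files would need a different reader. The names docx_text and extract_emails are hypothetical, and the pattern is deliberately simple rather than a full RFC 5322 matcher. Wiring this into a document library (for example from an event receiver) is not shown.

```python
import re
import zipfile
from xml.etree import ElementTree

# Deliberately simple pattern: catches common addresses, not every valid form.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

# XML namespace used by WordprocessingML elements such as <w:p> and <w:t>.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_text(path_or_file):
    """Return the text of a .docx file, one line per paragraph."""
    with zipfile.ZipFile(path_or_file) as archive:
        root = ElementTree.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # Word may split one address across several runs, so join them first.
        paragraphs.append("".join(t.text or "" for t in para.iter(W_NS + "t")))
    return "\n".join(paragraphs)


def extract_emails(text):
    """Return unique addresses (case-insensitive) in order of first appearance."""
    found = []
    seen = set()
    for match in EMAIL_RE.findall(text):
        key = match.lower()
        if key not in seen:
            seen.add(key)
            found.append(match)
    return found
```

Calling extract_emails(docx_text("some-file.docx")) returns a list of addresses that could then be written to a metadata column; the file name here is only a placeholder.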



More Related Resource Links

Working with SharePoint document libraries

Document libraries are collections of files that you can share with team members on a Web based on Microsoft Windows SharePoint Services. For example, you can create a library of common documents for a project, and team members can use their Web browsers to find the files, read them, and make comments. Users with Microsoft Office 2003 can check out and edit the files as if they resided on a local or network drive.

By default, your team Web site comes with a built-in document library named Shared Documents, which is listed on the Quick Launch bar as well as on the Documents and Lists page.

SharePoint Document Workspaces for Developers

Office 2003 is being released this month, and a cast of supporting applications is already available. One of these is Windows SharePoint Services 2.0, the free team add-on for Windows Server 2003. If you're not familiar with SharePoint Services basics, take a look at What Developers Need to Know About Windows SharePoint Services. In this article, I'm going to drill into one key place where SharePoint and Office 2003 overlap: document workspaces.

Upload a File to a SharePoint Document Library - Part I

The following helper class demonstrates a few techniques that allow documents to be uploaded to a SharePoint document library programmatically without using the API or a custom web service. You don't need to specify a document library name, and it will create any folders specified in the URL as required. File metadata will be updated if any properties are passed.
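
One common way to upload without the object model or a custom web service is a plain HTTP PUT to the file's target URL. Whether the linked helper class works this way is an assumption, since its code is not shown here. The sketch below, in Python for illustration, only builds the request; the server name, library URL, and the function name build_upload_request are hypothetical. Authentication (SharePoint sites commonly use NTLM, which the Python standard library does not handle on its own), folder creation, and metadata updates are left out.

```python
from urllib.parse import quote
from urllib.request import Request


def build_upload_request(library_url, file_name, content):
    """Build (but do not send) an HTTP PUT request aimed at the file's URL."""
    # Percent-encode the file name so spaces and similar characters are URL-safe.
    target = library_url.rstrip("/") + "/" + quote(file_name)
    return Request(
        target,
        data=content,
        method="PUT",
        headers={"Content-Type": "application/octet-stream"},
    )


# Hypothetical server and library; nothing is sent until urlopen() is called.
request = build_upload_request(
    "http://server/sites/team/Shared%20Documents",
    "status report.docx",
    b"example bytes",
)
```

Passing the request to urllib.request.urlopen would perform the upload, provided the server accepts the credentials supplied by whatever handler is configured.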

SharePoint document migration challenges when migrating files and folders

There are several challenges when migrating documents to Microsoft SharePoint. While these challenges can be overcome, they are a real pain if the migration source, content and file systems are not SharePoint friendly.

SharePoint: Add a Recycle Bin to Windows SharePoint Services for Easy Document Recovery


Windows SharePoint Services helps improve collaboration and workflow while protecting documents and intellectual property. But one feature is conspicuously missing - an easy way to back up and restore deleted files from document libraries. In this article, the authors take advantage of the extensibility of WSS and its server-side and client-side object models to build a restore feature that works like the Recycle Bin in Windows Explorer.

Maxim V. Karpov and Eric Schoonover

MSDN Magazine February 2005

Approve/Reject a document using Sharepoint web services


Is there a way to approve or reject a file that has been published on SharePoint 2007 using web services only?

Say there is a document 'A.xls' that was published as a major version by the author. I want the admin to be able to approve or reject it from a site outside the SharePoint application. I know there is a workflow involved in this, but is there a web service that exposes the functionality of this workflow?


Can I open the "new" menu in a SharePoint Document Library using Dragon NaturallySpeaking?


When a user, who uses Dragon NaturallySpeaking, says "new" when viewing a Document Library the default content type is selected and the associated application (e.g. Microsoft Word) is launched. This scenario is okay for Content Type selection since the Microsoft Office applications will prompt for the metadata once the file is saved.

If the user needs to create new folders in the Document Library, however, they need to expand the "new" dropdown menu.

What is the best way for someone using Dragon NaturallySpeaking to expand the "new" menu for a Document Library in SharePoint?

Sharepoint 2007 Document Search and Highlight in Office 2007 and PDF


We will be implementing Document Search (DOC, DOCX, PDF, TIFF, PPT, PPTX, XLS, XLSX) for one of our customers. We were evaluating a few MOSS 2007 based solutions, and happened to visit your site.  What we intend to achieve are the following functionalities:


a.       Users should be able to search through thousands of documents in a SharePoint Document Library.

b.      The documents could be in English or in other international languages.

c.       Each document is 70-100 pages, so it is important that the search results show the relevant pages on which the search text appears (highlighted) within each document. Clicking on the link will take the user to the right page.

d.      The additional requirement is Synonym search. So, I can search for

File Attachments from info-path forms to Sharepoint document library


Hi Experts,

How can I upload file attachments from InfoPath forms to a SharePoint document library?

I need to add attachments from any user's local PC to a SharePoint document library.


Any help would be appreciated.




If you have SharePoint Document Versioning Enabled, do you also use the retention settings?

I am looking for stats on how many environments out there use document versioning in their libraries but do not select the retention settings (e.g. the "Optionally limit the number of versions to retain" section). As we all know, this is a major cause of database growth in SharePoint.
Tony Parker, MSCE . MCTP. MCITP "Anything worth doing, is worth doing right"

Video: SharePoint 2010: Advanced Routing and Document Sets

Learn how to work with document sets such as RFPs (Requests for Proposal), which may be made up of multiple documents such as the question document, the answer document, and many other supporting documents and files. (Length: 13:56)

Video: SharePoint 2010 Document Libraries

This demonstration delves into working with document libraries and documents programmatically. (Length: 5:39)

question on attachments/document upload in SharePoint

Scenario: I have a business need to attach some documents to a document library. My users fill out a form, which is a custom List Form in a list called "Request", but on the same form I want to provide a document upload feature that saves the document to a document library called "Request Documents".

In other words, request information such as Title, Department, Description etc get saved to "Request List" (SharePoint List) whereas the documents attached/uploaded (if there are any from the user) while filling the same request form out, get saved in the "Request Documents" (SharePoint Document Library) with the same ID #. If Request ID was 100, then the uploaded document(s) in the Document Library should also be 100.

I tried a few techniques out, and I'm reading some more on Data Source Library of SharePoint but I believe I need some help from the community before I do anything further.

SharePoint makes Office Document custom fields unreadable


We're using WSS 3.0 SP2 with a Document Library that has document-level columns (fields) added. The columns (fields) are defined as single-line text; nothing fancy. In SharePoint we can set the field values with no problem. When we pull the documents back out of SharePoint to the file system and look at the Custom Fields, the values are unreadable gibberish. They look like graphics lines and boxes, and the Type is set to Unknown.

Any ideas why this is happening and how to fix it?

SharePoint Document Libraries and Horrors Oh My

I thought I would capture a few ideas around document libraries and share with the rest of the class. These are not "best practices" as I don't want to sound too preachy so let's call them "pretty good practices that you might want to consider if you have some time" (which is far too long for a blog title, hence the one I came up with is going to have to do).

SharePoint Fight Club

The first rule of SharePoint. I want you to repeat after me. "SharePoint Document Libraries are not file shares." Good. Now say it again. Good. Remember this, above all other things we're going to talk about here, and you'll be golden.

Document Libraries are not File Shares

I have to repeat this here because it is key. If your users are asking you to put a file share into SharePoint then you need to beat them upside the head when they talk about "replicating the folder structure" or "like for like" or "make it just like *that* (and points to file share)". If you really must make some kind of analogy and your audience knows something about databases (even Access), rather than saying document libraries are like file shares say "document libraries are like databases". While SharePoint sits on top of a database it's kind of irrelevant what the backing store is (side note, Dear Microsoft, please make a pluggable store for SharePoint in the next version) a list or document library is pretty

Sharepoint Workflow To Shoot Email Only When Folder Is Created In Document Library

Hi, can anyone help me with the issue below? I have been facing it for a long time and haven't figured out a solution yet. :( I want to send an email through a SharePoint Designer workflow only when a folder is created in a document library (not on a file uploaded/updated event). Please suggest how I can achieve this through SharePoint Designer, as I don't have much coding experience.

The URL 'ao/AO Document/Admin Guideline/xxxxx.jpg' error on 2nd SharePoint server in farm

We just added a second SP2007 server to mirror our site. The original site is in China and the new site is in the US. They are connected via a permanent site-to-site VPN. We are hitting errors when attempting to upload files to the US mirror.

Environment:

US server site:
OS: Windows Server 2008 R2 Enterprise (64 bit)
APP: Windows SharePoint Server 2007 Enterprise SP2 (64 bit)
DB: Windows SQL Server 2008 Enterprise R2 (64 bit)

China site:
OS: Windows Server 2003 Enterprise SP2 (32 bit)
APP: Windows SharePoint Server 2007 Enterprise SP2 (32 bit)
DB: Windows SQL Server 2008 Enterprise R2 (32 bit)

Background: We set up two test SharePoint farms to be synchronized through the content database only. Uploads work fine on the SZ site, but we randomly get the following two error messages when uploading files to a document library on the SF side.

Error messages:

1. The URL 'ao/AO Document/Admin Guideline/xxxxx.jpg' is invalid. It may refer to a nonexistent file or folder, or refer to a valid file or folder that is not in the current Web.
2. Cannot open file "ao/AO Document/Admin Guideline/xxxxx.jpg".

We've tried the official hotfix, altering the schema, changing the IIS memory limit settings, and restarting the application pool. Could the issue be related to the different OSes (2008 vs 2003) or to 32 vs 64 bit?